Estimates of cross-validity for stepwise regression and with predictor selection

Citation
N. Schmitt et Re. Ployhart, Estimates of cross-validity for stepwise regression and with predictor selection, J APPL PSYC, 84(1), 1999, pp. 50-57
Citations number
25
Categorie Soggetti
Psycology
Journal title
JOURNAL OF APPLIED PSYCHOLOGY
ISSN journal
00219010 → ACNP
Volume
84
Issue
1
Year of publication
1999
Pages
50 - 57
Database
ISI
SICI code
0021-9010(199902)84:1<50:EOCFSR>2.0.ZU;2-2
Abstract
The effects of preselection of predictors (e.g., stepwise regression) on fo rmula estimates of cross-validity were examined. Three actual data sets wer e used to generate populations of varying sample size, population validity, and number of predictors. No formula estimate provided an unbiased estimat e of the population cross-validity, although some formula estimates were le ss biased than others. More important, having an adequate sample size (rela tive to number of predictors) was the issue most affecting the utility of t he formula estimates. Another conclusion was that adjusted R-2 provided by at least some popular software programs can provide gross overestimates of cross-validity and should not be used as such.