ITA
ENG

Estimates of cross-validity for stepwise regression and with predictor selection

Authors

Schmitt, N Ployhart, RE

Citation

N. Schmitt et Re. Ployhart, Estimates of cross-validity for stepwise regression and with predictor selection, J APPL PSYC, 84(1), 1999, pp. 50-57

Citations number

Categorie Soggetti

Psycology

Journal title

JOURNAL OF APPLIED PSYCHOLOGY

ISSN journal

00219010 → ACNP

Volume

Issue

Year of publication

1999

Pages

50 - 57

Database

ISI

SICI code

0021-9010(199902)84:1<50:EOCFSR>2.0.ZU;2-2

Abstract

The effects of preselection of predictors (e.g., stepwise regression) on fo rmula estimates of cross-validity were examined. Three actual data sets wer e used to generate populations of varying sample size, population validity, and number of predictors. No formula estimate provided an unbiased estimat e of the population cross-validity, although some formula estimates were le ss biased than others. More important, having an adequate sample size (rela tive to number of predictors) was the issue most affecting the utility of t he formula estimates. Another conclusion was that adjusted R-2 provided by at least some popular software programs can provide gross overestimates of cross-validity and should not be used as such.