The Sense and Non-Sense of Holdout Sample Validation in the Presence of Endogeneity

Citation
Ebbes, Peter et al., The Sense and Non-Sense of Holdout Sample Validation in the Presence of Endogeneity, Marketing science , 30(6), 2011, pp. 1115-1122
Journal title
ISSN journal
07322399
Volume
30
Issue
6
Year of publication
2011
Pages
1115 - 1122
Database
ACNP
SICI code
Abstract
Market response models based on field-generated data, need to address potential endogeneity in the regressors to obtain consistent parameter estimates. Another requirement is that market response models predict well in a holdout sample. With both requirements combined, it may seem reasonable to subject an endogeneitycorrected model to a holdout prediction task, and this is quite common in the academic marketing literature. One may be inclined to expect that the consistent parameter estimates obtained via instrumental variables (IV) estimation predict better than the biased ordinary least squares (OLS) estimates. This paper shows that this expectation is incorrect. That is, if the holdout sample is similar to the estimation sample so that the regressors are endogenous in both samples, holdout sample validation favors regression estimates that are not corrected for endogeneity (i. e., OLS) over estimates that are corrected for endogeneity (i. e., IV estimation). We also discuss ways in which holdout samples may be used sensibly in the presence of endogeneity. A key takeaway is that if consistent parameter estimates are the primary model objective, the model should be validated with an exogenous (rather than endogenous) holdout sample.