ASSESSING INFLUENCE IN VARIABLE SELECTION-PROBLEMS

Authors
Citation
C. Leger et N. Altman, ASSESSING INFLUENCE IN VARIABLE SELECTION-PROBLEMS, Journal of the American Statistical Association, 88(422), 1993, pp. 547-556
Citations number
19
Categorie Soggetti
Statistic & Probability","Statistic & Probability
Volume
88
Issue
422
Year of publication
1993
Pages
547 - 556
Database
ISI
SICI code
Abstract
Variable selection techniques are often used in combination with multi ple linear regression to produce a parsimonious model that fits the da ta well. It is clearly undesirable for the final model to depend stron gly on the inclusion of a few influential cases in the data set. This article discusses a measure of influence of single cases on the final model. based on a similar measure used in ordinary multiple regression . When variables are selected objectively, deletion of individual case s can strongly affect the choice of model. The influence of individual cases on the parameters of the selected model are often assessed as p art of the model building process. However, such conditional measures fail to evaluate the influence of the cases on the variable selection process. Modern computing environments make it feasible to use an unco nditional criterion to determine the influence of each case on the sel ection procedure. A number of examples are discussed to illustrate the differences between these approaches. Heuristics are developed to exp lain the examples. We conclude that, although the conditional approach gives valuable information about the selected model, the use of the u nconditional approach can lead to greater insight about the influence of individual observations on the process of model selection.