Selection criteria for scatterplot smoothers

Authors
Citation
B. Efron, Selection criteria for scatterplot smoothers, ANN STATIST, 29(2), 2001, pp. 470-504
Citations number
23
Categorie Soggetti
Mathematics
Journal title
ANNALS OF STATISTICS
ISSN journal
00905364 → ACNP
Volume
29
Issue
2
Year of publication
2001
Pages
470 - 504
Database
ISI
SICI code
0090-5364(200104)29:2<470:SCFSS>2.0.ZU;2-H
Abstract
Scatterplot smoothers estimate a regression function y = f(x) by local aver aging of the observed data points (x(i), y(i)). In using a smoother, the st atistician must choose a "window width," a crucial smoothing parameter that says just how locally the averaging is done. This paper concerns the data- based choice of a smoothing parameter for splinelike smoothers, focusing on the comparison of two popular methods, C-p and generalized maximum likelih ood. The latter is the MLE within a normal-theory empirical Bayes model. We show that C-p is also maximum likelihood within a closely related nonnorma l family, both methods being examples of a class of selection criteria, Eac h member of the class is the MLE within its own one-parameter curved expone ntial family. Exponential family theory facilitates a finite-sample nonasym ptotic comparison of the criteria. In particular it explains the eccentric behavior of C-p, which even in favorable circumstances can easily select sm all window widths and wiggly estimates of f(x). The theory leads to simple geometric pictures of both C-p and MLE that are valid whether or not one be lieves in the probability models.