CONSISTENCY OF DATA-DRIVEN HISTOGRAM METHODS FOR DENSITY-ESTIMATION AND CLASSIFICATION

Authors
Citation
G. Lugosi et A. Nobel, CONSISTENCY OF DATA-DRIVEN HISTOGRAM METHODS FOR DENSITY-ESTIMATION AND CLASSIFICATION, Annals of statistics, 24(2), 1996, pp. 687-706
Citations number
30
Categorie Soggetti
Statistic & Probability","Statistic & Probability
Journal title
ISSN journal
00905364
Volume
24
Issue
2
Year of publication
1996
Pages
687 - 706
Database
ISI
SICI code
0090-5364(1996)24:2<687:CODHMF>2.0.ZU;2-U
Abstract
We present general sufficient conditions for the almost sure L(1)-cons istency of histogram density estimates based on data-dependent partiti ons. Analogous conditions guarantee the almost-sure risk consistency o f histogram classification schemes based on data-dependent partitions. Multivariate data are considered throughout. In each case, the desire d consistency requires shrinking cells, subexponential growth of a com binatorial complexity measure and sublinear growth of the number of ce lls. It is not required that the cells of every partition be rectangle s with sides parallel to the coordinate axis or that each cell contain a minimum number of points. No assumptions are made concerning the co mmon distribution of the training vectors. We apply the results to est ablish the consistency of several known partitioning estimates, includ ing the k(n)-spacing density estimate, classifiers based on statistica lly equivalent blocks and classifiers based on multivariate clustering schemes.