FEATURE-SELECTION USING THE KALMAN FILTER FOR CLASSIFICATION OF MULTIVARIATE DATA

Citation
W. Wu et al., FEATURE-SELECTION USING THE KALMAN FILTER FOR CLASSIFICATION OF MULTIVARIATE DATA, Analytica chimica acta, 335(1-2), 1996, pp. 11-22
Citations number
20
Categorie Soggetti
Chemistry Analytical
Journal title
ISSN journal
00032670
Volume
335
Issue
1-2
Year of publication
1996
Pages
11 - 22
Database
ISI
SICI code
0003-2670(1996)335:1-2<11:FUTKFF>2.0.ZU;2-4
Abstract
A Kalman filter is developed as a feature selection method and classif ier for multivariate data. Three near-infrared (NIR) data sets and a p ollution data set are analyzed. For the two most difficult data sets ( data sets 1 and 3), the Kalman filter successfully selects the wavelen gths which lead to very good results with a correct classification rat e (CCR) equal to one. These results are much better than the best resu lts obtained from regularized discriminant analysis (RDA) using Fourie r transform Fl, principal component regression (PCA) and univariate fe ature selection methods as the variable reduction methods. For the sec ond data set which consists of more than two classes, the Kalman filte r gives similar results (CCR=1) to those of RDA. For the pollution dat a set (data set 4), the Kalman filter gives similar results to partial least-squares (PLS) using fewer variables. The disadvantage of the Ka lman filter is that it needs more memory and more computing time than PLS. The potential hazards of overfitting have been considered.