Feature selection in sequential projection pursuit

Citation
Q. Guo et al., Feature selection in sequential projection pursuit, ANALYT CHIM, 446(1-2), 2001, pp. 85-96
Citations number
21
Categorie Soggetti
Spectroscopy /Instrumentation/Analytical Sciences
Journal title
ANALYTICA CHIMICA ACTA
ISSN journal
00032670 → ACNP
Volume
446
Issue
1-2
Year of publication
2001
Pages
85 - 96
Database
ISI
SICI code
0003-2670(20011119)446:1-2<85:FSISPP>2.0.ZU;2-H
Abstract
A feature selection method is proposed to select a subset of variables in s equential projection pursuit (SPP) analysis in order to preserve as much sa mple clustering information as possible. The inhomogeneity of the complete data is explored by SPP, and the retained inhomogeneity information of a ca ndidate subset is measured by means of the percentage of consensus in gener alised procrustes analysis. The best subset is obtained by applying a genet ic algorithm (GA) which optimises the consensus between the subset and the complete data set. An improved algorithm is proposed which enables analysis of high-dimensional data. The method was studied on three high-dimensional industrial data sets. The results show that the proposed method successful ly identified inhomogeneity-bearing variables and leads to better subsets o f variables than the other studied feature selection methods in preserving interesting clustering information. (C) 2001 Elsevier Science B.V. All righ ts reserved.