Unsupervised forward selection: A method for eliminating redundant variables

Citation
Dc. Whitley et al., Unsupervised forward selection: A method for eliminating redundant variables, J CHEM INF, 40(5), 2000, pp. 1160-1168
Citations number
32
Categorie Soggetti
Chemistry
Journal title
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES
ISSN journal
00952338 → ACNP
Volume
40
Issue
5
Year of publication
2000
Pages
1160 - 1168
Database
ISI
SICI code
0095-2338(200009/10)40:5<1160:UFSAMF>2.0.ZU;2-0
Abstract
An unsupervised learning method is proposed for variable selection and its performance assessed using three typical QSAR data sets; The aims of this p rocedure are to generate a subset Of descriptors from any given data set in which the resultant variables are relevant,redundancy is eliminated, and m ulticollinearity is reduced. Continuum regression, an algorithm encompassin g ordinary least squares regression, regression on principal components, an d partial least squares regression, was used to construct models from the s elected variables: The variable selection routine is shown to produce simpl e, robust, and easily interpreted models for the chosen data sets.