IMPROVED VARIABLE SELECTION PROCEDURE FOR MULTIVARIATE LINEAR-REGRESSION

Authors
Citation
Ad. Walmsley, IMPROVED VARIABLE SELECTION PROCEDURE FOR MULTIVARIATE LINEAR-REGRESSION, Analytica chimica acta, 354(1-3), 1997, pp. 225-232
Citations number
7
Journal title
ISSN journal
00032670
Volume
354
Issue
1-3
Year of publication
1997
Pages
225 - 232
Database
ISI
SICI code
0003-2670(1997)354:1-3<225:IVSPFM>2.0.ZU;2-U
Abstract
This paper reports the development of an improved variable selection p rocedure for Multivariate Linear Regression (MLR). The procedure has b een compared to the more commonly applied techniques of Principle Comp onent Regression (PCR) and Partial Least Squares Regression (PLS) and was found to outperform both techniques in terms of prediction ability of a previously unseen sample when tested using three data sets (two UV and one FT-IR data set). The technique described will illustrate th at many of the shortcomings of the MLR method can be overcome by optim izing the selection of variables specifically for prediction, rather t han the ability to model the training data. The paper also demonstrate s that a very small calibration set consisting of the pure components only can be used to produce a good model for prediction. The procedure is iterative, and as such there are many possible combinations of var iables which can be found, this paper will demonstrate that the approa ch will reach an optimum quickly, and give a stable answer even if the training time is short. The procedure is however more computationally time consuming than PCR and PLS but as data collection is by far the most time consuming aspect, it is not considered to be a serious probl em. (C) 1997 Elsevier Science B.V.