Genetic algorithms applied to the selection of factors in principal component regression

Citation
U. Depczynski et al., Genetic algorithms applied to the selection of factors in principal component regression, ANALYT CHIM, 420(2), 2000, pp. 217-227
Citations number
24
Categorie Soggetti
Spectroscopy /Instrumentation/Analytical Sciences
Journal title
ANALYTICA CHIMICA ACTA
ISSN journal
00032670 → ACNP
Volume
420
Issue
2
Year of publication
2000
Pages
217 - 227
Database
ISI
SICI code
0003-2670(20000914)420:2<217:GAATTS>2.0.ZU;2-V
Abstract
Using principal component regression (PCR) as a multivariate calibration to ol, always brings up the question what subset of factors, i.e. principal co mponents (PCs) gives the best calibration model. Normally factor selection is based on deterministic methods like top-down procedures, forward-backwar d-stepwise variable selection or correlated principal component regression (CPCR). In contrast to this, we applied a stochastic method, i.e. a genetic algorithm (GA) for factor selection in this paper. A new kind of fitness f unction was applied which combined the prediction error of the calibration and an independent validation set, The performance of eigenvalue and correl ation ranking was compared. A general statistical criterion for judging the significance of differences between individual calibration models is intro duced. In this context it could be shown that for the uncertainties of the standard deviations representing the prediction errors a very simple approx imation formula holds which only includes the number of standards, For the current applications it is shown that the GA gives a result very close ro C PCR-solutions. (C) 2000 Elsevier Science B.V. All rights reserved.