GENETIC ALGORITHM-BASED METHOD FOR SELECTING WAVELENGTHS AND MODEL SIZE FOR USE WITH PARTIAL LEAST-SQUARES REGRESSION - APPLICATION TO NEAR-INFRARED SPECTROSCOPY

Citation
As. Bangalore et al., GENETIC ALGORITHM-BASED METHOD FOR SELECTING WAVELENGTHS AND MODEL SIZE FOR USE WITH PARTIAL LEAST-SQUARES REGRESSION - APPLICATION TO NEAR-INFRARED SPECTROSCOPY, Analytical chemistry, 68(23), 1996, pp. 4200-4212
Citations number
34
Categorie Soggetti
Chemistry Analytical
Journal title
ISSN journal
00032700
Volume
68
Issue
23
Year of publication
1996
Pages
4200 - 4212
Database
ISI
SICI code
0003-2700(1996)68:23<4200:GAMFSW>2.0.ZU;2-V
Abstract
Genetic algorithms (GAs) are used to implement an automated wavelength selection procedure for use in budding multivariate calibration model s based on partial least-squares regression. The method also allows th e number of latent variables used in constructing the calibration mode ls to be optimized along with the selection of the wavelengths. The da ta used to test this methodology are derived from the determination of aqueous organic species by near-infrared spectroscopy. The three data sets employed focus on the determination of (1) methyl isobutyl keton e in water over the range of 1-160 ppm, (2) physiological levels of gl ucose in a phosphate buffer matrix containing bovine serum albumin and triacetin, and (3) glucose in a human serum matrix. These data sets f eature analyte signals near the limit of detection and the presence of significant spectral interferences. Studies are performed to characte rize the signal and noise characteristics of the spectral data, and op timal configurations for the GA are found for each data set through ex perimental design techniques. Despite the complexity of the spectral d ata, the GA procedure is found to perform web, leading to calibration models that significantly outperform those based on full spectrum anal yses. In addition, a significant reduction in the number of spectral p oints required to build the models is realized.