Elimination of the uninformative calibration sample subset in the modifiedUVE (Uninformative Variable Elimination)-PLS (Partial Least Squares) method

Citation
J. Koshoubu et al., Elimination of the uninformative calibration sample subset in the modifiedUVE (Uninformative Variable Elimination)-PLS (Partial Least Squares) method, ANAL SCI, 17(2), 2001, pp. 319-322
Citations number
6
Categorie Soggetti
Spectroscopy /Instrumentation/Analytical Sciences
Journal title
ANALYTICAL SCIENCES
ISSN journal
09106340 → ACNP
Volume
17
Issue
2
Year of publication
2001
Pages
319 - 322
Database
ISI
SICI code
0910-6340(200102)17:2<319:EOTUCS>2.0.ZU;2-Z
Abstract
In order to increase the predictive ability of the PLS (Partial Least Squar es) model, we have developed a new algorithm, by which uninformative sample s which cannot contribute to the model very much are eliminated from a cali bration data set. In the proposed algorithm, uninformative wavelength (or i ndependent) variables are eliminated at the first stage by using the modifi ed UVE (Uninformative Variable Elimination)-PLS method that we reported pre viously. Then, if the prediction error of the ith (1 less than or equal to/ less than or equal ton) sample is larger than 3 sigma, the corresponding sa mple is eliminated as uninformative, where n is the total number of calibra tion samples and sigma is the standard deviation calculated from the other n-1 samples. Calculation of sigma by the leave-one-out manner enhances the ability to identify the uninformative samples. The final PLS model is const ructed precisely because both uninformative wavelength variables and uninfo rmative samples are eliminated. In order to demonstrate the usefulness of t he algorithm, we have applied it to two kinds of mid-infrared spectral data sets.