Decision criteria for soft independent modelling of class analogy applied to near infrared data

Citation
R. De Maesschalck et al., Decision criteria for soft independent modelling of class analogy applied to near infrared data, CHEM INTELL, 47(1), 1999, pp. 65-77
Citations number
25
Categorie Soggetti
Spectroscopy /Instrumentation/Analytical Sciences
Journal title
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
ISSN journal
01697439 → ACNP
Volume
47
Issue
1
Year of publication
1999
Pages
65 - 77
Database
ISI
SICI code
0169-7439(19990419)47:1<65:DCFSIM>2.0.ZU;2-9
Abstract
SIMCA (soft independent modelling of class analogy) is a well known pattern recognition method which describes each class separately in a principal co mponents (PC) space. New objects are considered to belong to the class if t heir Euclidean distance towards the constructed PC space is not significant ly larger than the Euclidean distance of the class objects towards their PC space. The large number of wrongly rejected objects (alpha-error), which i s a known problem of SIMCA, was examined. Using scores, predicted by leave- one-out cross-validation, instead of the original scores, obtained after PC A on the class objects, to compute the distance towards the class model cle arly improves the decision criterion of the original SIMCA for a data set c onsisting of near infrared (NIR) spectra of tablets, The original SIMCA and modifications using different distance measures as defined by Hawkins (Mah alanobis distance) and Gnanadesikan were compared with respect to classific ation and their robustness towards the number of PCs selected to describe t he different classes. SIMCA modified with the Mahalanobis distance was foun d to be a good alternative of the original SIMCA which, for the presented N IR data set, seems to be more robust for finding outliers when the exact nu mber of PCs to build the model is not known. (C) 1999 Elsevier Science B.V. All rights reserved.