On the difference between low-rank and subspace approximation: improved model for multi-linear PLS regression

Citation
R. Bro et al., On the difference between low-rank and subspace approximation: improved model for multi-linear PLS regression, CHEM INTELL, 58(1), 2001, pp. 3-13
Citations number
33
Categorie Soggetti
Spectroscopy /Instrumentation/Analytical Sciences
Journal title
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
ISSN journal
01697439 → ACNP
Volume
58
Issue
1
Year of publication
2001
Pages
3 - 13
Database
ISI
SICI code
0169-7439(20010928)58:1<3:OTDBLA>2.0.ZU;2-Q
Abstract
While both Tucker3 and PARAFAC models can be viewed as latent variable mode ls extending principal component analysis (PCA) to multi-way data, most fun damental properties of PCA do not extend to both models. This has practical importance, which will be explained in this paper. The fundamental differe nce between the PARAFAC and the Tucker3 model can be viewed as the differen ce between so-called low-rank and subspace approximation of the data. This insight is used to pose a modification of the multi-linear partial least sq uares regression (N-PLS) model. The modification is found by exploiting the basic properties of PLS and of multi-way models. Compared to the current p revalent implementation of N-PLS, the new model provides a more reasonable fit to the independent data and exactly the same predictions of the depende nt variables. Thus, the reason for introducing this improved model is not t o obtain better predictions, but rather the aim is to improve the secondary aspect of PLS: the modeling of the independent variables. The original ver sion of N-PLS has some built-in problems that are easily circumvented with the modification suggested here. This is of importance, for example, in pro cess monitoring, outlier detection and also, implicitly, for jackknifing of model parameters. Some examples are provided to illustrate some of these p oints. (C) 2001 Elsevier Science B.V. All rights reserved.