R. Bro et al., On the difference between low-rank and subspace approximation: improved model for multi-linear PLS regression, CHEM INTELL, 58(1), 2001, pp. 3-13
While both Tucker3 and PARAFAC models can be viewed as latent variable mode
ls extending principal component analysis (PCA) to multi-way data, most fun
damental properties of PCA do not extend to both models. This has practical
importance, which will be explained in this paper. The fundamental differe
nce between the PARAFAC and the Tucker3 model can be viewed as the differen
ce between so-called low-rank and subspace approximation of the data. This
insight is used to pose a modification of the multi-linear partial least sq
uares regression (N-PLS) model. The modification is found by exploiting the
basic properties of PLS and of multi-way models. Compared to the current p
revalent implementation of N-PLS, the new model provides a more reasonable
fit to the independent data and exactly the same predictions of the depende
nt variables. Thus, the reason for introducing this improved model is not t
o obtain better predictions, but rather the aim is to improve the secondary
aspect of PLS: the modeling of the independent variables. The original ver
sion of N-PLS has some built-in problems that are easily circumvented with
the modification suggested here. This is of importance, for example, in pro
cess monitoring, outlier detection and also, implicitly, for jackknifing of
model parameters. Some examples are provided to illustrate some of these p
oints. (C) 2001 Elsevier Science B.V. All rights reserved.