A PLS KERNEL ALGORITHM FOR DATA SETS WITH MANY VARIABLES AND FEWER OBJECTS .1. THEORY AND ALGORITHM

Citation
S. Rannar et al., A PLS KERNEL ALGORITHM FOR DATA SETS WITH MANY VARIABLES AND FEWER OBJECTS .1. THEORY AND ALGORITHM, Journal of chemometrics, 8(2), 1994, pp. 111-125
Citations number
24
Categorie Soggetti
Chemistry Analytical","Statistic & Probability
Journal title
ISSN journal
08869383
Volume
8
Issue
2
Year of publication
1994
Pages
111 - 125
Database
ISI
SICI code
0886-9383(1994)8:2<111:APKAFD>2.0.ZU;2-6
Abstract
A fast PLS regression algorithm dealing with large data matrices with many variables (K) and fewer objects (N) is presented. For such data m atrices the classical algorithm is computer-intensive and memory-deman ding. Recently, Lindgren et al. (J. Chemometrics, 7, 45-49 (1993)) dev eloped a quick and efficient kernel algorithm for the case with many o bjects and few variables. The present paper is focused on the opposite case, i.e. many variables and fewer objects. A kernel algorithm is pr esented based on eigenvectors to the 'kernel' matrix XX(T)YY(T), which is a square, non-symmetric matrix of size N x N, where N is the numbe r of objects. Using the kernel matrix and the association matrices XX( T) (N x N) and YY(T) (N x N), it is possible to calculate all score an d loading vectors and hence conduct a complete PLS regression includin g diagnostics such as R2. This is done without returning to the origin al data matrices X and Y. The algorithm is presented in equation form, with proofs of some new properties and as MATLAB code.