Overcoming multicollinearity in near infrared analysis for lycopene content estimation in tomatoes by using ridge regression

Citation
H. Pasternak et al., Overcoming multicollinearity in near infrared analysis for lycopene content estimation in tomatoes by using ridge regression, J TEST EVAL, 29(1), 2001, pp. 60-66
Citations number
22
Categorie Soggetti
Material Science & Engineering
Journal title
JOURNAL OF TESTING AND EVALUATION
ISSN journal
00903973 → ACNP
Volume
29
Issue
1
Year of publication
2001
Pages
60 - 66
Database
ISI
SICI code
0090-3973(200101)29:1<60:OMINIA>2.0.ZU;2-3
Abstract
High intercorrelation between absorbance at different wavelengths is common in near infrared analysis and was observed in an experiment to determine l ycopene in tomatoes. Simulation analysis and experiments were conducted to estimate the effects of this problem on the estimators and on the predictiv e ability of linear regression and ridge regression. Applying linear regres sion to the experimental data resulted in very large parameter values, impl ying poor predictive ability. When linear regression gives very large param eter values, the estimated parameters are practically random numbers and ar e not correlated to the true ones. Ridge regression yielded estimators with normal values, but which are still poorly correlated with the true paramet ers. However, the predictive ability of the derived equation is good and ma y be used in practice to determine lycopene content in tomatoes since it is relatively easy to update.