Regression analysis with spatially autocorrelated error: simulation studies and application to mapping of soil organic matter

Authors
Citation
Rm. Lark, Regression analysis with spatially autocorrelated error: simulation studies and application to mapping of soil organic matter, INT J GEO I, 14(3), 2000, pp. 247-264
Citations number
29
Categorie Soggetti
EnvirnmentalStudies Geografy & Development
Journal title
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE
ISSN journal
13658816 → ACNP
Volume
14
Issue
3
Year of publication
2000
Pages
247 - 264
Database
ISI
SICI code
1365-8816(200004/05)14:3<247:RAWSAE>2.0.ZU;2-3
Abstract
Regression is often used in analysis of spatial data to obtain predictive r elationships between variables. The assumption that the errors from the reg ression model are statistically independent will often not be plausible, du e to spatial dependence in the sources of error. This is a problem for the regression analysis in that the resulting estimate of the standard deviatio n of the errors from the model is biased (downwards) which invalidates conf idence limits on predictions made with the model, and which could lead to a false conclusion that the regression is statistically significant. While t he estimates of the regression coefficient(s) are not necessarily biased th ey are not minimum-variance estimates when the errors are correlated. It is shown how the maximum likelihood method of estimating the regression model(ML) might be used to overcome this problem. It is proposed that stand ard variogram functions may be used to model the spatial dependence of the errors from the regression. In simulation studies it is shown that the meth od avoids bias in the estimation of the standard deviation of the regressio n error from a systematic sample (unless the spatial interval over which th e errors are correlated is of similar order to the dimensions of the system atic sample grid). The precision of the ML estimate of the error is poorer than achieved from a random sample, but this can be improved to some extent by constraining the parameters of the variogram function. The ML procedure is demonstrated in the analysis of some remote sensor data to predict organic matter content of the top soil within an arable field. The data on top soil had been collected on a systematic grid. Analysis of t he residuals from an ordinary least-squares regression indicated an appropr iate variogram function to model the spatial dependence of the errors. Conf idence limits for predictions using the regression were calculated from the ML estimate of the error.