Classification method for disease risk mapping based on discrete hidden Markov random fields

Citation
Charras-garrido, Myriam et al., Classification method for disease risk mapping based on discrete hidden Markov random fields, Biostatistics (Oxford. Print) , 13(2), 2012, pp. 241-255
ISSN journal
14654644
Volume
13
Issue
2
Year of publication
2012
Pages
241 - 255
Database
ACNP
SICI code
Abstract
Risk mapping in epidemiology enables areas with a low or high risk of disease contamination to be localized and provides a measure of risk differences between these regions.Risk mapping models for pooled data currently used by epidemiologists focus on the estimated risk for each geographical unit.They are based on a Poisson log-linear mixed model with a latent intrinsic continuous hidden Markov random field (HMRF) generally corresponding to a Gaussian autoregressive spatial smoothing.Risk classification, which is necessary to draw clearly delimited risk zones (in which protection measures may be applied), generally must be performed separately.We propose a method for direct classified risk mapping based on a Poisson log-linear mixed model with a latent discrete HMRF.The discrete hidden field (HF) corresponds to the assignment of each spatial unit to a risk class.The risk values attached to the classes are parameters and are estimated.When mapping risk using HMRFs, the conditional distribution of the observed field is modeled with a Poisson rather than a Gaussian distribution as in image segmentation.Moreover, abrupt changes in risk levels are rare in disease maps.The spatial hidden model should favor smoothed out risks, but conventional discrete Markov random fields (e.g. the Potts model) do not impose this.We therefore propose new potential functions for the HF that take into account class ordering.We use a Monte Carlo version of the expectation.maximization algorithm to estimate parameters and determine risk classes.We illustrate the method's behavior on simulated and real data sets.Our method appears particularly well adapted to localize high-risk regions and estimate the corresponding risk levels.