Criterion-based methods for Bayesian model assessment

Citation
Jg. Ibrahim et al., Criterion-based methods for Bayesian model assessment, STAT SINICA, 11(2), 2001, pp. 419-443
Citations number
31
Categorie Soggetti
Mathematics
Journal title
STATISTICA SINICA
ISSN journal
10170405 → ACNP
Volume
11
Issue
2
Year of publication
2001
Pages
419 - 443
Database
ISI
SICI code
1017-0405(200104)11:2<419:CMFBMA>2.0.ZU;2-1
Abstract
We propose a general Bayesian criterion fur model assessment. The criterion is constructed from the posterior predictive distribution of the data, and can he written as a sum of two components, one involving the means of the posterior predictive. distribution and the other involving the variances. I t can be viewed as a Bayesian goodness-of-fit statistic which measures the performance of a model by a combination of how close its predictions are to the observed data and the variability of the predictions. We call this pro posed predictive criterion the L measure, it is motivated hy earlier work o f Ibrahim and Laud (1994) and related to a criterion of Gelfand and Ghosh ( 1998). We examine the L measure in detail for the class of generalized line ar models and survival models with right censored or interval censored data . We also propose a calibration of the L measure, defined as the prior pred ictive distribution of the difference between the L measures of the candida te model and the criterion minimizing model, and call it the calibration di stribution. The calibration distribution will allow us to formally compare tao competing models based on their L measure values. We discuss theoretica l properties of the calibration distribution in detail, and provide Monte C arlo methods for computing it. For the linear model, we derive an analytic closed form expression for the L measure and the calibration distribution, and;also derive a closed form expression for the mean of the calibration di stribution. These novel developments will enable us to fully characterize t he properties of the L measure for each model under consideration and will facilitate a direct formal comparison between several models, including non -nested models. Informative priors based on historical data and computation al techniques are discussed. Several simulated and real datasets are used t o demonstrate the proposed methodology.