TOWARD A CURSE OF DIMENSIONALITY APPROPRIATE (CODA) ASYMPTOTIC THEORYFOR SEMIPARAMETRIC MODELS

Authors
Citation
Jm. Robins et Y. Ritov, TOWARD A CURSE OF DIMENSIONALITY APPROPRIATE (CODA) ASYMPTOTIC THEORYFOR SEMIPARAMETRIC MODELS, Statistics in medicine, 16(1-3), 1997, pp. 285-319
Citations number
34
Categorie Soggetti
Statistic & Probability","Medicine, Research & Experimental","Public, Environmental & Occupation Heath","Statistic & Probability","Medical Informatics
Journal title
ISSN journal
02776715
Volume
16
Issue
1-3
Year of publication
1997
Pages
285 - 319
Database
ISI
SICI code
0277-6715(1997)16:1-3<285:TACODA>2.0.ZU;2-9
Abstract
We argue, that due to the curse of dimensionality, there are major dif ficulties with any pure or smoothed likelihood-based method of inferen ce in designed studies with randomly missing data when missingness dep ends on a high-dimensional vector of variables. We study in detail a s emi-parametric superpopulation version of continuously stratified rand om sampling. We show that all estimators of the population mean that a re uniformly consistent or that achieve an algebraic rate of convergen ce, no matter how slow, require the use of the selection (randomizatio n) probabilities. We argue that, in contrast to likelihood methods whi ch ignore these probabilities, inverse selection probability weighted estimators continue to perform well achieving uniform n(1/2)-rates of convergence. We propose a curse of dimensionality appropriate (CODA) a symptotic theory for inference in non- and semi-parametric models in a n attempt to formalize our arguments. We discuss whether our results c onstitute a fatal blow to the likelihood principle and study the attit ude toward these that a committed subjective Bayesian would adopt. Fin ally, we apply our CODA theory to analyse the effect of the 'curse of dimensionality' in several interesting semi-parametric models, includi ng a model for a two-armed randomized trial with randomization probabi lities depending on a vector of continuous pretreatment covariates X. We provide substantive settings under which a subjective Bayesian woul d ignore the randomization probabilities in analysing the trial data. We then show that any statistician who ignores the randomization proba bilities is unable to construct nominal 95 per cent confidence interva ls for the true treatment effect that have both: (i) an expected lengt h which goes to zero with increasing sample size; and (ii) a guarantee d expected actual coverage rate of at least 95 per cent over the ensem ble of trials analysed by the statistician during his or her lifetime. However, we derive a new interval estimator, depending on the Randomi zation probabilities, that satisfies (i) and (ii).