Efficient semiparametric estimation of haplotype-disease associations in case.cohort and nested case.control studies

Citation
D. Zeng, et al., Efficient semiparametric estimation of haplotype-disease associations in case.cohort and nested case.control studies, Biostatistics (Oxford. Print) , 7(3), 2006, pp. 486-502
ISSN journal
14654644
Volume
7
Issue
3
Year of publication
2006
Pages
486 - 502
Database
ACNP
SICI code
Abstract
Estimating the effects of haplotypes on the age of onset of a disease is an important step toward the discovery of genes that influence complex human diseases.A haplotype is a specific sequence of nucleotides on the same chromosome of an individual and can only be measured indirectly through the genotype.We consider cohort studies which collect genotype data on a subset of cohort members through case.cohort or nested case.control sampling.We formulate the effects of haplotypes and possibly time-varying environmental variables on the age of onset through a broad class of semiparametric regression models.We construct appropriate nonparametric likelihoods, which involve both finite- and infinite-dimensional parameters.The corresponding nonparametric maximum likelihood estimators are shown to be consistent, asymptotically normal, and asymptotically efficient.Consistent variance.covariance estimators are provided, and efficient and reliable numerical algorithms are developed.Simulation studies demonstrate that the asymptotic approximations are accurate in practical settings and that case.cohort and nested case.control designs are highly cost-effective.An application to a major cardiovascular study is provided.