A moment-based method for estimating the proportion of true null hypotheses and its application to microarray gene expression data

Authors
Citation
Lai, Yinglei, A moment-based method for estimating the proportion of true null hypotheses and its application to microarray gene expression data, Biostatistics (Oxford. Print) , 8(4), 2007, pp. 744-755
ISSN journal
14654644
Volume
8
Issue
4
Year of publication
2007
Pages
744 - 755
Database
ACNP
SICI code
Abstract
Due to advances in experimental technologies, it is feasible to collect measurements for a large number of variables.When these variables are simultaneously screened by a statistical test, it is necessary to consider the adjustment for multiple hypothesis testing.The false discovery rate has been proposed and widely used to address this issue.A related problem is the estimation of the proportion of true null hypotheses.The long-standing difficulty to this problem is the identifiability of the nonparametric model.In this study, we propose a moment-based method coupled with sample splitting for estimating this proportion.If the p values from the alternative hypothesis are homogeneously distributed, then the proposed method will solve the identifiability and give its optimal performances.When the p values from the alternative hypothesis are heterogeneously distributed, we propose to approximate this mixture distribution so that the identifiability can be achieved.Theoretical aspects of the approximation error are discussed.The proposed estimation method is completely nonparametric and simple with an explicit formula.Simulation studies show the favorable performances of the proposed method when it is compared to the other existing methods.Two microarray gene expression data sets are considered for applications.