A RATE OPTIMAL PROCEDURE FOR RECOVERING SPARSE DIFFERENCES BETWEEN HIGH-DIMENSIONAL MEANS UNDER DEPENDENCE

Citation
Jun Li et Ping-shou Zhong, A RATE OPTIMAL PROCEDURE FOR RECOVERING SPARSE DIFFERENCES BETWEEN HIGH-DIMENSIONAL MEANS UNDER DEPENDENCE, Annals of statistics , 45(2), 2017, pp. 557-590
Journal title
ISSN journal
00905364
Volume
45
Issue
2
Year of publication
2017
Pages
557 - 590
Database
ACNP
SICI code
Abstract
The paper considers the problem of recovering the sparse different components between two high-dimensional means of column-wise dependent random vectors. We show that dependence can be utilized to lower the identification boundary for signal recovery. Moreover, an optimal convergence rate for the marginal false nondiscovery rate (mFNR) is established under dependence. The convergence rate is faster than the optimal rate without dependence. To recover the sparse signal bearing dimensions, we propose a Dependence-Assisted Thresholding and Excising (DATE) procedure, which is shown to be rate optimal for the mFNR with the marginal false discovery rate (mFDR) controlled at a pre-specified level. Extensions of the DATE to recover the differences in contrasts among multiple population means and differences between two covariance matrices are also provided. Simulation studies and case study are given to demonstrate the performance of the proposed signal identification procedure.