CO-CLUSTERING SEPARATELY EXCHANGEABLE NETWORK DATA

Citation
David Choi et Patrick J. Wolfe, CO-CLUSTERING SEPARATELY EXCHANGEABLE NETWORK DATA, Annals of statistics , 42(1), 2014, pp. 29-63
Journal title
ISSN journal
00905364
Volume
42
Issue
1
Year of publication
2014
Pages
29 - 63
Database
ACNP
SICI code
Abstract
This article establishes the performance of stochastic blockmodels in addressing the co-clustering problem of partitioning a binary array into subsets, assuming only that the data are generated by a nonparametric process satisfying the condition of separate exchangeability. We provide oracle inequalities with rate of convergence OP(n-1/4) corresponding to profile likelihood maximization and mean-square error minimization, and show that the blockmodel can be interpreted in this setting as an optimal piecewise-constant approximation to the generative nonparametric model. We also show for large sample sizes that the detection of co-clusters in such data indicates with high probability the existence of co-clusters of equal size and asymptotically equivalent connectivity in the underlying generative process.