Co-clustering separately exchangeable network data

David Choi; Patrick J. Wolfe

arxiv: 1212.4093 · v5 · pith:QFWG4VORnew · submitted 2012-12-17 · math.ST · cs.SI · math.CO · stat.ML · stat.TH




keywords: data, co-clustering, co-clusters, generative, nonparametric, process, addressing, approximation



This article establishes the performance of stochastic blockmodels in addressing the co-clustering problem of partitioning a binary array into subsets, assuming only that the data are generated by a nonparametric process satisfying the condition of separate exchangeability. We provide oracle inequalities with rate of convergence $\mathcal{O}_P(n^{-1/4})$ corresponding to profile likelihood maximization and mean-square error minimization, and show that the blockmodel can be interpreted in this setting as an optimal piecewise-constant approximation to the generative nonparametric model. We also show for large sample sizes that the detection of co-clusters in such data indicates with high probability the existence of co-clusters of equal size and asymptotically equivalent connectivity in the underlying generative process.
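To make the piecewise-constant interpretation concrete, the following is a minimal sketch, not the authors' procedure. Given row and column labels, it fits a blockmodel to a binary array by replacing each block with its average and reports the mean-square error of that fit. The greedy relabeling search, the function names (`block_means`, `mse`, `local_search`), and the toy 8 x 8 array are all assumptions made for illustration. The abstract concerns the statistical behavior of the mean-square error minimizer, not any particular optimization heuristic, and a greedy search of this kind may stop at a local optimum.

```python
import random


def block_means(A, rows, cols, K, L):
    """Average of A over each (row cluster, column cluster) block."""
    total = [[0.0] * L for _ in range(K)]
    count = [[0] * L for _ in range(K)]
    for i, row in enumerate(A):
        for j, a in enumerate(row):
            total[rows[i]][cols[j]] += a
            count[rows[i]][cols[j]] += 1
    # Empty blocks get mean 0.0; they contain no entries, so the choice is harmless.
    return [[total[k][l] / count[k][l] if count[k][l] else 0.0
             for l in range(L)] for k in range(K)]


def mse(A, rows, cols, K, L):
    """Mean-square error of the piecewise-constant fit induced by the labels."""
    theta = block_means(A, rows, cols, K, L)
    n, m = len(A), len(A[0])
    return sum((A[i][j] - theta[rows[i]][cols[j]]) ** 2
               for i in range(n) for j in range(m)) / (n * m)


def local_search(A, K, L, sweeps=5, seed=0):
    """Greedy relabeling heuristic (an illustrative assumption, not from the paper).

    Each row or column is moved to the cluster that gives the lowest MSE,
    so the MSE never increases from one accepted move to the next.
    """
    rng = random.Random(seed)
    n, m = len(A), len(A[0])
    rows = [rng.randrange(K) for _ in range(n)]
    cols = [rng.randrange(L) for _ in range(m)]
    best = mse(A, rows, cols, K, L)
    for _ in range(sweeps):
        for labels, size, n_clusters in ((rows, n, K), (cols, m, L)):
            for idx in range(size):
                keep = labels[idx]
                for c in range(n_clusters):
                    labels[idx] = c
                    score = mse(A, rows, cols, K, L)
                    if score < best:
                        best, keep = score, c
                labels[idx] = keep
    return rows, cols, best


# Hypothetical toy data: an 8 x 8 binary array with a planted 2 x 2 block structure.
truth = [0] * 4 + [1] * 4
A = [[1 if truth[i] == truth[j] else 0 for j in range(8)] for i in range(8)]

rows, cols, best = local_search(A, K=2, L=2)
print("MSE at planted labels:", mse(A, truth, truth, 2, 2))
print("MSE after greedy search:", best)
```

On this noiseless toy array the planted labels reproduce the data exactly, whereas a single block (K = L = 1) can only fit the overall mean. Any choice of labels falls between those two extremes, which is the sense in which the blockmodel acts as a piecewise-constant approximation here.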


