pith. sign in

arxiv: 1506.03729 · v1 · pith:CRSKUQNDnew · submitted 2015-06-11 · 🧮 math.PR · cs.IT· cs.LG· cs.SI· math.IT

Recovering communities in the general stochastic block model without knowing the parameters

classification 🧮 math.PR cs.ITcs.LGcs.SImath.IT
keywords communitiesmodelparametersalgorithmdegreesoptimalregimeaccuracy
0
0 comments X
read the original abstract

Most recent developments on the stochastic block model (SBM) rely on the knowledge of the model parameters, or at least on the number of communities. This paper introduces efficient algorithms that do not require such knowledge and yet achieve the optimal information-theoretic tradeoffs identified in [AS15] for linear size communities. The results are three-fold: (i) in the constant degree regime, an algorithm is developed that requires only a lower-bound on the relative sizes of the communities and detects communities with an optimal accuracy scaling for large degrees; (ii) in the regime where degrees are scaled by $\omega(1)$ (diverging degrees), this is enhanced into a fully agnostic algorithm that only takes the graph in question and simultaneously learns the model parameters (including the number of communities) and detects communities with accuracy $1-o(1)$, with an overall quasi-linear complexity; (iii) in the logarithmic degree regime, an agnostic algorithm is developed that learns the parameters and achieves the optimal CH-limit for exact recovery, in quasi-linear time. These provide the first algorithms affording efficiency, universality and information-theoretic optimality for strong and weak consistency in the general SBM with linear size communities.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Node-Private Community Detection in Stochastic Block Models

    math.ST 2026-04 unverdicted novelty 7.0

    Node-private community detection in sparse SBMs requires a logarithmic privacy budget for exact recovery, with matching upper and lower bounds on the minimax risk that separate statistical signal from privacy cost.