Accurate Community Detection in the Stochastic Block Model via Spectral Algorithms

Alexandre Proutiere; Se-Young Yun

arxiv: 1412.7335 · v1 · pith:I333JUHTnew · submitted 2014-12-23 · 💻 cs.SI · cs.DS

Accurate Community Detection in the Stochastic Block Model via Spectral Algorithms

Se-Young Yun , Alexandre Proutiere This is my paper

classification 💻 cs.SI cs.DS

keywords alphacommunitiesspectralverticesalgorithmsfracmodelblock

0 comments

read the original abstract

We consider the problem of community detection in the Stochastic Block Model with a finite number $K$ of communities of sizes linearly growing with the network size $n$. This model consists in a random graph such that each pair of vertices is connected independently with probability $p$ within communities and $q$ across communities. One observes a realization of this random graph, and the objective is to reconstruct the communities from this observation. We show that under spectral algorithms, the number of misclassified vertices does not exceed $s$ with high probability as $n$ grows large, whenever $pn=\omega(1)$, $s=o(n)$ and \begin{equation*} \lim\inf_{n\to\infty} {n(\alpha_1 p+\alpha_2 q-(\alpha_1 + \alpha_2)p^{\frac{\alpha_1}{\alpha_1 + \alpha_2}}q^{\frac{\alpha_2}{\alpha_1 + \alpha_2}})\over \log (\frac{n}{s})} >1,\quad\quad(1) \end{equation*} where $\alpha_1$ and $\alpha_2$ denote the (fixed) proportions of vertices in the two smallest communities. In view of recent work by Abbe et al. and Mossel et al., this establishes that the proposed spectral algorithms are able to exactly recover communities whenever this is at all possible in the case of networks with two communities with equal sizes. We conjecture that condition (1) is actually necessary to obtain less than $s$ misclassified vertices asymptotically, which would establish the optimality of spectral method in more general scenarios.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Consistency of Lloyd's Algorithm Under Perturbations
cs.LG 2023-09 unverdicted novelty 5.0

Lloyd's algorithm on perturbed sub-Gaussian mixture samples has exponentially bounded mis-clustering rate after O(log n) iterations when initialized properly and perturbation is small relative to noise.