pith. sign in

arxiv: 1605.05349 · v1 · pith:ZJJYP233new · submitted 2016-05-17 · 📊 stat.ML

Orthogonal symmetric non-negative matrix factorization under the stochastic block model

classification 📊 stat.ML
keywords factorizationmatrixblockclusteringmethodmodelnon-negativestochastic
0
0 comments X
read the original abstract

We present a method based on the orthogonal symmetric non-negative matrix tri-factorization of the normalized Laplacian matrix for community detection in complex networks. While the exact factorization of a given order may not exist and is NP hard to compute, we obtain an approximate factorization by solving an optimization problem. We establish the connection of the factors obtained through the factorization to a non-negative basis of an invariant subspace of the estimated matrix, drawing parallel with the spectral clustering. Using such factorization for clustering in networks is motivated by analyzing a block-diagonal Laplacian matrix with the blocks representing the connected components of a graph. The method is shown to be consistent for community detection in graphs generated from the stochastic block model and the degree corrected stochastic block model. Simulation results and real data analysis show the effectiveness of these methods under a wide variety of situations, including sparse and highly heterogeneous graphs where the usual spectral clustering is known to fail. Our method also performs better than the state of the art in popular benchmark network datasets, e.g., the political web blogs and the karate club data.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Matrix Factorization Framework for Community Detection under the Degree-Corrected Block Model

    cs.SI 2026-01 unverdicted novelty 7.0

    DCBM community detection is reformulated as constrained nonnegative matrix factorization, producing a scalable method with strong initialization that matches inference quality on large graphs.