arxiv: 1805.04920 · v1 · pith:EYAMBL3Enew · submitted 2018-05-13 · 💻 cs.SI · physics.soc-ph

Community Detection by Information Flow Simulation

keywords algorithm · data · flow · information · communities · community · detection · existing

Community detection remains an important problem in data mining, owing to the lack of scalable algorithms that exploit all aspects of the available data, namely the directionality of information flow and its dynamics. Most existing methods rely on measures of connectedness in the graph structure. In this paper, we present a fast, scalable algorithm that detects communities in directed, weighted graph representations of social networks by simulating the flow of information through them. By design, our algorithm also handles undirected or unweighted networks. It runs in $\mathcal{O}(|E|)$ time, which is better than most existing work, and uses $\mathcal{O}(|E|)$ space, so it scales easily to very large datasets. Finally, we show that our algorithm outperforms the state-of-the-art Markov Clustering Algorithm (MCL) in both accuracy and scalability on ground-truth data; in a number of cases, it finds communities in graphs too large for MCL.
