pith. sign in

arxiv: 1406.3837 · v1 · pith:RDZGDIRCnew · submitted 2014-06-15 · 📊 stat.ML · cs.LG

An Incremental Reseeding Strategy for Clustering

classification 📊 stat.ML cs.LG
keywords algorithmaccuracyclustergraphmagnitudeorderreseedingterms
0
0 comments X
read the original abstract

In this work we propose a simple and easily parallelizable algorithm for multiway graph partitioning. The algorithm alternates between three basic components: diffusing seed vertices over the graph, thresholding the diffused seeds, and then randomly reseeding the thresholded clusters. We demonstrate experimentally that the proper combination of these ingredients leads to an algorithm that achieves state-of-the-art performance in terms of cluster purity on standard benchmarks datasets. Moreover, the algorithm runs an order of magnitude faster than the other algorithms that achieve comparable results in terms of accuracy. We also describe a coarsen, cluster and refine approach similar to GRACLUS and METIS that removes an additional order of magnitude from the runtime of our algorithm while still maintaining competitive accuracy.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.