pith. sign in

arxiv: 2007.11416 · v1 · pith:MCUVB7YHnew · submitted 2020-07-21 · 💻 cs.LG · stat.ML

Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling

classification 💻 cs.LG stat.ML
keywords clusteringnystromalgorithmsanalyzingapproximationsheuristiclow-ranksampling
0
0 comments X
read the original abstract

Spectral clustering has shown a superior performance in analyzing the cluster structure. However, its computational complexity limits its application in analyzing large-scale data. To address this problem, many low-rank matrix approximating algorithms are proposed, including the Nystrom method - an approach with proven approximate error bounds. There are several algorithms that provide recipes to construct Nystrom approximations with variable accuracies and computing times. This paper proposes a scalable Nystrom-based clustering algorithm with a new sampling procedure, Centroid Minimum Sum of Squared Similarities (CMS3), and a heuristic on when to use it. Our heuristic depends on the eigen spectrum shape of the dataset, and yields competitive low-rank approximations in test datasets compared to the other state-of-the-art methods

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.