Determining the Number of Clusters via Iterative Consensus Clustering

Carl Meyer; Kevin Valakuzhy; Shaina Race

arxiv: 1408.0967 · v1 · pith:MEGLSDTDnew · submitted 2014-08-05 · 📊 stat.ML · cs.CV· cs.LG

Determining the Number of Clusters via Iterative Consensus Clustering

Shaina Race , Carl Meyer , Kevin Valakuzhy This is my paper

classification 📊 stat.ML cs.CVcs.LG

keywords consensusmatrixclustersnumberdatadetermineensembleiterative

0 comments

read the original abstract

We use a cluster ensemble to determine the number of clusters, k, in a group of data. A consensus similarity matrix is formed from the ensemble using multiple algorithms and several values for k. A random walk is induced on the graph defined by the consensus matrix and the eigenvalues of the associated transition probability matrix are used to determine the number of clusters. For noisy or high-dimensional data, an iterative technique is presented to refine this consensus matrix in way that encourages a block-diagonal form. It is shown that the resulting consensus matrix is generally superior to existing similarity matrices for this type of spectral analysis.

This paper has not been read by Pith yet.

Determining the Number of Clusters via Iterative Consensus Clustering

discussion (0)