
arxiv: 1011.5270 · v2 · pith:3PZKZ54Gnew · submitted 2010-11-24 · 📊 stat.ML · cs.LG

Classifying Clustering Schemes

keywords: clustering, schemes, functoriality, classification, conditions, construct, defined, example

Many clustering schemes are defined by optimizing an objective function defined on the partitions of the underlying set of a finite metric space. In this paper, we construct a framework for studying what happens when we instead impose various structural conditions on the clustering schemes, under the general heading of functoriality. Functoriality refers to the idea that one should be able to compare the results of clustering algorithms as one varies the data set, for example by adding points or by applying functions to it. We show that within this framework, one can prove a theorem analogous to one of J. Kleinberg, in which, for example, one obtains an existence and uniqueness theorem instead of a non-existence result. We obtain a full classification of all clustering schemes satisfying a condition we refer to as excisiveness. The classification can be changed by varying the notion of maps of finite metric spaces. The conditions occur naturally when one considers clustering as the statistical version of the geometric notion of connected components. By varying the degree of functoriality that one requires from the schemes, it is possible to construct richer families of clustering schemes that exhibit sensitivity to density.
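To make the idea of functoriality concrete, here is a minimal Python sketch. The abstract does not name a specific scheme, so this is an illustrative assumption: points are clustered as the connected components of the graph that links two points whenever their distance is at most a threshold `delta`. The names `threshold_clusters` and `refines_under`, the sample points, and the value of `delta` are all hypothetical. The check at the end tests one instance of the comparison the abstract describes: when a point is added to the data set, points that were clustered together stay together.

```python
from itertools import combinations


def threshold_clusters(points, dist, delta):
    """Partition `points` into connected components of the graph that
    links p and q whenever dist(p, q) <= delta (illustrative scheme)."""
    parent = {p: p for p in points}

    def find(p):
        # Union-find lookup with path halving.
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    for p, q in combinations(points, 2):
        if dist(p, q) <= delta:
            parent[find(p)] = find(q)

    blocks = {}
    for p in points:
        blocks.setdefault(find(p), set()).add(p)
    return {frozenset(b) for b in blocks.values()}


def refines_under(f, source_partition, target_partition):
    """True if f sends every block of the source partition into a
    single block of the target partition."""
    block_of = {p: b for b in target_partition for p in b}
    return all(
        len({block_of[f(p)] for p in block}) == 1
        for block in source_partition
    )


# Hypothetical data: points on the real line with the usual distance.
dist = lambda a, b: abs(a - b)
X = [0.0, 1.0, 5.0]
Y = [0.0, 1.0, 3.0, 5.0]  # X with one point added
delta = 2.0

clusters_X = threshold_clusters(X, dist, delta)
clusters_Y = threshold_clusters(Y, dist, delta)
inclusion = lambda p: p  # the map X -> Y that adds the new point

print(clusters_X)
print(clusters_Y)
print(refines_under(inclusion, clusters_X, clusters_Y))
```

In this example the inclusion of `X` into `Y` does not increase any distance, and the clusters of `X` map into clusters of `Y`, even though the added point merges two previously separate clusters. This is only a single check on one map, not a proof of functoriality for the scheme.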



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. It's All About Covers: Persistent Homology of Cover Refinements

    math.AT 2026-02 unverdicted novelty 8.0

    Cover refinements enable a near-linear-size approximation to the Vietoris-Rips filtration with unconditional log-3 interleaving that preserves persistent homology.