pith. sign in

arxiv: cs/0603120 · v1 · submitted 2006-03-30 · 💻 cs.AI

Approximation Algorithms for K-Modes Clustering

classification 💻 cs.AI
keywords k-modesclusteringapproximationk-medianalgorithmscategoricaldatametric
0
0 comments X
read the original abstract

In this paper, we study clustering with respect to the k-modes objective function, a natural formulation of clustering for categorical data. One of the main contributions of this paper is to establish the connection between k-modes and k-median, i.e., the optimum of k-median is at most twice the optimum of k-modes for the same categorical data clustering problem. Based on this observation, we derive a deterministic algorithm that achieves an approximation factor of 2. Furthermore, we prove that the distance measure in k-modes defines a metric. Hence, we are able to extend existing approximation algorithms for metric k-median to k-modes. Empirical results verify the superiority of our method.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.