On the Consistency of $k$-means++ algorithm

Mieczys{\l}aw A. K{\l}opotek

arxiv: 1702.06120 · v1 · pith:ZU6O54WCnew · submitted 2017-02-20 · 💻 cs.LG

On the Consistency of k-means++ algorithm

Mieczys{\l}aw A. K{\l}opotek This is my paper

classification 💻 cs.LG

keywords meansalgorithmapproximationdataexpectedlargepopulationsamples

0 comments

read the original abstract

We prove in this paper that the expected value of the objective function of the $k$-means++ algorithm for samples converges to population expected value. As $k$-means++, for samples, provides with constant factor approximation for $k$-means objectives, such an approximation can be achieved for the population with increase of the sample size. This result is of potential practical relevance when one is considering using subsampling when clustering large data sets (large data bases).

This paper has not been read by Pith yet.

On the Consistency of k-means++ algorithm

discussion (0)