pith. sign in

arxiv: 1309.5781 · v1 · pith:FEBBG6WMnew · submitted 2013-09-23 · 💻 cs.DS

PROBI: A Heuristic for the probabilistic k-median problem

classification 💻 cs.DS
keywords datasummaryprobistreamheuristick-medianprobabilisticproblem
0
0 comments X
read the original abstract

We develop the heuristic PROBI for the probabilistic Euclidean k-median problem based on a coreset construction by Lammersen et al. Our algorithm computes a summary of the data and then uses an adapted version of k-means++ (Arthur and Vassilvitskii, 2007) to compute a good solution on the summary. The summary is maintained in a data stream, so PROBI can be used in a data stream setting on very large data sets. We experimentally evaluate the quality of the summary and of the computed solution and compare the running time to state of the art data stream clustering algorithms.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.