PROBI: A Heuristic for the probabilistic k-median problem
classification
💻 cs.DS
keywords
datasummaryprobistreamheuristick-medianprobabilisticproblem
read the original abstract
We develop the heuristic PROBI for the probabilistic Euclidean k-median problem based on a coreset construction by Lammersen et al. Our algorithm computes a summary of the data and then uses an adapted version of k-means++ (Arthur and Vassilvitskii, 2007) to compute a good solution on the summary. The summary is maintained in a data stream, so PROBI can be used in a data stream setting on very large data sets. We experimentally evaluate the quality of the summary and of the computed solution and compare the running time to state of the art data stream clustering algorithms.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.