pith. sign in

arxiv: 1711.00817 · v3 · pith:ZKI5FK6Bnew · submitted 2017-11-02 · 📊 stat.ML · cs.DS· cs.IT· cs.LG· math.IT

Medoids in almost linear time via multi-armed bandits

classification 📊 stat.ML cs.DScs.ITcs.LGmath.IT
keywords med-ditmedoidmulti-armedperformancepointsthousandsalgorithmalmost
0
0 comments X
read the original abstract

Computing the medoid of a large number of points in high-dimensional space is an increasingly common operation in many data science problems. We present an algorithm Med-dit which uses O(n log n) distance evaluations to compute the medoid with high probability. Med-dit is based on a connection with the multi-armed bandit problem. We evaluate the performance of Med-dit empirically on the Netflix-prize and the single-cell RNA-Seq datasets, containing hundreds of thousands of points living in tens of thousands of dimensions, and observe a 5-10x improvement in performance over the current state of the art. Med-dit is available at https://github.com/bagavi/Meddit

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.