pith. machine review for the scientific record. sign in

arxiv: 1705.06894 · v1 · submitted 2017-05-19 · 💻 cs.LG · cs.DS· stat.ML

Recognition: unknown

Practical Algorithms for Best-K Identification in Multi-Armed Bandits

Authors on Pith no claims yet
classification 💻 cs.LG cs.DSstat.ML
keywords best-problemalgorithmsarmspracticalidentificationadaptivelyapplications
0
0 comments X
read the original abstract

In the Best-$K$ identification problem (Best-$K$-Arm), we are given $N$ stochastic bandit arms with unknown reward distributions. Our goal is to identify the $K$ arms with the largest means with high confidence, by drawing samples from the arms adaptively. This problem is motivated by various practical applications and has attracted considerable attention in the past decade. In this paper, we propose new practical algorithms for the Best-$K$-Arm problem, which have nearly optimal sample complexity bounds (matching the lower bound up to logarithmic factors) and outperform the state-of-the-art algorithms for the Best-$K$-Arm problem (even for $K=1$) in practice.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.