Pure Exploration with Multiple Correct Answers

R\'emy Degenne; Wouter M. Koolen

arxiv: 1902.03475 · v1 · pith:CYHJ7QVVnew · submitted 2019-02-09 · 💻 cs.LG · stat.ML

Pure Exploration with Multiple Correct Answers

R\'emy Degenne , Wouter M. Koolen This is my paper

classification 💻 cs.LG stat.ML

keywords complexitysamplealgorithmanswersboundconvexityexplorationlower

0 comments

read the original abstract

We determine the sample complexity of pure exploration bandit problems with multiple good answers. We derive a lower bound using a new game equilibrium argument. We show how continuity and convexity properties of single-answer problems ensures that the Track-and-Stop algorithm has asymptotically optimal sample complexity. However, that convexity is lost when going to the multiple-answer setting. We present a new algorithm which extends Track-and-Stop to the multiple-answer case and has asymptotic sample complexity matching the lower bound.

This paper has not been read by Pith yet.

Pure Exploration with Multiple Correct Answers

discussion (0)