Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence

CNRS; CRIStAL); Emilie Kaufmann (SEQUEL; Javed Aslam; Jesse Anderton; Maryam Aziz

arxiv: 1803.04665 · v1 · pith:S34UTKYKnew · submitted 2018-03-13 · 📊 stat.ML · cs.LG

Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence

Maryam Aziz , Jesse Anderton , Emilie Kaufmann (SEQUEL , CNRS , CRIStAL) , Javed Aslam This is my paper

classification 📊 stat.ML cs.LG

keywords banditboundderivecomplexityidentificationlowermodelsnear-optimal

0 comments

read the original abstract

We consider the problem of near-optimal arm identification in the fixed confidence setting of the infinitely armed bandit problem when nothing is known about the arm reservoir distribution. We (1) introduce a PAC-like framework within which to derive and cast results; (2) derive a sample complexity lower bound for near-optimal arm identification; (3) propose an algorithm that identifies a nearly-optimal arm with high probability and derive an upper bound on its sample complexity which is within a log factor of our lower bound; and (4) discuss whether our log^2(1/delta) dependence is inescapable for "two-phase" (select arms first, identify the best later) algorithms in the infinite setting. This work permits the application of bandit models to a broader class of problems where fewer assumptions hold.

This paper has not been read by Pith yet.

Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence

discussion (0)