Optimal Best Arm Identification with Fixed Confidence

Aurélien Garivier (IMT); Emilie Kaufmann (CRIStAL; SEQUEL)

arXiv: 1602.04589 · v2 · pith:VYBGI5A6new · submitted 2016-02-15 · math.ST · cs.LG · stat.ML · stat.TH

keywords: optimal, bound, complexity, give, identification, lower, prove, rule

We give a complete characterization of the complexity of best-arm identification in one-parameter bandit problems. We prove a new, tight lower bound on the sample complexity. We propose the 'Track-and-Stop' strategy, which we prove to be asymptotically optimal. It consists of a new sampling rule (which tracks the optimal proportions of arm draws highlighted by the lower bound) and a stopping rule named after Chernoff, for which we give a new analysis.
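
The "optimal proportions of arm draws" mentioned in the abstract can be illustrated with a small sketch. It assumes Gaussian arms with a known common variance, which is only one instance of a one-parameter family, and it uses a brute-force grid search for two arms rather than the paper's own procedure. All function names are hypothetical. The idea illustrated: for a weight vector w, the quantity to maximize is the smallest weighted divergence needed to make some suboptimal arm look at least as good as the best arm, and the inverse of the maximum gives the characteristic time that scales the lower bound.

```python
def gaussian_kl(x, y, sigma=1.0):
    """KL divergence between N(x, sigma^2) and N(y, sigma^2) (assumed arm model)."""
    return (x - y) ** 2 / (2.0 * sigma ** 2)


def transport_cost(w_best, w_a, mu_best, mu_a, sigma=1.0):
    """Weighted divergence to the closest alternative in which arm a ties the best arm."""
    if w_best + w_a == 0:
        return 0.0
    # The closest alternative moves both means to their weighted average.
    m = (w_best * mu_best + w_a * mu_a) / (w_best + w_a)
    return w_best * gaussian_kl(mu_best, m, sigma) + w_a * gaussian_kl(mu_a, m, sigma)


def objective(w, mu, sigma=1.0):
    """Minimum transport cost over suboptimal arms for the proportions w."""
    best = max(range(len(mu)), key=lambda a: mu[a])
    return min(
        transport_cost(w[best], w[a], mu[best], mu[a], sigma)
        for a in range(len(mu))
        if a != best
    )


def two_arm_optimal_weight(mu, sigma=1.0, grid=1000):
    """Grid search (illustrative only) for the weight of arm 0 in a two-armed problem."""
    candidates = [i / grid for i in range(1, grid)]
    return max(candidates, key=lambda w0: objective([w0, 1.0 - w0], mu, sigma))


if __name__ == "__main__":
    mu = [1.0, 0.0]
    w0 = two_arm_optimal_weight(mu)
    print("weight on arm 0:", w0)
    print("characteristic time:", 1.0 / objective([w0, 1.0 - w0], mu))
```

Under these assumptions the two arms should be sampled equally often, and the characteristic time equals 8σ²/Δ² for a gap Δ between the means. With more than two arms the maximization is over the whole simplex and a grid search is no longer a sensible choice.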

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Anytime-valid Optimal Policy Identification
stat.ME 2026-06 unverdicted novelty 6.0

Constructs a time-indexed set S_t retaining the true optimal policy uniformly over time with high probability, enabling early stopping with sample complexity O((log |Π| + log log(1/Δ_min))/Δ_min²) when the optimum is unique.