Thresholding Bandit for Dose-ranging: The Impact of Monotonicity

Aur\'elien Garivier (IMT); Laurent Rossi (IMT); Pierre M\'enard (IMT); Pierre Menard (IMT)

arxiv: 1711.04454 · v2 · pith:OQDDH55Fnew · submitted 2017-11-13 · 🧮 math.ST · stat.ML· stat.TH

Thresholding Bandit for Dose-ranging: The Impact of Monotonicity

Aur\'elien Garivier (IMT) , Pierre M\'enard (IMT) , Laurent Rossi (IMT) , Pierre Menard (IMT) This is my paper

classification 🧮 math.ST stat.MLstat.TH

keywords algorithmbanditcomplexitydeltaincreasingsamplethresholdingaddition

0 comments

read the original abstract

We analyze the sample complexity of the thresholding bandit problem, with and without the assumption that the mean values of the arms are increasing. In each case, we provide a lower bound valid for any risk $\delta$ and any $\delta$-correct algorithm; in addition, we propose an algorithm whose sample complexity is of the same order of magnitude for small risks. This work is motivated by phase 1 clinical trials, a practically important setting where the arm means are increasing by nature, and where no satisfactory solution is available so far.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Non-Asymptotic Pure Exploration by Solving Games
stat.ML 2019-06 unverdicted novelty 7.0

Game-solving algorithms using no-regret learners achieve non-asymptotic optimality guarantees for pure exploration in exponential family bandits.