Thresholding Bandit for Dose-ranging: The Impact of Monotonicity

· 2017 · math.ST · arXiv 1711.04454

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

We analyze the sample complexity of the thresholding bandit problem, with and without the assumption that the mean values of the arms are increasing. In each case, we provide a lower bound valid for any risk $\delta$ and any $\delta$-correct algorithm; in addition, we propose an algorithm whose sample complexity is of the same order of magnitude for small risks. This work is motivated by phase 1 clinical trials, a practically important setting where the arm means are increasing by nature, and where no satisfactory solution is available so far.

representative citing papers

Non-Asymptotic Pure Exploration by Solving Games

stat.ML · 2019-06-25 · unverdicted · novelty 7.0

Game-solving algorithms using no-regret learners achieve non-asymptotic optimality guarantees for pure exploration in exponential family bandits.

citing papers explorer

Showing 1 of 1 citing paper.

Non-Asymptotic Pure Exploration by Solving Games stat.ML · 2019-06-25 · unverdicted · none · ref 14 · internal anchor
Game-solving algorithms using no-regret learners achieve non-asymptotic optimality guarantees for pure exploration in exponential family bandits.

Thresholding Bandit for Dose-ranging: The Impact of Monotonicity

fields

years

verdicts

representative citing papers

citing papers explorer