Unimodal bandits: Regret lower bounds and optimal algorithms

Richard Combes, Alexandre Proutiere · 2014

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

stat.ML · 2019-06-25 · unverdicted · novelty 7.0

Game-solving algorithms using no-regret learners achieve non-asymptotic optimality guarantees for pure exploration in exponential family bandits.

Showing 1 of 1 citing paper.

Non-Asymptotic Pure Exploration by Solving Games stat.ML · 2019-06-25 · unverdicted · none · ref 7
Game-solving algorithms using no-regret learners achieve non-asymptotic optimality guarantees for pure exploration in exponential family bandits.