https://arxiv.org/pdf/1605.08722.pdf An empirical evaluation of Thompson sampling

Olivier Chapelle, Lihong Li · 2011 · arXiv 1605.08722

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

stat.ML · 2026-04-28 · unverdicted · novelty 7.0

Spectral bandits achieve scalable regret in graph-structured recommendation by using an effective dimension to learn good policies from few node evaluations.

Budgeted Online Influence Maximization

cs.LG · 2026-04-21 · unverdicted · novelty 7.0

A new algorithm for online influence maximization under a total budget constraint using the independent cascade model and edge-level semi-bandit feedback, with improved regret bounds for both budgeted and cardinality settings.

Best of both worlds: Stochastic & adversarial best-arm identification

stat.ML · 2026-04-16 · unverdicted · novelty 7.0

No algorithm can be optimal in both stochastic and adversarial best-arm identification; a new parameter-free algorithm matches the derived lower bound up to log factors in stochastic cases while handling adversarial rewards.

citing papers explorer

Showing 3 of 3 citing papers.

Spectral bandits stat.ML · 2026-04-28 · unverdicted · none · ref 22
Spectral bandits achieve scalable regret in graph-structured recommendation by using an effective dimension to learn good policies from few node evaluations.
Budgeted Online Influence Maximization cs.LG · 2026-04-21 · unverdicted · none · ref 119
A new algorithm for online influence maximization under a total budget constraint using the independent cascade model and edge-level semi-bandit feedback, with improved regret bounds for both budgeted and cardinality settings.
Best of both worlds: Stochastic & adversarial best-arm identification stat.ML · 2026-04-16 · unverdicted · none · ref 5
No algorithm can be optimal in both stochastic and adversarial best-arm identification; a new parameter-free algorithm matches the derived lower bound up to log factors in stochastic cases while handling adversarial rewards.

https://arxiv.org/pdf/1605.08722.pdf An empirical evaluation of Thompson sampling

fields

years

verdicts

representative citing papers

citing papers explorer