pith. sign in

https://arxiv.org/pdf/1605.08722.pdf An empirical evaluation of Thompson sampling

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

years

2026 3

verdicts

UNVERDICTED 3

representative citing papers

Spectral bandits

stat.ML · 2026-04-28 · unverdicted · novelty 7.0

Spectral bandits achieve scalable regret in graph-structured recommendation by using an effective dimension to learn good policies from few node evaluations.

Budgeted Online Influence Maximization

cs.LG · 2026-04-21 · unverdicted · novelty 7.0

A new algorithm for online influence maximization under a total budget constraint using the independent cascade model and edge-level semi-bandit feedback, with improved regret bounds for both budgeted and cardinality settings.

Best of both worlds: Stochastic & adversarial best-arm identification

stat.ML · 2026-04-16 · unverdicted · novelty 7.0

No algorithm can be optimal in both stochastic and adversarial best-arm identification; a new parameter-free algorithm matches the derived lower bound up to log factors in stochastic cases while handling adversarial rewards.

citing papers explorer

Showing 3 of 3 citing papers.

  • Spectral bandits stat.ML · 2026-04-28 · unverdicted · none · ref 22

    Spectral bandits achieve scalable regret in graph-structured recommendation by using an effective dimension to learn good policies from few node evaluations.

  • Budgeted Online Influence Maximization cs.LG · 2026-04-21 · unverdicted · none · ref 119

    A new algorithm for online influence maximization under a total budget constraint using the independent cascade model and edge-level semi-bandit feedback, with improved regret bounds for both budgeted and cardinality settings.

  • Best of both worlds: Stochastic & adversarial best-arm identification stat.ML · 2026-04-16 · unverdicted · none · ref 5

    No algorithm can be optimal in both stochastic and adversarial best-arm identification; a new parameter-free algorithm matches the derived lower bound up to log factors in stochastic cases while handling adversarial rewards.