neurips.cc/paper_files/paper/2017/file/22b1f2e0983160db6f7bb9f62f4dbb39-Paper.pdf

URLhttps://proceedings · 2017

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Multi-Armed Bandits With Best-Action Queries

cs.LG · 2026-05-08 · unverdicted · novelty 8.0

Best-action queries yield Õ(min{T/k, √(T-k)}) regret for i.i.d. stochastic rewards but only Ω(√(T-k)) regret for correlated stochastic or adversarial rewards in the bandit-feedback model.

citing papers explorer

Showing 1 of 1 citing paper.

Multi-Armed Bandits With Best-Action Queries cs.LG · 2026-05-08 · unverdicted · none · ref 3
Best-action queries yield Õ(min{T/k, √(T-k)}) regret for i.i.d. stochastic rewards but only Ω(√(T-k)) regret for correlated stochastic or adversarial rewards in the bandit-feedback model.

neurips.cc/paper_files/paper/2017/file/22b1f2e0983160db6f7bb9f62f4dbb39-Paper.pdf

fields

years

verdicts

representative citing papers

citing papers explorer