Title resolution pending

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Multi-Armed Bandits With Best-Action Queries

cs.LG · 2026-05-08 · unverdicted · novelty 8.0

Best-action queries yield Õ(min{T/k, √(T-k)}) regret for i.i.d. stochastic rewards, but regret remains Ω(√(T-k)) for correlated stochastic or adversarial rewards in the bandit-feedback model.

citing papers explorer

Showing 1 of 1 citing paper.

  • Multi-Armed Bandits With Best-Action Queries cs.LG · 2026-05-08 · unverdicted · none · ref 6

    Best-action queries yield Õ(min{T/k, √(T-k)}) regret for i.i.d. stochastic rewards but only Ω(√(T-k)) regret for correlated stochastic or adversarial rewards in the bandit-feedback model.