Bui, Ramesh Johari, and Shie Mannor

1 Pith paper cites this work. Polarity classification is still indexing.

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Trading off rewards and errors in multi-armed bandits

cs.LG · 2026-05-01 · unverdicted · novelty 5.0

An algorithm for multi-armed bandits that interpolates between reward maximization and accurate mean estimation, supported by matching upper and lower regret bounds.

citing papers explorer

Showing 1 of 1 citing paper.

  • Trading off rewards and errors in multi-armed bandits · cs.LG · 2026-05-01 · unverdicted · none · ref 6