Bypassing the monster: A faster and simpler optimal algorithm for contextual bandits under realizability.Mathematics of Operations Research, 47(3):1904–1931, 2022

David Simchi-Levi, Yunzong Xu · 1904 · arXiv 2021.1193

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

RIE-Greedy: Regularization-Induced Exploration for Contextual Bandits

stat.ML · 2026-03-11 · unverdicted · novelty 5.0

RIE-Greedy uses stochasticity from cross-validation regularization to induce Thompson Sampling-like exploration, claimed equivalent in the two-armed case and empirically competitive in large-scale settings.

citing papers explorer

Showing 1 of 1 citing paper.

RIE-Greedy: Regularization-Induced Exploration for Contextual Bandits stat.ML · 2026-03-11 · unverdicted · none · ref 20
RIE-Greedy uses stochasticity from cross-validation regularization to induce Thompson Sampling-like exploration, claimed equivalent in the two-armed case and empirically competitive in large-scale settings.

Bypassing the monster: A faster and simpler optimal algorithm for contextual bandits under realizability.Mathematics of Operations Research, 47(3):1904–1931, 2022

fields

years

verdicts

representative citing papers

citing papers explorer