Better Algorithms for Stochastic Bandits with Adversarial Corruptions

Anupam Gupta; Kunal Talwar; Tomer Koren

arXiv: 1902.08647 · v2 · pith:MSFCUDUEnew · submitted 2019-02-22 · cs.LG · stat.ML

keywords: adversarial, algorithm, bandits, corruption, problem, stochastic, agnostic, algorithms

Abstract

We study the stochastic multi-armed bandits problem in the presence of adversarial corruption. We present a new algorithm for this problem whose regret is nearly optimal, substantially improving upon previous work. Our algorithm is agnostic to the level of adversarial contamination and can tolerate a significant amount of corruption with virtually no degradation in performance.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Do Not Trust The Auctioneer: Learning to Bid in Feedback-Manipulated Auctions
stat.ML · 2026-05 · unverdicted · novelty 7.0

In first-price auctions with feedback-only shilling, an algorithm combining robust interval elimination and optimistic debiasing with racing achieves near-optimal regret rates of O(T^{2/3}) or O(sqrt(T)) and matches a...