No Internal Regret via Neighborhood Watch

Alexander Rakhlin; Dean Foster

arXiv: 1108.6088 · v1 · submitted 2011-08-30 · cs.LG · cs.GT

keywords: games, monitoring, partial, regret, condition, finite, internal, sqrt

We present an algorithm that attains O(\sqrt{T}) internal (and thus external) regret for finite games with partial monitoring under the local observability condition. Bartók, Pál, and Szepesvári (2011) recently showed that this condition implies the O(\sqrt{T}) rate for partial-monitoring games against an i.i.d. opponent, and conjectured that the same holds for non-stochastic adversaries. We answer this conjecture in the affirmative, which completes the characterization of possible rates for finite partial-monitoring games, an open question stated by Cesa-Bianchi, Lugosi, and Stoltz (2006). Our regret guarantees also hold for the more general model of partial monitoring with random signals.
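
The phrase "internal (and thus external) regret" rests on a standard fact: with N actions, external regret is at most N times internal regret. The sketch below is not the paper's algorithm. It only computes both quantities for a given play sequence, and it assumes the full loss vector of every round is known, which the partial-monitoring setting does not provide. The function names and the toy data are illustrative assumptions.

```python
def external_regret(actions, losses):
    """Loss of the played actions minus the loss of the best fixed action."""
    n = len(losses[0])
    played = sum(loss[a] for a, loss in zip(actions, losses))
    best_fixed = min(sum(loss[j] for loss in losses) for j in range(n))
    return played - best_fixed


def internal_regret(actions, losses):
    """Largest gain from swapping every play of action i for action j."""
    n = len(losses[0])
    return max(
        # Only rounds where i was actually played contribute to the i -> j swap.
        sum(loss[i] - loss[j] for a, loss in zip(actions, losses) if a == i)
        for i in range(n)
        for j in range(n)
    )


# Toy sequence (assumed data): two actions, the player always picks the worse one.
actions = [0, 1, 0, 1]
losses = [[1, 0], [0, 1], [1, 0], [0, 1]]

ext = external_regret(actions, losses)
internal = internal_regret(actions, losses)
n_actions = len(losses[0])

# External regret against j is the sum over i of the i -> j swap regrets,
# so it can never exceed N times the largest single swap regret.
assert ext <= n_actions * internal
```

On this toy sequence both quantities equal 2, so the bound holds with room to spare. An O(\sqrt{T}) bound on internal regret therefore carries over to external regret up to the constant factor N.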
