Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

Claudio Gentile; Nicol\`o Cesa-Bianchi; Pierre Gaillard (SIERRA); S\'ebastien Gerchinovitz (IMT)

arxiv: 1702.08211 · v2 · pith:OSURO7QWnew · submitted 2017-02-27 · 📊 stat.ML · cs.LG· math.ST· stat.TH

Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

Nicol\`o Cesa-Bianchi , Pierre Gaillard (SIERRA) , Claudio Gentile , S\'ebastien Gerchinovitz (IMT) This is my paper

classification 📊 stat.ML cs.LGmath.STstat.TH

keywords feedbackchaininglipschitzlossesregretalgorithmicapproachauctions

0 comments

read the original abstract

We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit algorithm achieving the minimax regret rate (up to log factors). In a partial feedback model motivated by second-price auctions, we obtain algorithms for Lipschitz and semi-Lipschitz losses with regret bounds improving on the known bounds for standard bandit feedback. Our analysis combines novel results for contextual second-price auctions with a novel algorithmic approach based on chaining. When the context space is Euclidean, our chaining approach is efficient and delivers an even better regret bound.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Chaining Meets Chain Rule: Multilevel Entropic Regularization and Training of Neural Nets
cs.LG 2019-06 unverdicted novelty 6.0

Derives algorithm-dependent generalization bounds for neural nets using multilevel entropic regularization and proposes a Metropolis-simulated multi-scale Gibbs training procedure tested on a two-layer net for MNIST.