Online Optimization : Competing with Dynamic Comparators

Alexander Rakhlin; Ali Jadbabaie; Karthik Sridharan; Shahin Shahrampour

arxiv: 1501.06225 · v1 · pith:DGNOF476new · submitted 2015-01-26 · 💻 cs.LG · math.OC· stat.ML

Online Optimization : Competing with Dynamic Comparators

Ali Jadbabaie , Alexander Rakhlin , Shahin Shahrampour , Karthik Sridharan This is my paper

classification 💻 cs.LG math.OCstat.ML

keywords regretadaptivebenchmarkscomparatorsdynamicguaranteesonlineregularity

0 comments

read the original abstract

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop prediction methods that perform well against complex benchmarks. In this paper, we address these two directions together. We present a fully adaptive method that competes with dynamic benchmarks in which regret guarantee scales with regularity of the sequence of cost functions and comparators. Notably, the regret bound adapts to the smaller complexity measure in the problem environment. Finally, we apply our results to drifting zero-sum, two-player games where both players achieve no regret guarantees against best sequences of actions in hindsight.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Priced Motion Through Optimal Faces: A Normal-Fan Geometry for Non-Stationary Adversarial MDPs
cs.LG 2026-06 unverdicted novelty 7.0

Introduces priced face-crossing via normal-fan geometry on occupancy polytopes to decompose dynamic regret into intrinsic motion cost plus within-face error in non-stationary adversarial MDPs.