pith. machine review for the scientific record. sign in

arxiv: 1602.05394 · v2 · submitted 2016-02-17 · 📊 stat.ML · cs.LG· math.OC· math.ST· stat.TH

Recognition: unknown

Online optimization and regret guarantees for non-additive long-term constraints

Authors on Pith no claims yet
classification 📊 stat.ML cs.LGmath.OCmath.STstat.TH
keywords onlineconstraintslong-termnon-additiveregretadvertisingdatadisplay
0
0 comments X
read the original abstract

We consider online optimization in the 1-lookahead setting, where the objective does not decompose additively over the rounds of the online game. The resulting formulation enables us to deal with non-stationary and/or long-term constraints , which arise, for example, in online display advertising problems. We propose an on-line primal-dual algorithm for which we obtain dynamic cumulative regret guarantees. They depend on the convexity and the smoothness of the non-additive penalty, as well as terms capturing the smoothness with which the residuals of the non-stationary and long-term constraints vary over the rounds. We conduct experiments on synthetic data to illustrate the benefits of the non-additive penalty and show vanishing regret convergence on live traffic data collected by a display advertising platform in production.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. A Theoretical Framework for Auxiliary-Loss-Free Load Balancing of Sparse Mixture-of-Experts in Large-Scale AI Models

    math.OC 2025-12 unverdicted novelty 5.0

    The authors cast auxiliary-loss-free load balancing as a primal-dual assignment solver, prove structural properties in deterministic and online regimes, and report experiments on 1B-parameter DeepSeekMoE models.