A simpler approach to obtaining an O(1/t) convergence rate for the projected stochastic subgradient method

Francis Bach; Mark Schmidt; Simon Lacoste-Julien

arxiv: 1212.2002 · v2 · pith:VDG7VF3Jnew · submitted 2012-12-10 · 💻 cs.LG · math.OC· stat.ML

A simpler approach to obtaining an O(1/t) convergence rate for the projected stochastic subgradient method

Simon Lacoste-Julien , Mark Schmidt , Francis Bach This is my paper

classification 💻 cs.LG math.OCstat.ML

keywords convergenceeasymethodprojectedratestochasticsubgradientapproach

0 comments

read the original abstract

In this note, we present a new averaging technique for the projected stochastic subgradient method. By using a weighted average with a weight of t+1 for each iterate w_t at iteration t, we obtain the convergence rate of O(1/t) with both an easy proof and an easy implementation. The new scheme is compared empirically to existing techniques, with similar performance behavior.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Gradient Descent's Last Iterate is Often (slightly) Suboptimal
math.OC 2026-04 unverdicted novelty 8.0

Proves it is impossible to achieve optimal last-iterate rates for GD and SGD without knowing the horizon T in advance, incurring an unavoidable poly-log factor penalty even in the deterministic case.
Factor Augmented High-Dimensional SGD
stat.ML 2026-05 unverdicted novelty 6.0

Proposes Factor-Augmented SGD that runs on streaming high-dimensional data and supplies the first convergence analysis explicitly accounting for latent-factor estimation error.
Robust Learning Meets Quasar-Convex Optimization: Inexact High-Order Proximal-Point Methods
math.OC 2026-05 unverdicted novelty 5.0

Robust learning problems are formulated as quasar-convex optimization, and HiPPA is proposed as an inexact high-order proximal method with global and superlinear convergence guarantees.