Linearly convergent stochastic heavy ball method for minimizing generalization error

Nicolas Loizou; Peter Richt\'arik

arxiv: 1710.10737 · v2 · pith:OK6OAJFEnew · submitted 2017-10-30 · 🧮 math.OC · cs.LG· cs.NA· stat.ML

Linearly convergent stochastic heavy ball method for minimizing generalization error

Nicolas Loizou , Peter Richt\'arik This is my paper

classification 🧮 math.OC cs.LGcs.NAstat.ML

keywords ballheavymethodanalysislossminimizingstochasticamended

0 comments

read the original abstract

In this work we establish the first linear convergence result for the stochastic heavy ball method. The method performs SGD steps with a fixed stepsize, amended by a heavy ball momentum term. In the analysis, we focus on minimizing the expected loss and not on finite-sum minimization, which is typically a much harder problem. While in the analysis we constrain ourselves to quadratic loss, the overall objective is not necessarily strongly convex.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Perfect Parallelization in Mini-Batch SGD with Classical Momentum Acceleration
cs.LG 2026-05 unverdicted novelty 6.0

Classical momentum acceleration in mini-batch SGD for quadratics is proportional to batch size up to saturation, enabling perfect parallelization under minimal noise assumptions.