pith. sign in

arxiv: 1710.10737 · v2 · pith:OK6OAJFEnew · submitted 2017-10-30 · 🧮 math.OC · cs.LG· cs.NA· stat.ML

Linearly convergent stochastic heavy ball method for minimizing generalization error

classification 🧮 math.OC cs.LGcs.NAstat.ML
keywords ballheavymethodanalysislossminimizingstochasticamended
0
0 comments X
read the original abstract

In this work we establish the first linear convergence result for the stochastic heavy ball method. The method performs SGD steps with a fixed stepsize, amended by a heavy ball momentum term. In the analysis, we focus on minimizing the expected loss and not on finite-sum minimization, which is typically a much harder problem. While in the analysis we constrain ourselves to quadratic loss, the overall objective is not necessarily strongly convex.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Perfect Parallelization in Mini-Batch SGD with Classical Momentum Acceleration

    cs.LG 2026-05 unverdicted novelty 6.0

    Classical momentum acceleration in mini-batch SGD for quadratics is proportional to batch size up to saturation, enabling perfect parallelization under minimal noise assumptions.