Stochastic modified equations and adaptive stochastic gradient algorithms

Cheng Tai; Qianxiao Li; Weinan E

arxiv: 1511.06251 · v3 · pith:ZNNUJAW4new · submitted 2015-11-19 · 💻 cs.LG · stat.ML

keywords: stochastic, algorithms, equations, gradient, adaptive, modified, added, adjustment

Abstract

We develop the method of stochastic modified equations (SME), in which stochastic gradient algorithms are approximated in the weak sense by continuous-time stochastic differential equations. We exploit the continuous formulation together with optimal control theory to derive novel adaptive hyper-parameter adjustment policies. Our algorithms have competitive performance with the added benefit of being robust to varying models and datasets. This provides a general methodology for the analysis and design of stochastic gradient algorithms.
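The weak-sense approximation described in the abstract can be illustrated with a small numerical sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the one-dimensional quadratic objective f(x) = x^2/2, the additive Gaussian gradient noise of scale sigma, and the commonly quoted first-order form of the modified equation, dX = -f'(X) dt + sqrt(eta) * sigma dW, with SGD step k matched to time t = k * eta. The function names are hypothetical. Under these assumptions both second moments have closed forms, so the sketch compares E[x_k^2] for SGD with E[X_t^2] for the SDE and cross-checks each with a Monte Carlo estimate.

```python
import math
import random


def sgd_second_moment(x0, eta, sigma, steps):
    """Exact E[x_k^2] for x_{k+1} = x_k - eta * (x_k + sigma * xi_k), xi_k ~ N(0, 1)."""
    m = x0 * x0
    for _ in range(steps):
        # x_{k+1} = (1 - eta) * x_k - eta * sigma * xi_k, with xi_k independent of x_k
        m = (1.0 - eta) ** 2 * m + (eta * sigma) ** 2
    return m


def sme_second_moment(x0, eta, sigma, t):
    """Exact E[X_t^2] for the assumed SDE dX = -X dt + sqrt(eta) * sigma dW."""
    decay = math.exp(-2.0 * t)
    return x0 * x0 * decay + 0.5 * eta * sigma ** 2 * (1.0 - decay)


def simulate_sgd(x0, eta, sigma, steps, n_paths, rng):
    """Monte Carlo estimate of E[x_k^2] from independent SGD runs."""
    total = 0.0
    for _ in range(n_paths):
        x = x0
        for _ in range(steps):
            noisy_grad = x + sigma * rng.gauss(0.0, 1.0)
            x -= eta * noisy_grad
        total += x * x
    return total / n_paths


def simulate_sme(x0, eta, sigma, t, n_paths, rng, substeps=4):
    """Monte Carlo estimate of E[X_t^2] using Euler-Maruyama with dt = eta / substeps."""
    dt = eta / substeps
    n_steps = round(t / dt)
    diffusion = math.sqrt(eta) * sigma
    total = 0.0
    for _ in range(n_paths):
        x = x0
        for _ in range(n_steps):
            x += -x * dt + diffusion * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        total += x * x
    return total / n_paths
```

For example, calling `sgd_second_moment(1.0, 0.1, 1.0, 20)` and `sme_second_moment(1.0, 0.1, 1.0, 2.0)` compares the two models at the same time t = 20 * 0.1 = 2. Agreement of expectations of test functions such as x^2, rather than of individual sample paths, is what "in the weak sense" refers to.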

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Momentum Further Constrains Sharpness at the Edge of Stochastic Stability
cs.LG · 2026-04 · unverdicted · novelty 7.0

Momentum SGD exhibits two distinct EoSS regimes for batch sharpness, stabilizing at 2(1-β)/η for small batches and 2(1+β)/η for large batches, aligning with linear stability thresholds.

Deep learning applied to computational mechanics: A comprehensive review, state of the art, and the classics
cs.LG · 2022-12 · unverdicted · novelty 2.0

A comprehensive review of deep learning techniques for computational mechanics, including LSTM for constitutive modeling, PINNs for PDE solving, optimizers, and kernel methods.