Homogenization of sgd in high-dimensions: Exact dynamics and generalization properties

Courtney Paquette, Elliot Paquette, Ben Adlam, Jeffrey Pennington · 2022 · arXiv 2205.07069

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Homogenization of $\ell_2$-Adversarial Training in High-Dimensions: Exact Dynamics under Stochastic Gradient Descent

math.OC · 2026-06-30 · unverdicted · novelty 7.0

Derives ODE deterministic equivalents and an adversarial homogenized SDE for SGD iterates in high-dim ℓ2-adversarial training, showing no constant learning rate ensures monotone descent for single-class adversarial least squares and equivalence to adaptive regularized standard SGD.

The Role of Symmetry in Optimizing Overparameterized Networks

cs.LG · 2026-04-28 · unverdicted · novelty 6.0 · 2 refs

Overparameterization adds symmetries that precondition the Hessian for better minima and increase the probability mass of global minima near typical initializations.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Homogenization of $\ell_2$-Adversarial Training in High-Dimensions: Exact Dynamics under Stochastic Gradient Descent math.OC · 2026-06-30 · unverdicted · none · ref 50
Derives ODE deterministic equivalents and an adversarial homogenized SDE for SGD iterates in high-dim ℓ2-adversarial training, showing no constant learning rate ensures monotone descent for single-class adversarial least squares and equivalence to adaptive regularized standard SGD.
The Role of Symmetry in Optimizing Overparameterized Networks cs.LG · 2026-04-28 · unverdicted · none · ref 29 · 2 links
Overparameterization adds symmetries that precondition the Hessian for better minima and increase the probability mass of global minima near typical initializations.

Homogenization of sgd in high-dimensions: Exact dynamics and generalization properties

fields

years

verdicts

representative citing papers

citing papers explorer