Scalable optimization in the modular norm

Large, T · 2024 · arXiv 2405.14813

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Implicit Bias of Mirror Flow in Homogeneous Neural Networks: Sparse and Dense Feature Learning

cs.LG · 2026-05-19 · unverdicted · novelty 7.0

Mirror flow reaches max-margin solutions in homogeneous neural networks where the mirror map choice controls whether learned features are sparse or dense while convergence can be exponentially slow.

Training Deep Learning Models with Norm-Constrained LMOs

cs.LG · 2025-02-11 · unverdicted · novelty 7.0

Scion is a new stochastic LMO-based optimizer family that unifies existing methods, supports unconstrained problems, and delivers hyperparameter transferability plus speedups on nanoGPT training.

Old Optimizer, New Norm: An Anthology

cs.LG · 2024-09-30 · unverdicted · novelty 7.0

Optimizers like Adam reduce to steepest descent under particular norms, opening a design space of norm assignments tailored to layer roles.

citing papers explorer

Showing 3 of 3 citing papers.

Implicit Bias of Mirror Flow in Homogeneous Neural Networks: Sparse and Dense Feature Learning cs.LG · 2026-05-19 · unverdicted · none · ref 26
Mirror flow reaches max-margin solutions in homogeneous neural networks where the mirror map choice controls whether learned features are sparse or dense while convergence can be exponentially slow.
Training Deep Learning Models with Norm-Constrained LMOs cs.LG · 2025-02-11 · unverdicted · none · ref 193
Scion is a new stochastic LMO-based optimizer family that unifies existing methods, supports unconstrained problems, and delivers hyperparameter transferability plus speedups on nanoGPT training.
Old Optimizer, New Norm: An Anthology cs.LG · 2024-09-30 · unverdicted · none · ref 27
Optimizers like Adam reduce to steepest descent under particular norms, opening a design space of norm assignments tailored to layer roles.

Scalable optimization in the modular norm

fields

years

verdicts

representative citing papers

citing papers explorer