pith. sign in

Normalized gradients for all

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

years

2026 3 2025 2

verdicts

UNVERDICTED 5

roles

background 2

polarities

background 2

representative citing papers

Function-free Optimization via Comparison Oracles

math.OC · 2026-04-29 · unverdicted · novelty 7.0

Introduces a geometry-based framework for comparison-oracle optimization, with O(d log(d/ε)) comparisons for normal direction estimation and Õ(d D²/ε²) comparisons to reach ε level-set optimality gap under regularity, convexity, and growth conditions.

Training Deep Learning Models with Norm-Constrained LMOs

cs.LG · 2025-02-11 · unverdicted · novelty 7.0

Scion is a new stochastic LMO-based optimizer family that unifies existing methods, supports unconstrained problems, and delivers hyperparameter transferability plus speedups on nanoGPT training.

citing papers explorer

Showing 5 of 5 citing papers.

  • Stochastic Auto-conditioned Fast Gradient Methods with Optimal Rates math.OC · 2026-04-07 · unverdicted · none · ref 20

    Stochastic AC-FGM achieves optimal O(1/√ε) iteration complexity and O(1/ε²) sample complexity while being fully adaptive to smoothness, horizon, and noise under bounded conditional variance.

  • Function-free Optimization via Comparison Oracles math.OC · 2026-04-29 · unverdicted · none · ref 29

    Introduces a geometry-based framework for comparison-oracle optimization, with O(d log(d/ε)) comparisons for normal direction estimation and Õ(d D²/ε²) comparisons to reach ε level-set optimality gap under regularity, convexity, and growth conditions.

  • Glocal Smoothness: Line search and adaptive step sizes can help in theory too! math.OC · 2025-06-14 · unverdicted · none · ref 3

    Glocal smoothness enables iterate-independent complexity bounds showing line search and adaptive steps outperform fixed steps, with GD+line search sometimes beating accelerated GD.

  • Training Deep Learning Models with Norm-Constrained LMOs cs.LG · 2025-02-11 · unverdicted · none · ref 201

    Scion is a new stochastic LMO-based optimizer family that unifies existing methods, supports unconstrained problems, and delivers hyperparameter transferability plus speedups on nanoGPT training.

  • Optimal Projection-Free Adaptive SGD for Matrix Optimization math.OC · 2026-04-02 · unverdicted · none · ref 19

    Proving stability of Leon's preconditioner enables the first tuning-free Nesterov-accelerated projection-free adaptive SGD variant with improved non-smooth non-convex rates.