pith. sign in

Convergence and Dynamical Behavior of the ADAM Algorithm for Nonconvex Stochastic Optimization,

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

years

2026 3

verdicts

UNVERDICTED 3

clear filters

representative citing papers

Adam Converges in Nonsmooth Nonconvex Optimization

math.OC · 2026-06-21 · unverdicted · novelty 8.0

The paper establishes the first finite-time convergence rate of 1/T^{2/13} for classical Adam (with bias correction, no extra steps) in nonsmooth nonconvex optimization under heavy-tailed noise with β1=β2.

A Stochastic--Geometric Theory of Scaling Laws in Grokking

stat.ML · 2026-06-29 · unverdicted · novelty 6.0

A stochastic-geometric model of solution-space topology under Adam derives explicit scaling laws for grokking transition time as a function of learning rate, batch size, and L2 coefficient.

citing papers explorer

Showing 3 of 3 citing papers after filters.