pith. sign in

Adabelief optimizer: Adapting stepsizes by the belief in observed gradients

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

verdicts

UNVERDICTED 3

clear filters

representative citing papers

On the Convergence of Muon and Beyond

cs.LG · 2025-09-19 · unverdicted · novelty 7.0

Muon-MVR2 attains the optimal anytime convergence rate of ~O(T^{-1/3}) in stochastic non-convex settings under horizon-free schedules.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • On the Convergence of Muon and Beyond cs.LG · 2025-09-19 · unverdicted · none · ref 56

    Muon-MVR2 attains the optimal anytime convergence rate of ~O(T^{-1/3}) in stochastic non-convex settings under horizon-free schedules.