A High Probability Analysis of Adaptive SGD with Momentum

3 Pith papers cite this work. Polarity classification is still indexing.

Years: 2026 (3)

Verdicts: unverdicted (3)

representative citing papers

Robust and Fast Training via Per-Sample Clipping

math.OC · 2026-05-04 · unverdicted · novelty 6.0

PS-Clip-SGD achieves optimal in-expectation convergence rates for non-convex optimization under heavy-tailed gradient noise, with matching high-probability guarantees, and outperforms standard methods on AlexNet trained on CIFAR-100.
