Measuring Learning Progress via Gradient-Momentum Coupling
Pith reviewed 2026-05-08 14:38 UTC · model grok-4.3
The pith
Gradient-Momentum Coupling offers a noise-robust alternative to prediction error for measuring learning progress in curiosity-driven reinforcement learning by quantifying gradient-momentum alignment.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Experiments on MiniGrid suggest that replacing prediction error with GMC within existing curiosity-driven architectures can improve robustness to observation noise.
Load-bearing premise
That momentum's natural filtering of noise and oscillations reliably identifies samples contributing to ongoing parameter updates rather than merely reflecting optimization artifacts.
read the original abstract
Measuring learning progress is essential for curiosity-driven exploration in reinforcement learning, but widely used signals such as prediction error often fail to distinguish meaningful, learnable patterns from random noise. This paper proposes Gradient-Momentum Coupling (GMC), a signal derived from optimization dynamics that quantifies how useful each sample's gradient is for ongoing learning by measuring its per-parameter normalized absolute product with the momentum from previous gradients. By leveraging momentum's natural filtering of noise and oscillations, GMC identifies samples that contribute to ongoing parameter updates. Controlled experiments demonstrate noise robustness and emergent curriculum learning, with the signal prioritizing tasks by learning speed rather than difficulty. Experiments on MiniGrid suggest that replacing prediction error with GMC within existing curiosity-driven architectures can improve robustness to observation noise.
Editorial analysis
A structured set of objections, weighed in public.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Momentum from previous gradients naturally filters noise and oscillations in optimization trajectories.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.