GNMR: Runtime Stability Control for Low-Precision Large Language Model Training

Boao Kong; Engao Zhang; Guohong Li; Kun Yuan; Weichen Jia; Yao Wang; Yaoyuan Wang; Yonghan Dong; Yunke Peng

arxiv: 2606.00539 · v1 · pith:3R3RAISTnew · submitted 2026-05-30 · 💻 cs.LG · math.OC · stat.ML





keywords gnmr · training · stability · low-precision · control · controller · gradient · language


Training stability is a key bottleneck in low-precision language model training: efficient low-cost paths can still produce short-lived numerical risks at a small set of operators. We formulate this as runtime stability control and present Gradient Norm-to-Mean Ratio (GNMR), a lightweight controller that compares each recoverable unit's current gradient norm with its historical mean. Together with $\Delta$-GNMR for abrupt short-window increases, GNMR maps local risk signals to bounded recovery actions under a hard $\mathrm{maxO}$ budget and a short lock interval, without changing the numerical format, kernel, or backend recipe. Across activation-quantization stress, DeepSeek-style recipe-level training, and LLaMA-2 13B fine-tuning, GNMR preserves high-fidelity quality with sparse, budgeted recovery. These results support GNMR as a backend-agnostic controller to improve low-precision training stability while preserving low-cost execution.
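
The control loop described in the abstract can be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the abstract does not specify the thresholds, how the historical mean is maintained, the exact form of $\Delta$-GNMR, or what the recovery action is. Here the historical mean is assumed to be an exponential moving average, $\Delta$-GNMR is assumed to be the one-step change in the ratio, and "recovery" simply means the unit is returned in the set of units to run on a higher-precision path. The class name `GNMRController` and all parameter names and default values are hypothetical.

```python
class GNMRController:
    """Hypothetical sketch of a GNMR-style runtime controller.

    Every threshold and update rule here is an assumption made for
    illustration; none of them is taken from the paper.
    """

    def __init__(self, ratio_threshold=4.0, delta_threshold=2.0,
                 max_o=2, lock_steps=3, decay=0.9, eps=1e-12):
        self.ratio_threshold = ratio_threshold  # assumed GNMR trigger level
        self.delta_threshold = delta_threshold  # assumed Delta-GNMR trigger level
        self.max_o = max_o                      # hard cap on units in recovery
        self.lock_steps = lock_steps            # steps a unit stays in recovery
        self.decay = decay                      # EMA decay for the historical mean
        self.eps = eps
        self.mean = {}        # unit -> running mean of its gradient norm
        self.prev_ratio = {}  # unit -> GNMR observed at the previous step
        self.lock = {}        # unit -> remaining steps in recovery

    def step(self, grad_norms):
        """Take {unit: gradient norm}; return the set of units to recover."""
        # Units still inside their lock interval stay in recovery.
        active = set()
        for unit in list(self.lock):
            self.lock[unit] -= 1
            if self.lock[unit] > 0:
                active.add(unit)
            else:
                del self.lock[unit]

        candidates = []
        for unit, norm in grad_norms.items():
            mean = self.mean.get(unit)
            # GNMR: current gradient norm over its historical mean.
            ratio = 1.0 if mean is None else norm / (mean + self.eps)
            # Delta-GNMR (assumed form): one-step increase of the ratio.
            delta = ratio - self.prev_ratio.get(unit, ratio)
            self.prev_ratio[unit] = ratio
            if mean is None:
                self.mean[unit] = norm
            else:
                self.mean[unit] = self.decay * mean + (1.0 - self.decay) * norm
            risky = ratio > self.ratio_threshold or delta > self.delta_threshold
            if risky and unit not in active:
                candidates.append((ratio, unit))

        # Spend the remaining budget on the riskiest units first.
        for _, unit in sorted(candidates, reverse=True):
            if len(active) >= self.max_o:
                break
            self.lock[unit] = self.lock_steps
            active.add(unit)
        return active
```

In a training loop, one would call `step` once per iteration with the per-unit gradient norms and route the returned units through whatever higher-precision path the backend offers. Because the controller only returns a set of unit names, it leaves the numerical format and kernels untouched, which matches the backend-agnostic framing in the abstract.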
