pith:FBA4ZOPE
RMNP: Row-Momentum Normalized Preconditioning for Scalable Matrix-Based Optimization
RMNP replaces Newton-Schulz orthogonalization with row-wise L2 normalization to match Muon performance at linear cost.
arxiv:2603.20527 v3 · 2026-03-20 · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{FBA4ZOPERMBYKBNME7KYD25AIQ}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
RMNP delivers competitive optimization performance compared with Muon while substantially reducing preconditioning wall-clock time. We establish convergence guarantees for RMNP in the non-convex setting that match recent results for Muon optimizers, achieving the minimax optimal complexity.
The substitution is justified by the empirically observed diagonal block structure of the Transformer layerwise Hessian together with the claim that orthogonalization and row-wise (on input dim) ℓ2 normalization are asymptotically equivalent for transformers.
RMNP preconditions matrix updates via row-wise L2 normalization instead of Newton-Schulz iteration, reducing complexity to O(mn) while matching Muon's non-convex convergence rate and empirical performance.
References
Cited by
Receipt and verification
| First computed | 2026-05-18T02:45:04.674774Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
2841ccb9e48b038505ac27d581eba04438022926ba78350e921ded0a68330cb6
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/FBA4ZOPERMBYKBNME7KYD25AIQ \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 2841ccb9e48b038505ac27d581eba04438022926ba78350e921ded0a68330cb6
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "e92f3bf1d9c42d393e8d8a72ba71b92e73aa30dc0aa1e26c2d7b46caa8d26031",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.LG",
"submitted_at": "2026-03-20T21:55:28Z",
"title_canon_sha256": "24d13c333a5267bdfede30bb7ec0da9457bde5b3addf8f7c03aa4b45f0c37341"
},
"schema_version": "1.0",
"source": {
"id": "2603.20527",
"kind": "arxiv",
"version": 3
}
}