Curtis, and Jorge Nocedal
3 Pith papers cite this work.
Years: 2026 (3)
Verdicts: UNVERDICTED (3)
Citing papers

- Adaptive Selection of LoRA Components in Privacy-Preserving Federated Learning
  AS-LoRA adaptively chooses which LoRA factor to update per layer and round using a curvature-aware second-order score, eliminating reconstruction error floors and improving performance in DP federated learning.
- DynMuon: A Dynamic Spectral Shaping View of Muon
  DynMuon dynamically schedules the spectral shaping parameter p in Muon-like optimizers from positive to negative values, yielding lower validation loss and 10.6-26.5% fewer steps than standard Muon across tested settings.
- Learn-by-Wire Training Control Governance: Bounded Autonomous Training Under Stress for Stability and Efficiency
  LBW-Guard is a bounded autonomous control layer above AdamW that improves stability, reduces perplexity, and speeds up training for Qwen2.5 models under learning-rate stress on WikiText-103.