Milora: Harnessing mi- nor singular components for parameter-efficient llm fine- tuning.arXiv preprint arXiv:2406.09044

URLhttps://arxiv · 2024 · arXiv 2406.09044

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

read on arXiv browse 6 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting

cs.CV · 2026-04-20 · accept · novelty 7.0

DPW with a token-importance gating module and residual adapters achieves state-of-the-art performance in domain-class incremental learning for VLMs.

FuRA: Full-Rank Parameter-Efficient Fine-Tuning with Spectral Preconditioning

cs.LG · 2026-05-19 · unverdicted · novelty 6.0

FuRA uses block tensor-train factorization with fixed pretrained SVD basis to achieve full-rank spectral preconditioning, outperforming Full FT by +1.37 on LLaMA-3-8B commonsense reasoning and surpassing QLoRA in quantized settings.

Rotation-Preserving Supervised Fine-Tuning

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

RPSFT improves the in-domain versus out-of-domain performance trade-off during LLM supervised fine-tuning by penalizing rotations in pretrained singular subspaces as a proxy for loss-sensitive directions.

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

cs.LG · 2026-06-01 · unverdicted · novelty 5.0

PEFT adapters are positioned as persistent personal state on foundation models, organized via Scale Up, Scale Down, and Scale Out axes, with MinT as an infrastructure example for managing them.

LoCO: Low-rank Compositional Rotation Fine-tuning

cs.LG · 2026-05-15 · unverdicted · novelty 5.0

LoCO is a PEFT technique that constructs orthogonal transformations via low-rank skew-symmetric matrices and compositional rotation chains with a parallelizable approximation, validated on transformer adaptations.

VLA-GSE: Boosting Parameter-Efficient Fine-Tuning in VLA with Generalized and Specialized Experts

cs.RO · 2026-05-07 · unverdicted · novelty 5.0

VLA-GSE uses spectral decomposition of the VLA backbone to create generalized and specialized experts, enabling effective robot task adaptation while updating only 2.51% of parameters and achieving 81.2% zero-shot success on LIBERO-Plus.

citing papers explorer

Showing 6 of 6 citing papers after filters.

Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting cs.CV · 2026-04-20 · accept · none · ref 36
DPW with a token-importance gating module and residual adapters achieves state-of-the-art performance in domain-class incremental learning for VLMs.
FuRA: Full-Rank Parameter-Efficient Fine-Tuning with Spectral Preconditioning cs.LG · 2026-05-19 · unverdicted · none · ref 47
FuRA uses block tensor-train factorization with fixed pretrained SVD basis to achieve full-rank spectral preconditioning, outperforming Full FT by +1.37 on LLaMA-3-8B commonsense reasoning and surpassing QLoRA in quantized settings.
Rotation-Preserving Supervised Fine-Tuning cs.LG · 2026-05-08 · unverdicted · none · ref 40
RPSFT improves the in-domain versus out-of-domain performance trade-off during LLM supervised fine-tuning by penalizing rotations in pretrained singular subspaces as a proxy for loss-sensitive directions.
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters cs.LG · 2026-06-01 · unverdicted · none · ref 31
PEFT adapters are positioned as persistent personal state on foundation models, organized via Scale Up, Scale Down, and Scale Out axes, with MinT as an infrastructure example for managing them.
LoCO: Low-rank Compositional Rotation Fine-tuning cs.LG · 2026-05-15 · unverdicted · none · ref 49
LoCO is a PEFT technique that constructs orthogonal transformations via low-rank skew-symmetric matrices and compositional rotation chains with a parallelizable approximation, validated on transformer adaptations.
VLA-GSE: Boosting Parameter-Efficient Fine-Tuning in VLA with Generalized and Specialized Experts cs.RO · 2026-05-07 · unverdicted · none · ref 34
VLA-GSE uses spectral decomposition of the VLA backbone to create generalized and specialized experts, enabling effective robot task adaptation while updating only 2.51% of parameters and achieving 81.2% zero-shot success on LIBERO-Plus.

Milora: Harnessing mi- nor singular components for parameter-efficient llm fine- tuning.arXiv preprint arXiv:2406.09044

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer