Mora: High-rank updating for parameter- efficient fine-tuning

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning , author= · 2024 · arXiv 2405.12130

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

read on arXiv browse 9 citing papers

citation-role summary

other 1

citation-polarity summary

unclear 1

representative citing papers

LESSViT: Robust Hyperspectral Representation Learning under Spectral Configuration Shift

cs.CV · 2026-05-18 · unverdicted · novelty 6.0

LESSViT introduces a low-rank efficient spatial-spectral attention mechanism and a hyperspectral masked autoencoder to improve generalization across spectral configuration shifts in hyperspectral imagery.

Compared to What? Baselines and Metrics for Counterfactual Prompting

cs.CL · 2026-05-01 · conditional · novelty 6.0

Counterfactual prompting effects on LLMs are often indistinguishable from those caused by meaning-preserving paraphrases, causing most previously reported demographic sensitivities to disappear under proper statistical comparison.

BaLoRA: Bayesian Low-Rank Adaptation of Large Scale Models

cs.LG · 2026-04-27 · unverdicted · novelty 6.0

BaLoRA is a Bayesian LoRA variant with input-adaptive noise that improves accuracy over standard LoRA and supplies well-calibrated uncertainty estimates on language, vision, and scientific prediction tasks.

ScaLoRA: Optimally Scaled Low-Rank Adaptation for Efficient High-Rank Fine-Tuning

cs.LG · 2025-10-27 · unverdicted · novelty 6.0

ScaLoRA analytically derives per-update column scalings that let low-rank increments accumulate into high-rank weight updates, yielding faster convergence and higher accuracy than prior LoRA variants on LLMs up to 12B parameters.

TeRA: Vector-based Random Tensor Network for High-Rank Adaptation of Large Language Models

cs.LG · 2025-09-03 · unverdicted · novelty 6.0

TeRA parametrizes high-rank LLM weight updates via a random Tucker-like tensor network with shared frozen factors and layer-specific scaling vectors, matching high-rank adapter performance at vector-level parameter counts.

The Hidden Power of Scaling Factor in LoRA Optimization

cs.AI · 2026-06-11 · unverdicted · novelty 5.0

Alpha in LoRA outperforms learning-rate scaling, follows a square-root law with rank, and enables a minimalist LoRA-alpha method that improves performance across tasks.

SMoA: Spectrum Modulation Adapter for Parameter-Efficient Fine-Tuning

cs.LG · 2026-05-20 · unverdicted · novelty 5.0

SMoA is a new PEFT adapter that uses block-wise Hadamard-modulated low-rank branches on spectral partitions to cover more pretrained spectral directions than standard LoRA under a smaller parameter budget.

LoCO: Low-rank Compositional Rotation Fine-tuning

cs.LG · 2026-05-15 · unverdicted · novelty 5.0

LoCO is a PEFT technique that constructs orthogonal transformations via low-rank skew-symmetric matrices and compositional rotation chains with a parallelizable approximation, validated on transformer adaptations.

On the Convergence Analysis of Muon

stat.ML · 2025-05-29 · unverdicted · novelty 5.0

Convergence analysis shows Muon outperforms gradient descent by exploiting low-rank structure in neural network Hessians.

citing papers explorer

Showing 9 of 9 citing papers.

LESSViT: Robust Hyperspectral Representation Learning under Spectral Configuration Shift cs.CV · 2026-05-18 · unverdicted · none · ref 9
LESSViT introduces a low-rank efficient spatial-spectral attention mechanism and a hyperspectral masked autoencoder to improve generalization across spectral configuration shifts in hyperspectral imagery.
Compared to What? Baselines and Metrics for Counterfactual Prompting cs.CL · 2026-05-01 · conditional · none · ref 95
Counterfactual prompting effects on LLMs are often indistinguishable from those caused by meaning-preserving paraphrases, causing most previously reported demographic sensitivities to disappear under proper statistical comparison.
BaLoRA: Bayesian Low-Rank Adaptation of Large Scale Models cs.LG · 2026-04-27 · unverdicted · none · ref 38
BaLoRA is a Bayesian LoRA variant with input-adaptive noise that improves accuracy over standard LoRA and supplies well-calibrated uncertainty estimates on language, vision, and scientific prediction tasks.
ScaLoRA: Optimally Scaled Low-Rank Adaptation for Efficient High-Rank Fine-Tuning cs.LG · 2025-10-27 · unverdicted · none · ref 26
ScaLoRA analytically derives per-update column scalings that let low-rank increments accumulate into high-rank weight updates, yielding faster convergence and higher accuracy than prior LoRA variants on LLMs up to 12B parameters.
TeRA: Vector-based Random Tensor Network for High-Rank Adaptation of Large Language Models cs.LG · 2025-09-03 · unverdicted · none · ref 22
TeRA parametrizes high-rank LLM weight updates via a random Tucker-like tensor network with shared frozen factors and layer-specific scaling vectors, matching high-rank adapter performance at vector-level parameter counts.
The Hidden Power of Scaling Factor in LoRA Optimization cs.AI · 2026-06-11 · unverdicted · none · ref 30
Alpha in LoRA outperforms learning-rate scaling, follows a square-root law with rank, and enables a minimalist LoRA-alpha method that improves performance across tasks.
SMoA: Spectrum Modulation Adapter for Parameter-Efficient Fine-Tuning cs.LG · 2026-05-20 · unverdicted · none · ref 19
SMoA is a new PEFT adapter that uses block-wise Hadamard-modulated low-rank branches on spectral partitions to cover more pretrained spectral directions than standard LoRA under a smaller parameter budget.
LoCO: Low-rank Compositional Rotation Fine-tuning cs.LG · 2026-05-15 · unverdicted · none · ref 23
LoCO is a PEFT technique that constructs orthogonal transformations via low-rank skew-symmetric matrices and compositional rotation chains with a parallelizable approximation, validated on transformer adaptations.
On the Convergence Analysis of Muon stat.ML · 2025-05-29 · unverdicted · none · ref 10
Convergence analysis shows Muon outperforms gradient descent by exploiting low-rank structure in neural network Hessians.

Mora: High-rank updating for parameter- efficient fine-tuning

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer