Evaluating Parameter Efficient Methods for RLVR · 2025 · arXiv 2512.23165

6 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: background 1

citation-polarity summary: background 1

representative citing papers

ARCA: Adapter-Residual Credit Assignment When Token Signals Degenerate

cs.LG · 2026-05-29 · unverdicted · novelty 7.0

ARCA assigns token credit in LoRA-based LLM RL from the norm of adapter-induced hidden state changes, yielding non-degenerate distributions and competitive performance on MATH tasks with Qwen3-1.7B under GRPO.

BoostLoRA: Growing Effective Rank by Boosting Adapters

cs.LG · 2026-04-30 · unverdicted · novelty 7.0

BoostLoRA grows effective adapter rank linearly via iterative boosting on hard examples with orthogonal low-rank updates, outperforming both single-shot ultra-low-rank adapters and full fine-tuning on math and code tasks with zero added inference overhead.

FuRA: Full-Rank Parameter-Efficient Fine-Tuning with Spectral Preconditioning

cs.LG · 2026-05-19 · unverdicted · novelty 6.0

FuRA uses block tensor-train factorization with a fixed pretrained SVD basis to achieve full-rank spectral preconditioning, outperforming Full FT by +1.37 on LLaMA-3-8B commonsense reasoning and surpassing QLoRA in quantized settings.

The Hidden Power of Scaling Factor in LoRA Optimization

cs.AI · 2026-06-11 · unverdicted · novelty 5.0

Alpha in LoRA outperforms learning-rate scaling, follows a square-root law with rank, and enables a minimalist LoRA-alpha method that improves performance across tasks.

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

cs.LG · 2026-06-01 · unverdicted · novelty 5.0

PEFT adapters are positioned as persistent personal state on foundation models, organized via Scale Up, Scale Down, and Scale Out axes, with MinT as an infrastructure example for managing them.

Teaching LLMs Brazilian Healthcare: Injecting Knowledge from Official Clinical Guidelines

cs.CL · 2026-05-01 · unverdicted · novelty 4.0

A 14B model trained on synthetic data from Brazilian clinical guidelines outperforms larger LLMs on new benchmarks for Brazilian healthcare protocols.

citing papers explorer

ARCA: Adapter-Residual Credit Assignment When Token Signals Degenerate · cs.LG · 2026-05-29 · unverdicted · none · ref 39
BoostLoRA: Growing Effective Rank by Boosting Adapters · cs.LG · 2026-04-30 · unverdicted · none · ref 36
FuRA: Full-Rank Parameter-Efficient Fine-Tuning with Spectral Preconditioning · cs.LG · 2026-05-19 · unverdicted · none · ref 50
The Hidden Power of Scaling Factor in LoRA Optimization · cs.AI · 2026-06-11 · unverdicted · none · ref 124
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters · cs.LG · 2026-06-01 · unverdicted · none · ref 38
Teaching LLMs Brazilian Healthcare: Injecting Knowledge from Official Clinical Guidelines · cs.CL · 2026-05-01 · unverdicted · none · ref 33
