pith. sign in

Chongli Qin and Jost Tobias Springenberg

11 Pith papers cite this work. Polarity classification is still indexing.

11 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

years

2026 9 2025 2

roles

background 2

polarities

background 2

clear filters

representative citing papers

Rotation-Preserving Supervised Fine-Tuning

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

RPSFT improves the in-domain versus out-of-domain performance trade-off during LLM supervised fine-tuning by penalizing rotations in pretrained singular subspaces as a proxy for loss-sensitive directions.

PriFT: Prior-Support Guided Supervised Fine-Tuning

cs.CL · 2026-06-08 · unverdicted · novelty 5.0

PriFT uses token reweighting signals from a frozen pretrained model to stabilize SFT and achieve better results than standard SFT baselines on reasoning tasks.

Proximal Supervised Fine-Tuning

cs.LG · 2025-08-25 · unverdicted · novelty 5.0

PSFT modifies supervised fine-tuning by incorporating trust-region ideas from RL to constrain policy changes, yielding better out-of-domain generalization in math and human-value tasks without entropy collapse.

Compatibility-Aware Dynamic Fine-Tuning for Large Language Models

cs.CL · 2026-04-22 · conditional · novelty 4.0

CADFT improves supervised fine-tuning of large language models by dynamically down-weighting training samples whose low model-likelihood indicates high gradient variance, yielding better stability and generalization.

citing papers explorer

Showing 2 of 2 citing papers after filters.

  • PriFT: Prior-Support Guided Supervised Fine-Tuning cs.CL · 2026-06-08 · unverdicted · none · ref 18

    PriFT uses token reweighting signals from a frozen pretrained model to stabilize SFT and achieve better results than standard SFT baselines on reasoning tasks.

  • Compatibility-Aware Dynamic Fine-Tuning for Large Language Models cs.CL · 2026-04-22 · conditional · none · ref 17

    CADFT improves supervised fine-tuning of large language models by dynamically down-weighting training samples whose low model-likelihood indicates high gradient variance, yielding better stability and generalization.