RPSFT improves the in-domain versus out-of-domain performance trade-off during LLM supervised fine-tuning by penalizing rotations in pretrained singular subspaces as a proxy for loss-sensitive directions.
and Soures, Nicholas and Kudithipudi, Dhireesha , year=
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3verdicts
UNVERDICTED 3representative citing papers
Reinforcement learning learns state-dependent parametrization components in idealized climate models that outperform static tuning across several testbeds.
TRACED is a biophysical model that quantifies cell size distribution, extracellular diffusivity, tortuosity, and cell density in human gliomas from time-dependent diffusion MRI data.
citing papers explorer
-
Rotation-Preserving Supervised Fine-Tuning
RPSFT improves the in-domain versus out-of-domain performance trade-off during LLM supervised fine-tuning by penalizing rotations in pretrained singular subspaces as a proxy for loss-sensitive directions.
-
Replacing Tunable Parameters in Weather and Climate Models with State-Dependent Functions using Reinforcement Learning
Reinforcement learning learns state-dependent parametrization components in idealized climate models that outperform static tuning across several testbeds.
-
TRACED: In vivo imaging of extracellular intrinsic diffusivity, tortuosity, cell size distribution and cell density in human glioma patients
TRACED is a biophysical model that quantifies cell size distribution, extracellular diffusivity, tortuosity, and cell density in human gliomas from time-dependent diffusion MRI data.