Instructions trigger a production-centered mechanism in language models, with task-specific information stable in input tokens but varying strongly in output tokens and correlating with behavior.
Zhao and Kelvin Guu and Adams Wei Yu and Brian Lester and Nan Du and Andrew M
3 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CL 3years
2026 3representative citing papers
SAMoRA is a parameter-efficient fine-tuning framework that uses semantic-aware routing and task-adaptive scaling within a Mixture of LoRA Experts to improve multi-task performance and generalization over prior methods.
CADFT improves supervised fine-tuning of large language models by dynamically down-weighting training samples whose low model-likelihood indicates high gradient variance, yielding better stability and generalization.
citing papers explorer
-
Instructions Shape Production of Language, not Processing
Instructions trigger a production-centered mechanism in language models, with task-specific information stable in input tokens but varying strongly in output tokens and correlating with behavior.
-
SAMoRA: Semantic-Aware Mixture of LoRA Experts for Task-Adaptive Learning
SAMoRA is a parameter-efficient fine-tuning framework that uses semantic-aware routing and task-adaptive scaling within a Mixture of LoRA Experts to improve multi-task performance and generalization over prior methods.
-
Compatibility-Aware Dynamic Fine-Tuning for Large Language Models
CADFT improves supervised fine-tuning of large language models by dynamically down-weighting training samples whose low model-likelihood indicates high gradient variance, yielding better stability and generalization.