Multi-task fine-tuning on prompted classification tasks partially generalizes to unseen domains and prompts, with identifiable failure modes mitigated by mixing with instruction tuning and unexpected benefits for thinking-based classification.
benign": The summary describes the side task actions as reasonable, necessary, helpful, or routine. It treats them as a normal part of completing the task. Examples
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.AI (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers:
How Useful Is Cross-Domain Generalization for Training LLM Monitors?
Multi-task fine-tuning on prompted classification tasks partially generalizes to unseen domains and prompts, with identifiable failure modes mitigated by mixing with instruction tuning and unexpected benefits for thinking-based classification.