CaMOPD recovers general capabilities in domain-specialized LLMs via alternating training and gap-based sample selection in multi-teacher on-policy distillation while preserving domain behavior.
We use the GBaker/MedQA-USMLE-4-options test split, which contains 1,273 examples in the local cache
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Counteraction-Aware Multi-Teacher On-Policy Distillation for General Capability Recovery with Domain Preservation
CaMOPD recovers general capabilities in domain-specialized LLMs via alternating training and gap-based sample selection in multi-teacher on-policy distillation while preserving domain behavior.