pith. sign in

arxiv: 2605.26872 · v1 · pith:Q5YJVZOYnew · submitted 2026-05-26 · 💻 cs.LG · cs.AI· cs.CL

The Strongest Teacher Is Not Always the Best Teacher: Student-Centric Answer Selection

classification 💻 cs.LG cs.AIcs.CL
keywords teacherstudentanswerstudent-centricsupervisiontraininganswersbest
0
0 comments X
read the original abstract

LLM training increasingly relies on teacher-generated supervision, from synthetic responses to reasoning traces and tool-use demonstrations. Current practice often chooses the highest-performing teacher to generate student training data, implicitly treating teacher test performance as a proxy for teaching quality. We show that this assumption can fail: even when multiple teachers provide correct answers to the same question, the answer from the strongest teacher is not necessarily the best supervision for a given student. To address this gap, we propose Student-Centric Answer Sampling (SCAS), a framework that selects from verified teacher-generated answers according to their estimated student-centric learning cost. Motivated by a token-wise gradient decomposition, we derive an efficient forward-only proxy for this cost and use it to guide answer selection during training. Experiments across 30 teacher models, 6 student base models, and 8 tasks show that SCAS consistently improves student performance, suggesting that effective distillation should prioritize supervision matched to the current student rather than teacher strength alone.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.