Teaching-assistant-in-the-loop: Improving knowledge distillation from imperfect teacher models in low-budget scenarios. arXiv preprint arXiv:2406.05322

Yuhang Zhou, Wei Ai · arXiv 2406.05322

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

cs.LG · 2026-05-31 · unverdicted · novelty 7.0

OmniOPD replaces token-level logit matching in on-policy distillation with Monte Carlo chunk-level semantic verification and a peak-entropy scheduler.

citing papers explorer

Showing 1 of 1 citing paper.

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

cs.LG · 2026-05-31 · unverdicted · none · ref 37
