← back to paper
arxiv: 2605.03677 · 2 revisions
Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe