EMNLP 2025 (Oral)

Seongryong Jung, Suwan Yoon, DongGeon Kim, Hwanhee Lee · 2025 · arXiv 2505.16297

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

ADWIN: Adaptive Windows for Horizon-Aware On-Policy Distillation

cs.LG · 2026-05-27 · unverdicted · novelty 6.0

ADWIN adaptively selects training horizons in on-policy distillation via prefix alignment checks, cutting end-to-end cost by up to 4.1x while matching or exceeding full-rollout accuracy on math and code benchmarks.

When Are Teacher Tokens Reliable? Position-Weighted On-Policy Self-Distillation for Reasoning

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

Position-Weighted On-Policy Self-Distillation (PW-OPSD) weights later tokens more heavily after a diagnostic shows position predicts teacher reliability better than entropy, yielding +1.0 and +1.1 Avg@12 gains on AIME 2024/2025.

citing papers explorer

ADWIN: Adaptive Windows for Horizon-Aware On-Policy Distillation
cs.LG · 2026-05-27 · unverdicted · none · ref 18
When Are Teacher Tokens Reliable? Position-Weighted On-Policy Self-Distillation for Reasoning
cs.LG · 2026-05-20 · unverdicted · none · ref 18
