pith. sign in

← back to paper

Review history

arxiv: 2605.18141 · 2 revisions

A Brief Overview: On-Policy Self-Distillation In Large Language Models

  1. 2026-05-22 UNVERDICTED LOW v0.9.0 novelty 2.0
    42495 ms 5746 in 1116 out 2026-05-22T09:48:09.204199+00:00
  2. 2026-05-20 UNVERDICTED LOW v0.9.0 novelty 2.0
    49447 ms 5746 in 1192 out 2026-05-20T09:01:37.308250+00:00