arXiv: 2605.11299 · 2 revisions
Primal Generation, Dual Judgment: Self-Training from Test-Time Scaling