Title resolution pending

On-Policy Distillation: An Effective · 2025 · DOI 10.64480/xwxw-9c67

1 Pith paper cites this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

LatentRevise: Learning from Zero-Hit Reasoning

cs.CL · 2026-06-29 · unverdicted · novelty 6.0

LatentRevise performs first-order optimization on reasoning prefix embeddings from failed rollouts to generate longer, self-reflective, correct trajectories that improve SFT and RLVR performance on math benchmarks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

LatentRevise: Learning from Zero-Hit Reasoning

cs.CL · 2026-06-29 · unverdicted · none · ref 22
