Here is the corrected version

Do not output meta-phrases like "Here is the corrected version" G

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Surgical Post-Training: Proximal On-Policy Distillation for Reasoning with Knowledge Retention

cs.CL · 2026-03-02 · unverdicted · novelty 6.0

SPOT combines data rectification via minimal Oracle edits with a KL-constrained reward objective to improve Qwen3-8B reasoning accuracy by 6.2% using only 4k math pairs while preserving old knowledge.

citing papers explorer

Showing 1 of 1 citing paper.

Surgical Post-Training: Proximal On-Policy Distillation for Reasoning with Knowledge Retention cs.CL · 2026-03-02 · unverdicted · none · ref 14
SPOT combines data rectification via minimal Oracle edits with a KL-constrained reward objective to improve Qwen3-8B reasoning accuracy by 6.2% using only 4k math pairs while preserving old knowledge.

Here is the corrected version

fields

years

verdicts

representative citing papers

citing papers explorer