This benchmark evaluates the live code generation capabilities of large language models, emphasizing immediate correctness and practical programming skills.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
ATTNPO: Attention-Guided Process Supervision for Efficient Reasoning
ATTNPO guides process-supervised RL with intrinsic attention signals to shorten reasoning traces while raising accuracy on nine benchmarks.