This benchmark is designed to evaluate the live code generation capabilities of large language models, emphasizing immediate correctness and practical programming skills

LiveCodeBench(Jain et al · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

cs.CL · 2026-02-10 · unverdicted · novelty 6.0

ATTNPO guides process-supervised RL with intrinsic attention signals to shorten reasoning traces while raising accuracy on nine benchmarks.

Showing 1 of 1 citing paper.

ATTNPO: Attention-Guided Process Supervision for Efficient Reasoning cs.CL · 2026-02-10 · unverdicted · none · ref 14
ATTNPO guides process-supervised RL with intrinsic attention signals to shorten reasoning traces while raising accuracy on nine benchmarks.