In this section, you must clearly list each degradation method you used, and for each one, pinpoint exactly how, where, and why you altered the original reasoning.

Generate Explanation: Create a concise ‘[Explanation of Degradation]’

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

ReCode: Reinforcing Code Generation with Reasoning-Process Rewards

cs.SE · 2025-08-07 · unverdicted · novelty 6.0 · ref 15

ReCode is a new RL framework combining contrastive reasoning-process reward learning with consistency-gated GRPO to improve code generation, yielding a 16.1% gain for a 7B model to match GPT-4-Turbo levels on benchmarks.
