Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

fields

cs.LG: 2 · cs.SE: 2

years

2026: 3 · 2025: 1

polarities

background 2

representative citing papers

SWE-Shepherd: Advancing PRMs for Reinforcing Code Agents

cs.SE · 2026-04-12 · unverdicted · novelty 5.0

SWE-Shepherd trains a lightweight PRM on SWE-Bench trajectories to score intermediate actions and guide code agents, showing gains in efficiency and action quality on SWE-Bench Verified.
