arXiv preprint arXiv:2512.21919 , year=

SWE-RM: Execution-free Feedback For Software Engineering Agents , author= · arXiv 2512.21919

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

AgentLens: Revealing The Lucky Pass Problem in SWE-Agent Evaluation

cs.SE · 2026-05-13 · conditional · novelty 7.0

10.7% of passing SWE-agent trajectories are Lucky Passes with chaotic behaviors, and a quality score based on process references changes model rankings across eight backends.

citing papers explorer

Showing 1 of 1 citing paper.

AgentLens: Revealing The Lucky Pass Problem in SWE-Agent Evaluation cs.SE · 2026-05-13 · conditional · none · ref 30
10.7% of passing SWE-agent trajectories are Lucky Passes with chaotic behaviors, and a quality score based on process references changes model rankings across eight backends.

arXiv preprint arXiv:2512.21919 , year=

fields

years

verdicts

representative citing papers

citing papers explorer