Swe-replay: Efficient test-time scaling for software engineering agents,

· 2026 · arXiv 2601.22129

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

SWE-Doctor: Guiding Software Engineering Agents with Runtime Diagnosis from Multi-Faceted Bug Reproduction Tests

cs.SE · 2026-07-01 · unverdicted · novelty 7.0

SWE-Doctor generates multi-faceted BRTs, derives runtime diagnoses from their executions, and uses the diagnoses to guide patch generation, raising average resolution rates to 75.7% on SWE-bench Verified and 59.4% on SWE-bench Pro.

EvoRepair: Enhancing Vulnerability Repair Agents Through Experience-Based Self-Evolution

cs.SE · 2026-05-28 · unverdicted · novelty 7.0

EvoRepair is the first experience-based self-evolving agent framework for automated vulnerability repair, reporting 90.46% overall success on PATCHEVAL and SEC-bench benchmarks.

FastContext: Training Efficient Repository Explorer for Coding Agents

cs.SE · 2026-06-12 · unverdicted · novelty 5.0

FastContext adds a dedicated exploration subagent with specialized models trained on reference trajectories and task rewards, cutting token consumption up to 60% and lifting resolution rates up to 5.5% on SWE-bench variants.

From Question Answering to Task Completion: A Survey on Agent System and Harness Design

cs.AI · 2026-06-14 · unverdicted · novelty 4.0

Survey framing LLM agents as model-plus-harness systems, decomposing harness responsibilities, mapping them to tasks, and highlighting open challenges in evaluation, safety, and co-evolution.

citing papers explorer

Showing 4 of 4 citing papers.

SWE-Doctor: Guiding Software Engineering Agents with Runtime Diagnosis from Multi-Faceted Bug Reproduction Tests cs.SE · 2026-07-01 · unverdicted · none · ref 17
SWE-Doctor generates multi-faceted BRTs, derives runtime diagnoses from their executions, and uses the diagnoses to guide patch generation, raising average resolution rates to 75.7% on SWE-bench Verified and 59.4% on SWE-bench Pro.
EvoRepair: Enhancing Vulnerability Repair Agents Through Experience-Based Self-Evolution cs.SE · 2026-05-28 · unverdicted · none · ref 18
EvoRepair is the first experience-based self-evolving agent framework for automated vulnerability repair, reporting 90.46% overall success on PATCHEVAL and SEC-bench benchmarks.
FastContext: Training Efficient Repository Explorer for Coding Agents cs.SE · 2026-06-12 · unverdicted · none · ref 1
FastContext adds a dedicated exploration subagent with specialized models trained on reference trajectories and task rewards, cutting token consumption up to 60% and lifting resolution rates up to 5.5% on SWE-bench variants.
From Question Answering to Task Completion: A Survey on Agent System and Harness Design cs.AI · 2026-06-14 · unverdicted · none · ref 184
Survey framing LLM agents as model-plus-harness systems, decomposing harness responsibilities, mapping them to tasks, and highlighting open challenges in evaluation, safety, and co-evolution.

Swe-replay: Efficient test-time scaling for software engineering agents,

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer