Agents in software engineering: survey, landscape, and vision

· 2025 · DOI 10.1007/s10515-025-00544-2

2 Pith papers cite this work. Polarity classification is still indexing.



representative citing papers

Are Performance-Optimization Benchmarks Reliably Measuring Coding Agents?

cs.SE · 2026-07-01 · unverdicted · novelty 6.0

An audit of GSO, SWE-Perf, and SWE-fficiency finds that reference patches satisfy validity rules across machines for only 39/102, 11/140, and 411/498 tasks, respectively; that public submissions beat references on 85.3% of replay-valid tasks; and that scoring rules cause ranking disagreements.

Human oversight of agentic systems in practice: Examining the oversight work, challenges, and heuristics of developers using software agents

cs.SE · 2026-06-03 · unverdicted · novelty 6.0

Exploratory interview study with 17 developers identifies four forms of emergent oversight work for software agents and documents situated challenges and heuristics.

citing papers explorer


Are Performance-Optimization Benchmarks Reliably Measuring Coding Agents?

cs.SE · 2026-07-01 · unverdicted · none · ref 18

Human oversight of agentic systems in practice: Examining the oversight work, challenges, and heuristics of developers using software agents

cs.SE · 2026-06-03 · unverdicted · none · ref 142
