Communications of the ACM

Symbolic execution for software testing: three decades later · 2013

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

SWE-Mutation: Can LLMs Generate Reliable Test Suites in Software Engineering?

cs.SE · 2026-05-21 · unverdicted · novelty 6.0

SWE-Mutation benchmark shows current LLMs achieve low verification (10.20%) and detection (36.15%) rates on 2,636 mutated variants, exposing weaknesses in generating reliable test suites.

Learning Correct Behavior from Examples: Validating Sequential Execution in Autonomous Agents

cs.AI · 2026-05-04 · unverdicted · novelty 5.0

A new algorithm learns correct agent behavior models from few traces by combining dominator analysis, LLMs, and automata to validate sequential executions with high accuracy.

citing papers explorer

SWE-Mutation: Can LLMs Generate Reliable Test Suites in Software Engineering? cs.SE · 2026-05-21 · unverdicted · none · ref 5
Learning Correct Behavior from Examples: Validating Sequential Execution in Autonomous Agents cs.AI · 2026-05-04 · unverdicted · none · ref 11
