Canonical reference

Llm-based multi-agent systems for software engineering: Literature review, vision, and the road ahead.ACM Trans.Softw

Junda He, Christoph Treude, David Lo · 2025 · ACM Transactions on Software Engineering and Methodology · DOI 10.1145/3712003

Canonical reference. 100% of citing Pith papers cite this work as background.

14 Pith papers citing it

108 external citations · Crossref

Background 100% of classified citations

open at publisher browse 14 citing papers

citation-role summary

background 5

citation-polarity summary

background 5

representative citing papers

Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions

cs.SE · 2026-05-07 · conditional · novelty 8.0

LLMs frequently specify library versions with known CVEs in generated code (36-56% of tasks), show low compatibility (20-63%), and converge on the same risky versions across models.

The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering

cs.SE · 2025-07-20 · conditional · novelty 8.0

AIDev is a new open dataset of 456k AI-agent pull requests showing agents submit code faster than humans but with lower acceptance rates and simpler changes.

Memory-Augmented LLM-based Multi-Agent System for Automated Feature Generation on Tabular Data

cs.AI · 2026-04-22 · unverdicted · novelty 7.0

MALMAS is a memory-augmented multi-agent LLM system that generates diverse, high-quality features for tabular data via agent decomposition, routing, and iterative memory-guided refinement.

FLARE: Agentic Coverage-Guided Fuzzing for LLM-Based Multi-Agent Systems

cs.SE · 2026-04-07 · unverdicted · novelty 7.0

FLARE extracts specifications from multi-agent LLM code and applies coverage-guided fuzzing to achieve 96.9% inter-agent and 91.1% intra-agent coverage while uncovering 56 new failures across 16 applications.

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning

cs.CV · 2026-01-16 · conditional · novelty 7.0

VIGA introduces a training-free interleaved multimodal reasoning loop that improves vision-as-inverse-graphics accuracy over one-shot baselines on BlenderGym, SlideBench, and new BlenderBench.

Three Heads Are Better Than One: A Multi-perspective Reasoning Framework for Enhanced Vulnerability Detection

cs.SE · 2026-05-18 · conditional · novelty 6.0

ReasonVul deploys three LLM agents with independent analysis and structured debate to achieve 40% PairAcc and 72.52% F1 on PrimeVul, outperforming baselines by 81% in PairAcc.

SelfHeal: Empirical Fix Pattern Analysis and Bug Repair in LLM Agents

cs.SE · 2026-04-20 · unverdicted · novelty 6.0

SelfHeal uses two ReAct agents and empirical fix patterns to repair bugs in LLM agents, outperforming baselines on a new 37-instance benchmark.

Agentic Business Process Management: A Research Manifesto

cs.AI · 2026-03-19 · unverdicted · novelty 6.0

Agentic Business Process Management reframes BPM around autonomous agents that must exhibit framed autonomy, explainability, conversational actionability, and self-modification to keep their actions aligned with organizational objectives.

What Do Agents Communicate? Characterizing Information Exchange in Multi-Agent Systems

cs.MA · 2026-05-19 · unverdicted · novelty 5.0

Systematic study of inter-agent communication in LLM multi-agent systems shows reasoning and verification are critical for performance, with a new augmentation technique recovering 86.2% of failures.

Accountable Agents in Software Engineering: An Analysis of Terms of Service and a Research Roadmap

cs.SE · 2026-05-06 · unverdicted · novelty 5.0

Comparative review of AI coding tool ToS shows responsibility for code quality and compliance shifted to users, with policy misalignment for autonomous agents, plus a research roadmap.

CodeWiki: Evaluating AI's Ability to Generate Holistic Documentation for Large-Scale Codebases

cs.SE · 2025-10-28 · unverdicted · novelty 5.0

CodeWiki presents a unified framework for repository-level documentation across seven languages using hierarchical decomposition, recursive multi-agent processing, and multi-modal synthesis, outperforming DeepWiki by 4.73% on CodeWikiBench.

Recommendations for Efficient and Responsible LLM Adoption within Industrial Software Development

cs.SE · 2026-04-29 · conditional · novelty 4.0

A multi-case study plus survey produces seven actionable recommendations for efficient and responsible LLM use in industrial software engineering.

Compact Constraint Encoding for LLM Code Generation: An Empirical Study of Token Economics and Constraint Compliance

cs.SE · 2026-04-08 · conditional · novelty 4.0

Compact constraint headers reduce prompt tokens by 25-30% with no significant change in constraint compliance rates across tested models and tasks.

Fairness in Multi-Agent Systems for Software Engineering: An SDLC-Oriented Rapid Review

cs.SE · 2026-04-10 · unverdicted · novelty 2.0

A rapid review of fairness in LLM-enabled multi-agent systems for the software development lifecycle concludes that the field lacks standardized evaluations, broad coverage, and effective governance, leaving it unprepared for deployable fair systems.

citing papers explorer

Showing 14 of 14 citing papers.

Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions cs.SE · 2026-05-07 · conditional · none · ref 16
LLMs frequently specify library versions with known CVEs in generated code (36-56% of tasks), show low compatibility (20-63%), and converge on the same risky versions across models.
The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering cs.SE · 2025-07-20 · conditional · none · ref 20
AIDev is a new open dataset of 456k AI-agent pull requests showing agents submit code faster than humans but with lower acceptance rates and simpler changes.
Memory-Augmented LLM-based Multi-Agent System for Automated Feature Generation on Tabular Data cs.AI · 2026-04-22 · unverdicted · none · ref 33
MALMAS is a memory-augmented multi-agent LLM system that generates diverse, high-quality features for tabular data via agent decomposition, routing, and iterative memory-guided refinement.
FLARE: Agentic Coverage-Guided Fuzzing for LLM-Based Multi-Agent Systems cs.SE · 2026-04-07 · unverdicted · none · ref 15
FLARE extracts specifications from multi-agent LLM code and applies coverage-guided fuzzing to achieve 96.9% inter-agent and 91.1% intra-agent coverage while uncovering 56 new failures across 16 applications.
Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning cs.CV · 2026-01-16 · conditional · none · ref 20
VIGA introduces a training-free interleaved multimodal reasoning loop that improves vision-as-inverse-graphics accuracy over one-shot baselines on BlenderGym, SlideBench, and new BlenderBench.
Three Heads Are Better Than One: A Multi-perspective Reasoning Framework for Enhanced Vulnerability Detection cs.SE · 2026-05-18 · conditional · none · ref 18
ReasonVul deploys three LLM agents with independent analysis and structured debate to achieve 40% PairAcc and 72.52% F1 on PrimeVul, outperforming baselines by 81% in PairAcc.
SelfHeal: Empirical Fix Pattern Analysis and Bug Repair in LLM Agents cs.SE · 2026-04-20 · unverdicted · none · ref 33
SelfHeal uses two ReAct agents and empirical fix patterns to repair bugs in LLM agents, outperforming baselines on a new 37-instance benchmark.
Agentic Business Process Management: A Research Manifesto cs.AI · 2026-03-19 · unverdicted · none · ref 3
Agentic Business Process Management reframes BPM around autonomous agents that must exhibit framed autonomy, explainability, conversational actionability, and self-modification to keep their actions aligned with organizational objectives.
What Do Agents Communicate? Characterizing Information Exchange in Multi-Agent Systems cs.MA · 2026-05-19 · unverdicted · none · ref 25
Systematic study of inter-agent communication in LLM multi-agent systems shows reasoning and verification are critical for performance, with a new augmentation technique recovering 86.2% of failures.
Accountable Agents in Software Engineering: An Analysis of Terms of Service and a Research Roadmap cs.SE · 2026-05-06 · unverdicted · none · ref 18
Comparative review of AI coding tool ToS shows responsibility for code quality and compliance shifted to users, with policy misalignment for autonomous agents, plus a research roadmap.
CodeWiki: Evaluating AI's Ability to Generate Holistic Documentation for Large-Scale Codebases cs.SE · 2025-10-28 · unverdicted · none · ref 17
CodeWiki presents a unified framework for repository-level documentation across seven languages using hierarchical decomposition, recursive multi-agent processing, and multi-modal synthesis, outperforming DeepWiki by 4.73% on CodeWikiBench.
Recommendations for Efficient and Responsible LLM Adoption within Industrial Software Development cs.SE · 2026-04-29 · conditional · none · ref 16
A multi-case study plus survey produces seven actionable recommendations for efficient and responsible LLM use in industrial software engineering.
Compact Constraint Encoding for LLM Code Generation: An Empirical Study of Token Economics and Constraint Compliance cs.SE · 2026-04-08 · conditional · none · ref 2
Compact constraint headers reduce prompt tokens by 25-30% with no significant change in constraint compliance rates across tested models and tasks.
Fairness in Multi-Agent Systems for Software Engineering: An SDLC-Oriented Rapid Review cs.SE · 2026-04-10 · unverdicted · none · ref 23
A rapid review of fairness in LLM-enabled multi-agent systems for the software development lifecycle concludes that the field lacks standardized evaluations, broad coverage, and effective governance, leaving it unprepared for deployable fair systems.

Llm-based multi-agent systems for software engineering: Literature review, vision, and the road ahead.ACM Trans.Softw

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer