hub Canonical reference

Llm-based multi-agent systems for software engineering: Literature review, vision, and the road ahead.ACM Trans.Softw

Junda He, Christoph Treude, David Lo · 2025 · ACM Transactions on Software Engineering and Methodology · DOI 10.1145/3712003

Canonical reference. 100% of citing Pith papers cite this work as background.

20 Pith papers citing it

108 external citations · Crossref

Background 100% of classified citations

open at publisher browse 20 citing papers

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

background 5

citation-polarity summary

background 5

representative citing papers

Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions

cs.SE · 2026-05-07 · conditional · novelty 8.0

LLMs frequently specify library versions with known CVEs in generated code (36-56% of tasks), show low compatibility (20-63%), and converge on the same risky versions across models.

The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering

cs.SE · 2025-07-20 · conditional · novelty 8.0

AIDev is a new open dataset of 456k AI-agent pull requests showing agents submit code faster than humans but with lower acceptance rates and simpler changes.

From Prompt to Process: a Process Taxonomy and Comparative Assessment of Frameworks Supporting AI Software Development Agents

cs.SE · 2026-06-03 · conditional · novelty 7.0

A new six-dimension process taxonomy for AI software development frameworks shows convergence on artifact persistence and human oversight but reveals that no framework covers all dimensions strongly, indicating a depth-portability trade-off.

Memory-Augmented LLM-based Multi-Agent System for Automated Feature Generation on Tabular Data

cs.AI · 2026-04-22 · unverdicted · novelty 7.0

MALMAS is a memory-augmented multi-agent LLM system that generates diverse, high-quality features for tabular data via agent decomposition, routing, and iterative memory-guided refinement.

FLARE: Agentic Coverage-Guided Fuzzing for LLM-Based Multi-Agent Systems

cs.SE · 2026-04-07 · unverdicted · novelty 7.0

FLARE extracts specifications from multi-agent LLM code and applies coverage-guided fuzzing to achieve 96.9% inter-agent and 91.1% intra-agent coverage while uncovering 56 new failures across 16 applications.

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning

cs.CV · 2026-01-16 · conditional · novelty 7.0

VIGA introduces a training-free interleaved multimodal reasoning loop that improves vision-as-inverse-graphics accuracy over one-shot baselines on BlenderGym, SlideBench, and new BlenderBench.

POIROT: Interrogating Agents for Failure Detection in Multi-Agent Systems

cs.AI · 2026-06-01 · unverdicted · novelty 6.0

POIROT protocol repurposes agents in LLM multi-agent systems as an internal diagnostic layer for failure detection, outperforming single-LLM evaluators with gains that increase with complexity, agent count, and fault types.

Three Heads Are Better Than One: A Multi-perspective Reasoning Framework for Enhanced Vulnerability Detection

cs.SE · 2026-05-18 · conditional · novelty 6.0

ReasonVul deploys three LLM agents with independent analysis and structured debate to achieve 40% PairAcc and 72.52% F1 on PrimeVul, outperforming baselines by 81% in PairAcc.

SelfHeal: Empirical Fix Pattern Analysis and Bug Repair in LLM Agents

cs.SE · 2026-04-20 · unverdicted · novelty 6.0

SelfHeal uses two ReAct agents and empirical fix patterns to repair bugs in LLM agents, outperforming baselines on a new 37-instance benchmark.

Agentic Business Process Management: A Research Manifesto

cs.AI · 2026-03-19 · unverdicted · novelty 6.0

Agentic Business Process Management reframes BPM around autonomous agents that must exhibit framed autonomy, explainability, conversational actionability, and self-modification to keep their actions aligned with organizational objectives.

Dynamic Coordination Strategy Selection for Enterprise Multi-Agent Systems

cs.MA · 2026-05-30 · unverdicted · novelty 5.0

Large-scale experiment with 1440 task executions finds dynamic routing of coordination strategies achieves near-best quality scores across models and classes but does not reliably identify exact winners.

RocketSmith: Agentic Additive Manufacturing of High-Powered Rockets

cs.RO · 2026-05-25 · unverdicted · novelty 5.0 · 2 refs

RocketSmith is an LLM-based agentic system that designs four high-powered rockets via additive manufacturing, with two achieving stable launches and recovery after reaching 80% of simulated apogee.

Code2UML: Agentic LLMs with context engineering for scalable software visualization

cs.SE · 2026-05-23 · unverdicted · novelty 5.0

Agentic architecture with context engineering enables scalable UML diagram generation from source code across multiple languages and diagram types.

What Do Agents Communicate? Characterizing Information Exchange in Multi-Agent Systems

cs.MA · 2026-05-19 · unverdicted · novelty 5.0

Systematic study of inter-agent communication in LLM multi-agent systems shows reasoning and verification are critical for performance, with a new augmentation technique recovering 86.2% of failures.

Accountable Agents in Software Engineering: An Analysis of Terms of Service and a Research Roadmap

cs.SE · 2026-05-06 · unverdicted · novelty 5.0

Comparative review of AI coding tool ToS shows responsibility for code quality and compliance shifted to users, with policy misalignment for autonomous agents, plus a research roadmap.

CodeWiki: Evaluating AI's Ability to Generate Holistic Documentation for Large-Scale Codebases

cs.SE · 2025-10-28 · unverdicted · novelty 5.0

CodeWiki presents a unified framework for repository-level documentation across seven languages using hierarchical decomposition, recursive multi-agent processing, and multi-modal synthesis, outperforming DeepWiki by 4.73% on CodeWikiBench.

Automated Summarization of Software Documents: An LLM-based Multi-Agent Approach

cs.SE · 2026-06-23 · unverdicted · novelty 4.0

Metagente is an LLM multi-agent system using Teacher-Student collaboration that outperforms baselines on real-world software documentation summarization for requirements analysis and technical docs.

Recommendations for Efficient and Responsible LLM Adoption within Industrial Software Development

cs.SE · 2026-04-29 · conditional · novelty 4.0

A multi-case study plus survey produces seven actionable recommendations for efficient and responsible LLM use in industrial software engineering.

Compact Constraint Encoding for LLM Code Generation: An Empirical Study of Token Economics and Constraint Compliance

cs.SE · 2026-04-08 · conditional · novelty 4.0

Compact constraint headers reduce prompt tokens by 25-30% with no significant change in constraint compliance rates across tested models and tasks.

Fairness in Multi-Agent Systems for Software Engineering: An SDLC-Oriented Rapid Review

cs.SE · 2026-04-10 · unverdicted · novelty 2.0

A rapid review of fairness in LLM-enabled multi-agent systems for the software development lifecycle concludes that the field lacks standardized evaluations, broad coverage, and effective governance, leaving it unprepared for deployable fair systems.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

Llm-based multi-agent systems for software engineering: Literature review, vision, and the road ahead.ACM Trans.Softw

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer