arXiv preprint arXiv:2601.09822 , year =

Yongjian Tang, Thomas Runkler · 2026 · arXiv 2601.09822

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems

cs.AI · 2026-05-14 · unverdicted · novelty 7.0 · 2 refs

A survey that unifies prior work on multi-agent LLM systems via the LIFE framework, mapping dependencies across collaboration, failure attribution, and autonomous self-evolution while identifying cross-stage challenges.

PerfCodeBench: Benchmarking LLMs for System-Level High-Performance Code Optimization

cs.SE · 2026-05-13 · unverdicted · novelty 7.0

PerfCodeBench reveals that state-of-the-art LLMs produce functionally correct but significantly slower code than expert-optimized versions on system-level tasks, especially those involving parallelism and GPUs.

PromptMN: Pseudo Prompting Language

cs.CL · 2026-06-15 · unverdicted · novelty 4.0

PromptMN is a pseudo-prompting DSL that adds compact typed directives to natural language to improve clarity, reusability, and reverse engineering of AI instructions.

Code Broker: A Multi-Agent System for Automated Code Quality Assessment

cs.SE · 2026-04-25 · unverdicted · novelty 3.0

Code Broker deploys a five-agent hierarchy that combines LLM semantic analysis with static linting to generate actionable Python code quality reports.

citing papers explorer

Showing 4 of 4 citing papers after filters.

Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems cs.AI · 2026-05-14 · unverdicted · none · ref 236 · 2 links
A survey that unifies prior work on multi-agent LLM systems via the LIFE framework, mapping dependencies across collaboration, failure attribution, and autonomous self-evolution while identifying cross-stage challenges.
PerfCodeBench: Benchmarking LLMs for System-Level High-Performance Code Optimization cs.SE · 2026-05-13 · unverdicted · none · ref 41
PerfCodeBench reveals that state-of-the-art LLMs produce functionally correct but significantly slower code than expert-optimized versions on system-level tasks, especially those involving parallelism and GPUs.
PromptMN: Pseudo Prompting Language cs.CL · 2026-06-15 · unverdicted · none · ref 17
PromptMN is a pseudo-prompting DSL that adds compact typed directives to natural language to improve clarity, reusability, and reverse engineering of AI instructions.
Code Broker: A Multi-Agent System for Automated Code Quality Assessment cs.SE · 2026-04-25 · unverdicted · none · ref 13
Code Broker deploys a five-agent hierarchy that combines LLM semantic analysis with static linting to generate actionable Python code quality reports.

arXiv preprint arXiv:2601.09822 , year =

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer