Towards realistic project-level code generation via multi-agent collaboration and semantic architecture modeling

Qianhui Zhao, Li Zhang, Fang Liu, Junhang Cheng, Chengru Wu, Junchen Ai, Qiaoyuanhe Meng, Lichen Zhang, Xiaoli Lian, Shubin Song, Yuanping Guo · 2025 · arXiv 2511.03404

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

representative citing papers

Do Papers Tell the Whole Story? A Benchmark and Framework for Uncovering Hidden Implementation Gaps in Bioinformatics

cs.LG · 2026-03-23 · unverdicted · novelty 8.0

BioCon is the first benchmark dataset and cross-modal framework for detecting inconsistencies between methodological descriptions in bioinformatics papers and their code implementations.

Benchmarking Requirement-to-Architecture Generation with Hybrid Evaluation

cs.SE · 2026-04-08 · unverdicted · novelty 7.0

R2ABench benchmark shows LLMs generate syntactically valid software architectures from requirements but produce structurally fragmented results due to weak relational reasoning.

CodeTeam: An LLM-Powered Multi-Agent Framework for Repository-Level Code Generation

cs.SE · 2026-06-20 · unverdicted · novelty 6.0

CodeTeam is an LLM multi-agent system that improves SketchBLEU by 4.1/2.9 points and achieves top test pass rates (34.6% PE, 42.3% SFT) on repository-level code generation benchmarks via role-specialized planning and implementation stages.

When Parallelism Pays Off: Cohesion-Aware Task Partitioning for Multi-Agent Coding

cs.LG · 2026-05-31 · unverdicted · novelty 6.0

Co-Coder partitions code dependency graphs via community detection to orchestrate multi-agent LLM coding, improving pass rates up to 14%, wall-clock speedup up to 2.1x, and cutting API cost up to 35% on dependency-dense tasks.

Contract-Coding: Towards Repo-Level Generation via Structured Symbolic Paradigm

cs.SE · 2026-04-10 · unverdicted · novelty 5.0

Contract-Coding projects ambiguous intents into formal Language Contracts as a single source of truth to enable more reliable repo-level code generation, reporting 47% functional success on the Greenfield-5 benchmark.

citing papers explorer

Showing 5 of 5 citing papers after filters.

Do Papers Tell the Whole Story? A Benchmark and Framework for Uncovering Hidden Implementation Gaps in Bioinformatics cs.LG · 2026-03-23 · unverdicted · none · ref 16
BioCon is the first benchmark dataset and cross-modal framework for detecting inconsistencies between methodological descriptions in bioinformatics papers and their code implementations.
Benchmarking Requirement-to-Architecture Generation with Hybrid Evaluation cs.SE · 2026-04-08 · unverdicted · none · ref 33
R2ABench benchmark shows LLMs generate syntactically valid software architectures from requirements but produce structurally fragmented results due to weak relational reasoning.
CodeTeam: An LLM-Powered Multi-Agent Framework for Repository-Level Code Generation cs.SE · 2026-06-20 · unverdicted · none · ref 47
CodeTeam is an LLM multi-agent system that improves SketchBLEU by 4.1/2.9 points and achieves top test pass rates (34.6% PE, 42.3% SFT) on repository-level code generation benchmarks via role-specialized planning and implementation stages.
When Parallelism Pays Off: Cohesion-Aware Task Partitioning for Multi-Agent Coding cs.LG · 2026-05-31 · unverdicted · none · ref 34
Co-Coder partitions code dependency graphs via community detection to orchestrate multi-agent LLM coding, improving pass rates up to 14%, wall-clock speedup up to 2.1x, and cutting API cost up to 35% on dependency-dense tasks.
Contract-Coding: Towards Repo-Level Generation via Structured Symbolic Paradigm cs.SE · 2026-04-10 · unverdicted · none · ref 4
Contract-Coding projects ambiguous intents into formal Language Contracts as a single source of truth to enable more reliable repo-level code generation, reporting 47% functional success on the Greenfield-5 benchmark.

Towards realistic project-level code generation via multi-agent collaboration and semantic architecture modeling

fields

years

verdicts

representative citing papers

citing papers explorer