CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology

Aakash Ahmad; Jussi Rasku; Kai-Kristian Kemell; Malik Abdul Sami; Mika Saari; Muhammad Waseem; Pekka Abrahamsson; Zeeshan Rasheed

arxiv: 2402.01411 · v3 · pith:WYRWS3IHnew · submitted 2024-02-02 · 💻 cs.SE

CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology

Zeeshan Rasheed , Muhammad Waseem , Kai-Kristian Kemell , Aakash Ahmad , Malik Abdul Sami , Mika Saari , Jussi Rasku , Pekka Abrahamsson This is my paper

classification 💻 cs.SE

keywords multi-agentdevelopmentllm-basedsoftwaresystemsautonomouschallengescode

0 comments

read the original abstract

Context: LLM-based multi-agent systems enable automation and decision support in software development, yet existing studies rely on benchmark datasets offering only binary pass-or-fail results, limiting insight into real-world applicability. Objective: This study empirically investigates the potential and limitations of LLM-based agents in autonomous software development tasks. Method: A two-phase approach was employed: developing a multi-agent system, CodePori, for automated code generation, and conducting participant-based evaluation to assess practical performance. Results: Participant feedback reveals key strengths, challenges, and areas for improvement in LLM-based multi-agent systems, highlighting aspects missed by standard code-generation benchmarks. Conclusions: While LLM-based multi-agent systems show potential for large-scale software development, successful integration requires addressing challenges such as memory limitations, hallucinations, and code smells, alongside a practitioner-centric perspective.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Memory in the Age of AI Agents
cs.CL 2025-12 unverdicted novelty 6.0

The paper maps agent memory research via three forms (token-level, parametric, latent), three functions (factual, experiential, working), and dynamics of formation/evolution/retrieval, plus benchmarks and future directions.
Beyond Functional Correctness: Design Issues in AI IDE-Generated Large-Scale Projects
cs.SE 2026-04 conditional novelty 5.0

AI IDEs with structured guidance can produce functional large-scale code but frequently introduce design flaws such as duplication, complexity, and principle violations that risk long-term maintainability.
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
cs.AI 2025-08 unverdicted novelty 5.0

A comprehensive review of self-evolving AI agents that improve themselves over time, organized via a framework of inputs, agent system, environment, and optimizers, with domain-specific and safety discussions.
Large Language Model-Based Agents for Software Engineering: A Survey
cs.SE 2024-09 unverdicted novelty 4.0

A literature survey that collects and categorizes 124 papers on LLM-based agents for software engineering from SE and agent perspectives.