MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

2026 · cs.AI · arXiv 2605.06623

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

Large language model (LLM)-based multi-agent systems (MAS) have shown promise in tackling complex collaborative tasks, where agents are typically orchestrated via role-specific prompts. While the quality of these prompts is pivotal, jointly optimizing them across interacting agents remains a non-trivial challenge, primarily due to the misalignment between local agent objectives and holistic system goals. To address this, we introduce MASPO, a novel framework designed to automatically and iteratively refine prompts across the entire system. A core innovation of MASPO is its joint evaluation mechanism, which assesses prompts not merely by their local validity, but by their capacity to facilitate downstream success for successor agents. This effectively bridges the gap between local interactions and global outcomes without relying on ground-truth labels. Furthermore, MASPO employs a data-driven evolutionary beam search to efficiently navigate the high-dimensional prompt space. Extensive empirical evaluations across 6 diverse tasks demonstrate that MASPO consistently outperforms state-of-the-art prompt optimization methods, achieving an average accuracy improvement of 2.9. We release our code at https://github.com/wangzx1219/MASPO.

representative citing papers

Do More Agents Help? Controlled and Protocol-Aligned Evaluation of LLM Agent Workflows

cs.AI · 2026-06-04 · conditional · novelty 5.0

Under controlled identical protocols, only one of six multi-agent LLM systems marginally exceeds a single-agent baseline on benchmark-balanced accuracy while the rest trail and cost more; a runtime workflow reaches 66.72% on GAIA.

citing papers explorer

Do More Agents Help? Controlled and Protocol-Aligned Evaluation of LLM Agent Workflows

cs.AI · 2026-06-04 · conditional · none · ref 82 · internal anchor
