StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management

Hongbin Lai; Huheng Huang; Jiaran Gao; Junfeng Zhao; Ruiqing Li; Ruizhe Zhang; Tao Feng; Xinke Jiang; Xu Chu; Yasha Wang

arxiv: 2601.05890 · v2 · pith:V4X67XTQnew · submitted 2026-01-09 · 💻 cs.AI

StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management

Ruizhe Zhang , Xinke Jiang , Zhibang Yang , Zhixin Zhang , Jiaran Gao , Yuzhen Xiao , Tao Feng , Yue Fang

show 7 more authors

Yuxuan Liu Ruiqing Li Hongbin Lai Huheng Huang Xu Chu Junfeng Zhao Yasha Wang

This is my paper

classification 💻 cs.AI

keywords memorymulti-agentcoordinationexperiencestackplannercentralizedcollaborationcontrol

0 comments

read the original abstract

Multi-agent systems based on large language models, particularly centralized architectures, have recently shown strong potential for complex and knowledge-intensive tasks. However, central agents often suffer from unstable long-horizon collaboration due to the lack of memory management, leading to context bloat, error accumulation, and poor cross-task generalization. To address both task-level memory inefficiency and the inability to reuse coordination experience, we propose StackPlanner, a hierarchical multi-agent framework with explicit memory control. StackPlanner addresses these challenges by decoupling high-level coordination from subtask execution with active task-level memory control, and by learning to retrieve and exploit reusable coordination experience via structured experience memory and reinforcement learning. Experiments on multiple deep-search and agent system benchmarks demonstrate the effectiveness of our approach in enabling reliable long-horizon multi-agent collaboration.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems
cs.AI 2026-05 unverdicted novelty 7.0

A survey that unifies prior work on multi-agent LLM systems via the LIFE framework, mapping dependencies across collaboration, failure attribution, and autonomous self-evolution while identifying cross-stage challenges.
ScaffoldAgent: Utility-Guided Dynamic Outline Optimization for Open-Ended Deep Research
cs.AI 2026-06 unverdicted novelty 5.0

ScaffoldAgent improves long-form report generation by modeling outline evolution as expansion, contraction, and revision guided by a utility function estimating downstream value.
Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems
cs.AI 2026-05 conditional novelty 5.0

The survey proposes the LIFE framework to unify fragmented research on collaboration, failure attribution, and self-evolution in LLM multi-agent systems into a progression toward self-organizing intelligence.