Topology Matters: Measuring Memory Leakage in Multi-Agent LLMs

Defu Cao; Jinbo Liu; Tianyao Su; Xiyang Hu; Yan Liu; Yifei Wei; Yuan Liang; Yue Zhao; Yushun Dong

arxiv: 2512.04668 · v4 · pith:PBZUF7P2new · submitted 2025-12-04 · cs.CR · cs.AI · cs.CL

keywords: leakage, memory, multi-agent, attacker-target, attacker, connectivity, extraction, information

Graph topology is a fundamental determinant of memory leakage in multi-agent LLM systems, yet its effects remain poorly quantified. We introduce MAMA (Multi-Agent Memory Attack), a controlled evaluation framework for comparing topology-conditioned memory leakage in multi-agent LLM systems. MAMA operates on synthetic documents containing labeled Personally Identifiable Information (PII) entities, from which we generate sanitized task instructions. We execute a two-phase protocol: Engram (seeding private information into a target agent's memory) and Resonance (multi-round interaction where an attacker attempts extraction). Over 10 rounds, we measure leakage using a two-stage recovery criterion that combines exact-match extraction with LLM-based inference over the attacker's final output. We evaluate six canonical topologies (complete, circle, chain, tree, star, star-ring) across $n\in\{4,5,6\}$, attacker-target placements, and base models. Results are consistent: denser connectivity, shorter attacker-target distance, and higher target centrality increase leakage; most leakage occurs in early rounds and then plateaus; model choice shifts absolute rates but preserves broad structural trends; spatiotemporal/location attributes leak more readily than identity credentials or regulated identifiers. We distill practical guidance for system design: favor sparse or hierarchical connectivity, maximize attacker-target separation, and restrict hub/shortcut pathways via topology-aware access control. Our code is available at https://github.com/llll121/mama-eval.
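The three structural factors named in the abstract (connectivity density, attacker-target distance, and target centrality) can be computed for each of the six topologies with a few lines of code. The following is a minimal sketch under stated assumptions, not the MAMA implementation: the function names are hypothetical, "tree" is assumed to be a binary tree rooted at node 0, "star-ring" is assumed to be a hub (node 0) connected to every peripheral agent with the peripheral agents also joined in a ring, and centrality is taken to mean degree centrality. The abstract does not specify any of these details.

```python
from collections import deque

KINDS = ["complete", "circle", "chain", "tree", "star", "star-ring"]

def build_topology(kind, n):
    """Return an undirected adjacency dict over agents 0..n-1 (assumes n >= 4)."""
    adj = {i: set() for i in range(n)}

    def link(a, b):
        adj[a].add(b)
        adj[b].add(a)

    if kind == "complete":
        for a in range(n):
            for b in range(a + 1, n):
                link(a, b)
    elif kind == "circle":
        for a in range(n):
            link(a, (a + 1) % n)
    elif kind == "chain":
        for a in range(n - 1):
            link(a, a + 1)
    elif kind == "tree":
        # Assumption: binary tree, node a's parent is (a - 1) // 2.
        for a in range(1, n):
            link(a, (a - 1) // 2)
    elif kind == "star":
        for a in range(1, n):
            link(0, a)
    elif kind == "star-ring":
        # Assumption: hub 0 plus a ring over the peripheral nodes 1..n-1.
        for a in range(1, n):
            link(0, a)
            link(a, a + 1 if a + 1 < n else 1)
    else:
        raise ValueError(f"unknown topology: {kind}")
    return adj

def density(adj):
    """Fraction of possible undirected edges that are present."""
    n = len(adj)
    edges = sum(len(nbrs) for nbrs in adj.values()) // 2
    return edges / (n * (n - 1) / 2)

def distance(adj, src, dst):
    """Shortest-path hop count from src to dst via breadth-first search."""
    hops = {src: 0}
    queue = deque([src])
    while queue:
        node = queue.popleft()
        if node == dst:
            return hops[node]
        for nxt in adj[node]:
            if nxt not in hops:
                hops[nxt] = hops[node] + 1
                queue.append(nxt)
    return None  # unreachable

def degree_centrality(adj, node):
    """Degree of node divided by the maximum possible degree."""
    return len(adj[node]) / (len(adj) - 1)

if __name__ == "__main__":
    attacker, target = 4, 0  # hypothetical placement for n = 5
    for kind in KINDS:
        g = build_topology(kind, 5)
        print(kind, density(g), distance(g, attacker, target),
              degree_centrality(g, target))
```

Under these assumptions, a placement study would sweep `attacker` and `target` over all node pairs and compare the resulting distance and centrality values against measured leakage rates; the leakage measurement itself (exact match followed by LLM-based inference) is outside this sketch.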

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

AgentSocialBench: Evaluating Privacy Risks in Human-Centered Agentic Social Networks
cs.AI 2026-04 unverdicted novelty 8.0

AgentSocialBench demonstrates that privacy preservation is fundamentally harder in human-centered agentic social networks than in single-agent cases due to cross-domain coordination pressures and an abstraction parado...

AgentCollabBench: Diagnosing When Good Agents Make Bad Collaborators
cs.CL 2026-05 unverdicted novelty 6.0

AgentCollabBench shows that multi-agent reliability is limited by communication topology, with converging-DAG nodes causing synthesis bottlenecks that discard constraints and explain 7-40% of information loss variance.

Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses
cs.CR 2026-03 unverdicted novelty 6.0

The survey organizes over 400 papers on embodied AI safety into a multi-level taxonomy and flags overlooked issues such as fragile multimodal fusion and unstable planning under jailbreaks.