Agent Memory: Characterization and System Implications of Stateful Long-Horizon Workloads

Alex Pentland; Marian Verhelst; Robin Geens; Thierry Tambe; Tsachy Weissman; Yasmine Omri; Zachary Broveak; Zexue He; Ziyu Gan

arxiv: 2606.06448 · v1 · pith:7Y7Q3UROnew · submitted 2026-06-04 · 💻 cs.AI

Agent Memory: Characterization and System Implications of Stateful Long-Horizon Workloads

Yasmine Omri , Ziyu Gan , Zachary Broveak , Robin Geens , Zexue He , Alex Pentland , Marian Verhelst , Tsachy Weissman

show 1 more author

Thierry Tambe

This is my paper

classification 💻 cs.AI

keywords memoryagentsystemsacrossagentscharacterizationconstructioncost

0 comments

read the original abstract

LLM agents are increasingly deployed on long-horizon tasks requiring sustained reasoning over extended interaction histories. Realizing this at scale requires agents to persistently store, retrieve, and update their own memory across sessions. A rich ecosystem of agent memory systems has emerged spanning flat retrieval, LLM-mediated extraction, consolidating fact stores, and agentic control flows. Yet, their system-level behavior remains uncharacterized. We present the first systems characterization of agent memory. First, we introduce a system-oriented taxonomy classifying agent memory systems along four axes. Second, we build a phase-aware profiling harness attributing cost to construction, retrieval, and generation. Third, we characterize ten representative systems across two benchmark suites, uncovering how design choices shift cost across the write and read paths. Finally, we derive 10 system recommendations covering construction scheduling, capability floors, amortization via query volume, freshness-latency tradeoffs, and fleet-scale management.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Memory as a Wasting Asset: Pricing Flash Endurance for Embodied Agents, and the Limits of Doing So
cs.AI 2026-06 unverdicted novelty 6.0

Flash endurance is priced via shadow price η making placement cost-optimal for any sign of value-write correlation χ, with χ positive only in recurrent long-horizon manipulation and the budget binding only on low-endu...