MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems

· 2026 · cs.CL · arXiv 2605.28732

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Memory is essential for enabling large language models to support long-horizon reasoning, yet existing memory systems remain unreliable and difficult to debug. Tracing memory's dynamic evolution is crucial to understand how information is synthesized, propagated, or corrupted over time. In this work, we study the new problem of error tracing and attribution in LLM memory systems. We propose a novel framework that transforms memory pipelines into executable memory evolution graphs, enabling fine-grained tracing of operational information flow. We then construct MemTraceBench, a benchmark collected from representative memory systems such as Long-Context, RAG, Mem0, and EverMemOS, to systematically study memory failure modes. We further introduce an automatic attribution method that iteratively traces operation subgraphs to pinpoint the root cause of any failed case. Our analysis reveals that memory failures are systematic, stemming from operation-level issues like information loss and retrieval misalignment. Crucially, we leverage these fine-grained attribution signals to guide downstream prompt optimization, establishing a closed-loop system that automatically corrects faults and boosts end-task performance by up to 7.62%. Code will be released at https://github.com/zjunlp/MemTrace.

representative citing papers

A-TMA: Decoupling State-Aware Memory Failures in Long-Term Agent Memory

cs.AI · 2026-07-02 · unverdicted · novelty 5.0

ATMA adds state labels and evidence packets to existing memory systems to reduce ghost memory failures, with reported gains on a new LTP benchmark and LoCoMo.

citing papers explorer

Showing 1 of 1 citing paper.

A-TMA: Decoupling State-Aware Memory Failures in Long-Term Agent Memory cs.AI · 2026-07-02 · unverdicted · none · ref 24 · internal anchor
ATMA adds state labels and evidence packets to existing memory systems to reduce ghost memory failures, with reported gains on a new LTP benchmark and LoCoMo.

MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems

fields

years

verdicts

representative citing papers

citing papers explorer