MemGuard: Preventing Memory Contamination in Long-Term Memory-Augmented Large Language Models

Cheng Qian; Dilek Hakkani-Tur; Heng Ji; Hyeonjeong Ha; Jeonghwan Kim; Jiayu Liu; Kathleen McKeown; William M. Campbell; Yue Wu; Yuji Zhang

read the original abstract

Memory-augmented large language models extend reasoning beyond a fixed context window by maintaining long-term memory across interactions. However, existing memory systems often collapse stable user facts, episodic events, and behavioral rules into a shared space, allowing functionally distinct memories to be retrieved and used as interchangeable evidence. We identify this failure mode as heterogeneous memory contamination, where context-specific events become overgeneralized claims, or semantically relevant but functionally incompatible memories mislead generation. To this end, we introduce MemGuard, a type-aware memory framework that preserves functional memory boundaries during memory construction and retrieval. It assigns each memory an explicit functional role at write time, maintains relations across type-isolated memories, and selectively composes evidence only from necessary memory types, reducing contamination from irrelevant or functionally incompatible evidence. Across hallucination and long-horizon conversation benchmarks, MemGuard improves memory reliability by up to 28.27% while retrieving up to 5.8x fewer memory tokens than prior methods. These results suggest that reliable long-term reasoning depends on principled organization and selective use of heterogeneous memory.

MemGuard: Preventing Memory Contamination in Long-Term Memory-Augmented Large Language Models

discussion (0)