Evaluating very long-term conversational memory of LLM agents

Adyasha Maharana, Dong-Ho Lee, Sergey Tulyakov, Mohit Bansal, Francesco Barbieri, Yuwei Fang · 2024

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

cs.CL · 2026-05-07 · unverdicted · novelty 6.0 · ref 23

LLM agents struggle to detect and act on implicit memory conflicts, with top models scoring 55.2% on the new STALE benchmark of 400 scenarios; the CUPMem prototype strengthens state-aware revision.