NeuSymMS: A Hybrid Neuro-Symbolic Memory System for Persistent, Self-Curating LLM Agents

Daya Rajaratnam; Mujahid Sultan; Sri Thuraisamy

arxiv: 2605.17596 · v2 · pith:UK2RBB4Nnew · submitted 2026-05-17 · 💻 cs.AI

NeuSymMS: A Hybrid Neuro-Symbolic Memory System for Persistent, Self-Curating LLM Agents

Mujahid Sultan , Sri Thuraisamy , Daya Rajaratnam This is my paper

Pith reviewed 2026-05-22 09:20 UTC · model grok-4.3

classification 💻 cs.AI

keywords neuro-symbolic memoryLLM agentspersistent memoryfact extractionexpert systemsmemory managementhybrid architectureCLIPS rules

0 comments

The pith

NeuSymMS pairs neural fact extraction with symbolic rules to give LLM agents persistent, scoped memory across sessions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces NeuSymMS to let LLM agents remember and reason about users over multiple conversations without losing details or mixing up information. It extracts facts from dialogue using neural models and then applies a symbolic expert system to classify, deduplicate, and manage those facts according to explicit lifecycle rules. Knowledge is stored as structured triples in a database with support for scoping and dual short-term and long-term horizons that promote active items and prune old ones. A sympathetic reader would care because current agents often suffer from context limits and unreliable recall, and this approach claims to deliver memory that stays continuous and auditable in production settings.

Core claim

NeuSymMS couples neural fact extraction from unstructured dialogue using LLMs and a CLIPS-based expert system that classifies, deduplicates, and reconciles facts under explicit lifecycle rules. The system represents knowledge as subject-relation-value triples stored in a relational database management system, supports user/agent/agent-to-agent scoping, and implements a dual-horizon memory model with access-based promotion and time-based pruning to maintain continuity while avoiding context-window bloat and cross-entity contamination.

What carries the argument

Hybrid neuro-symbolic architecture in which neural LLMs extract facts and a CLIPS expert system enforces lifecycle rules on subject-relation-value triples held in a relational database.

If this is right

The architecture maintains continuity of memory while avoiding context-window bloat and cross-entity contamination.
It supports scoping of knowledge to specific users, agents, or agent-to-agent interactions.
It provides a practical path to trustworthy, auditable memory for production agentic systems.
Dual-horizon memory with promotion and pruning keeps short-term and long-term stores balanced.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Agents using this memory could reduce repetition when users return to ongoing tasks or preferences.
The structured triple format might allow easier auditing or explanation of what an agent knows about a user.
Combining the approach with existing retrieval methods could create layered memory systems that handle both structured facts and raw logs.

Load-bearing premise

The CLIPS-based expert system can reliably classify, deduplicate, and reconcile extracted facts under explicit lifecycle rules without introducing systematic errors or inconsistencies.

What would settle it

Run the system on a controlled sequence of dialogues that deliberately contain duplicate or conflicting facts about the same subject and measure whether the stored triples remain consistent without manual intervention.

read the original abstract

We present NeuSymMS, an adaptive memory system that enables large language model (LLM) agents to learn, remember, and reason about users across sessions via a hybrid neuro-symbolic architecture. NeuSymMS couples neural fact extraction from unstructured dialogue using LLMs and a CLIPS-based expert system that classifies, deduplicates, and reconciles facts under explicit lifecycle rules. The system represents knowledge as subject-relation-value triples stored in relational database management system. It supports user/agents/agent-to-agent scoping, and implements a dual-horizon (short-term and long-term) memory model. IT leverages access-based promotion and time-based pruning of the memory on both horizpons. NeuSymMS maintains continuity of memory while avoiding context-window bloat and cross-entity contamination. We argue that this architecture offers a practical path to trustworthy, auditable memory for production agentic systems and discuss its novelty relative to log retrieval, summarization, and key-value approaches.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The manuscript presents NeuSymMS, a hybrid neuro-symbolic memory system for persistent, self-curating LLM agents. It couples LLM-based neural extraction of subject-relation-value triples from dialogue with a CLIPS expert system that classifies, deduplicates, and reconciles facts under explicit lifecycle rules. Knowledge is stored in an RDBMS with support for user/agent scoping and a dual-horizon (short-term/long-term) model using access-based promotion and time-based pruning. The authors claim this architecture provides a practical path to trustworthy, auditable memory for production agentic systems, distinguishing it from log retrieval, summarization, and key-value approaches.

Significance. If the architecture performs as described, the work could meaningfully advance reliable long-term memory for agentic LLM systems by combining neural extraction with symbolic rule-based curation, enabling auditability and reducing context contamination. The hybrid design and explicit lifecycle rules represent a concrete step beyond purely neural or retrieval-only methods. However, the complete absence of empirical results, ablations, or quantitative metrics in the manuscript prevents assessment of whether these benefits are realized in practice.

major comments (2)

[Abstract / Architecture] Abstract and Architecture description: The central claim that the system delivers 'trustworthy, auditable memory' rests on the CLIPS expert system reliably classifying, deduplicating, and reconciling LLM-extracted triples under lifecycle rules, yet no concrete rule definitions, conflict-resolution logic (e.g., priority ordering, evidence weighting, or rollback), or error-handling for hallucinations/contradictions are provided. This is load-bearing for the trustworthiness argument.
[Evaluation] Evaluation: The manuscript supplies no empirical results, error rates, ablation studies, case studies, or performance metrics to validate persistence, consistency, or auditability claims. Without such data the assertion of a 'practical path' for production systems cannot be evaluated.

minor comments (3)

[Abstract] Typo: 'IT leverages' should be 'It leverages'.
[Abstract] Spelling: 'horizpons' should be 'horizons'.
[Abstract] The abstract states the system 'discusses its novelty relative to log retrieval, summarization, and key-value approaches,' but the manuscript provides no explicit comparison table or detailed differentiation in the provided text.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback on NeuSymMS. We address the major comments point by point below, indicating where revisions will be made to the manuscript.

read point-by-point responses

Referee: [Abstract / Architecture] Abstract and Architecture description: The central claim that the system delivers 'trustworthy, auditable memory' rests on the CLIPS expert system reliably classifying, deduplicating, and reconciling LLM-extracted triples under lifecycle rules, yet no concrete rule definitions, conflict-resolution logic (e.g., priority ordering, evidence weighting, or rollback), or error-handling for hallucinations/contradictions are provided. This is load-bearing for the trustworthiness argument.

Authors: We agree that concrete details on the CLIPS rules are necessary to substantiate the trustworthiness and auditability claims. In the revised manuscript we will add a dedicated subsection in the architecture description that specifies the rule sets for fact classification, deduplication, reconciliation, and handling of contradictions or hallucinations. This will include example rules, priority mechanisms, and rollback procedures to make the logic explicit and auditable. revision: yes
Referee: [Evaluation] Evaluation: The manuscript supplies no empirical results, error rates, ablation studies, case studies, or performance metrics to validate persistence, consistency, or auditability claims. Without such data the assertion of a 'practical path' for production systems cannot be evaluated.

Authors: The current manuscript is an architecture and design paper that introduces the hybrid neuro-symbolic approach and its lifecycle rules. We acknowledge the absence of quantitative evaluation and will add a new section containing illustrative case studies that demonstrate multi-session fact extraction, deduplication, scoping, and dual-horizon persistence. These examples will provide qualitative evidence of consistency and auditability. Comprehensive quantitative metrics and ablations are reserved for a follow-up empirical study once the implementation is further matured. revision: partial

Circularity Check

0 steps flagged

No circularity detected in system architecture description

full rationale

The paper presents NeuSymMS as a hybrid neuro-symbolic architecture for LLM agent memory, coupling neural fact extraction with a CLIPS expert system for classification, deduplication, and reconciliation under explicit lifecycle rules, stored as subject-relation-value triples. No equations, fitted parameters, or derivation chain are described that would reduce any claimed result to its own inputs by construction. The central claim of trustworthy, auditable memory follows directly from the enumerated components and rules rather than from self-referential definitions, self-citation load-bearing premises, or renamed empirical patterns. As an engineering architecture paper without mathematical derivations or predictive modeling steps, the work is self-contained against external benchmarks and exhibits no circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The description relies on standard assumptions about LLM reliability for fact extraction and the soundness of rule-based reconciliation; no free parameters, new physical entities, or ad-hoc constants are introduced in the abstract.

axioms (2)

domain assumption LLMs can extract accurate subject-relation-value facts from unstructured dialogue
This is invoked as the basis for the neural fact extraction step.
domain assumption Explicit lifecycle rules in CLIPS can correctly classify, deduplicate, and reconcile facts
This underpins the symbolic component's ability to maintain memory integrity.

pith-pipeline@v0.9.0 · 5705 in / 1296 out tokens · 29230 ms · 2026-05-22T09:20:20.613047+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

NeuSymMS couples neural fact extraction from unstructured dialogue using LLMs and a CLIPS-based expert system that classifies, deduplicates, and reconciles facts under explicit lifecycle rules. The system represents knowledge as subject-relation-value triples stored in relational database management system.
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean LogicNat.induction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

It supports user/agents/agent-to-agent scoping, and implements a dual-horizon (short-term and long-term) memory model. IT leverages access-based promotion and time-based pruning of the memory on both horizpons.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

42 extracted references · 42 canonical work pages · 8 internal anchors

[1]

Lost in the Middle: How Language Models Use Long Contexts

N. F. Liuet al., “Lost in the middle: How language models use long contexts,”arXiv preprint arXiv:2307.03172, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[2]

Retrieval-augmented generation for knowledge-intensive nlp tasks,

P. Lewiset al., “Retrieval-augmented generation for knowledge-intensive nlp tasks,” inNeurIPS, 2020

work page 2020
[3]

Pegasus: Pre-training with extracted gap-sentences for abstractive summarization,

J. Zhanget al., “Pegasus: Pre-training with extracted gap-sentences for abstractive summarization,” inICML, 2020

work page 2020
[4]

GPT-4 Technical Report

OpenAI, “Gpt-4 technical report,” 2023, arXiv preprint arXiv:2303.08774

work page internal anchor Pith review Pith/arXiv arXiv 2023
[5]

Longformer: The long-document transformer,

I. Beltagyet al., “Longformer: The long-document transformer,” inACL, 2020

work page 2020
[6]

Locomo: A benchmark for long-context memory in llms,

J. Ahn, J. Doe, and A. Smith, “Locomo: A benchmark for long-context memory in llms,” inProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, 2024, snap Research

work page 2024
[7]

Recursive summarization for long-term dialogue memory in llms,

X. Wang, J. Smith, and A. Lee, “Recursive summarization for long-term dialogue memory in llms,”Neurocomputing, 2025

work page 2025
[8]

Remem: Hybrid memory graphs for episodic recollection,

J. Shu and J. Smith, “Remem: Hybrid memory graphs for episodic recollection,”International Conference on Learning Representations, 2026, to appear

work page 2026
[9]

A-mem: Dynamic zettelkasten-based memory graphs with agentic indexing,

J. Lee and J. Smith, “A-mem: Dynamic zettelkasten-based memory graphs with agentic indexing,”NeurIPS, 2025, placeholder entry

work page 2025
[10]

Giarratano and G

J. Giarratano and G. Riley,Expert Systems: Principles and Program- ming. Thomson, 2005

work page 2005
[11]

A truth maintenance system,

J. Doyle, “A truth maintenance system,”Artificial Intelligence, 1979

work page 1979
[12]

Neural-symbolic learning and reasoning: A survey and interpretation,

A. d. Garcezet al., “Neural-symbolic learning and reasoning: A survey and interpretation,”Neurocomputing, 2019

work page 2019
[13]

Knowledge graphs,

A. Hoganet al., “Knowledge graphs,”ACM Computing Surveys, 2021

work page 2021
[14]

Riley,CLIPS User’s Guide, NASA Johnson Space Center, 2017

G. Riley,CLIPS User’s Guide, NASA Johnson Space Center, 2017

work page 2017
[15]

Dense passage retrieval for open-domain question answering,

V . Karpukhinet al., “Dense passage retrieval for open-domain question answering,” inEMNLP, 2020

work page 2020
[16]

Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering

G. Izacard and E. Grave, “Leveraging passage retrieval with gener- ative models for open domain question answering,”arXiv preprint arXiv:2007.01282, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2007
[17]

Exploring the limits of transfer learning with a unified text-to-text transformer,

C. Raffelet al., “Exploring the limits of transfer learning with a unified text-to-text transformer,”Journal of Machine Learning Research, 2020

work page 2020
[18]

Summeval: Re-evaluating summarization evaluation,

P. Labanet al., “Summeval: Re-evaluating summarization evaluation,” inEMNLP, 2021

work page 2021
[19]

A review of relational machine learning for knowledge graphs,

M. Nickelet al., “A review of relational machine learning for knowledge graphs,”Proceedings of the IEEE, 2016

work page 2016
[20]

Rete: A fast algorithm for the many patterns/many objects match problem,

C. L. Forgy, “Rete: A fast algorithm for the many patterns/many objects match problem,”Artificial Intelligence, 1982

work page 1982
[21]

An assumption-based tms,

J. de Kleer, “An assumption-based tms,”Artificial Intelligence, 1986

work page 1986
[22]

Neural-symbolic learning and reasoning: A survey and interpretation,

T. R. Besoldet al., “Neural-symbolic learning and reasoning: A survey and interpretation,”Frontiers in Artificial Intelligence and Applications, 2017

work page 2017
[23]

The neuro-symbolic concept learner,

J. Maoet al., “The neuro-symbolic concept learner,” inICLR, 2019

work page 2019
[24]

Human memory: A proposed system and its control processes,

R. C. Atkinson and R. M. Shiffrin, “Human memory: A proposed system and its control processes,” inPsychology of Learning and Motivation, 1968

work page 1968
[25]

Working memory,

A. D. Baddeley and G. Hitch, “Working memory,”Psychology of Learning and Motivation, 1974

work page 1974
[26]

Ebbinghaus,Memory: A Contribution to Experimental Psychology

H. Ebbinghaus,Memory: A Contribution to Experimental Psychology. Leipzig: Duncker & Humblot, 1885, forgetting and decay phenomena; cite for time-based pruning rationale. [Online]. Available: https://archive.org/details/memorycontribut00ebbigoog

work page
[27]

Generative agents: Interactive simulacra of human behavior,

J. S. Parket al., “Generative agents: Interactive simulacra of human behavior,” inCHI, 2023

work page 2023
[28]

Reflexion: Language Agents with Verbal Reinforcement Learning

N. Shinnet al., “Reflexion: Language agents with verbal reinforcement learning,”arXiv preprint arXiv:2303.11366, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[29]

Voyager: An Open-Ended Embodied Agent with Large Language Models

G. Wanget al., “V oyager: An open-ended embodied agent in minecraft,” arXiv preprint arXiv:2305.16291, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[30]

The semantic web,

T. Berners-Leeet al., “The semantic web,”Scientific American, 2001

work page 2001
[31]

Patterns for building LLM-based systems & products,

E. Yan, “Patterns for building LLM-based systems & products,” July 2023, accessed: 2026-05-20. [Online]. Available: https://eugeneyan.com/writing/llm-patterns/

work page 2023
[32]

Nexa: Enterprise agentic ai platform,

M. R. Team, “Nexa: Enterprise agentic ai platform,” https://www.asknexa.ai, 2025, accessed 2026-05-17

work page 2025
[33]

Evaluating very long-term conversational memory of LLM agents,

A. Maharana, D.-H. Lee, S. Tulyakov, M. Bansal, F. Barbieri, and Y . Fang, “Evaluating very long-term conversational memory of LLM agents,” inProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), L.-W. Ku, A. Martins, and V . Srikumar, Eds. Bangkok, Thailand: Association for Computational Linguist...

work page 2024
[34]

Longmemeval: A benchmark for long-term inter- active memory in llm assistants,

F. Wu and Coauthors, “Longmemeval: A benchmark for long-term inter- active memory in llm assistants,”arXiv preprint, 2024, [Online]. Avail- able: https://www.emergentmind.com/topics/longmemeval-benchmark

work page 2024
[35]

Ama-bench: Evaluating long-horizon memory for agentic llms,

A.-B. Team, “Ama-bench: Evaluating long-horizon memory for agentic llms,”arXiv preprint arXiv:2602.22769, 2026

work page internal anchor Pith review arXiv 2026
[36]

Memoryarena: Benchmarking agent memory in interdependent multi-session loops,

A. D. Team, “Memoryarena: Benchmarking agent memory in interdependent multi-session loops,” [Online]. Available: https://memoryarena.github.io, 2026

work page 2026
[37]

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

M. Research and D. Team, “Mem0: Building production-ready ai agents with scalable long-term memory,”arXiv preprint arXiv:2504.19413, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025
[38]

MemGPT: Towards LLMs as Operating Systems

M. R. Team, “Memgpt: Towards llms as operating systems,”arXiv preprint arXiv:2310.08560, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[39]

Lightweight and cognitive agentic memory for efficient long- term interaction,

L. Team, “Lightweight and cognitive agentic memory for efficient long- term interaction,”arXiv preprint arXiv:2511.01448, 2025

work page arXiv 2025
[40]

Telemem: Contradiction-aware llm consolidation for long- term agent memory,

T. Team, “Telemem: Contradiction-aware llm consolidation for long- term agent memory,”arXiv preprint, 2026

work page 2026
[41]

Semantic anchoring for structured retrieval in llm agents,

S. A. Team, “Semantic anchoring for structured retrieval in llm agents,” arXiv preprint arXiv:2508.12630, 2025

work page arXiv 2025
[42]

Amac: Interpretable admission control for agentic memory systems,

A. Team, “Amac: Interpretable admission control for agentic memory systems,”arXiv preprint arXiv:2603.04549, 2026. Page 7 of 7

work page arXiv 2026

[1] [1]

Lost in the Middle: How Language Models Use Long Contexts

N. F. Liuet al., “Lost in the middle: How language models use long contexts,”arXiv preprint arXiv:2307.03172, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[2] [2]

Retrieval-augmented generation for knowledge-intensive nlp tasks,

P. Lewiset al., “Retrieval-augmented generation for knowledge-intensive nlp tasks,” inNeurIPS, 2020

work page 2020

[3] [3]

Pegasus: Pre-training with extracted gap-sentences for abstractive summarization,

J. Zhanget al., “Pegasus: Pre-training with extracted gap-sentences for abstractive summarization,” inICML, 2020

work page 2020

[4] [4]

GPT-4 Technical Report

OpenAI, “Gpt-4 technical report,” 2023, arXiv preprint arXiv:2303.08774

work page internal anchor Pith review Pith/arXiv arXiv 2023

[5] [5]

Longformer: The long-document transformer,

I. Beltagyet al., “Longformer: The long-document transformer,” inACL, 2020

work page 2020

[6] [6]

Locomo: A benchmark for long-context memory in llms,

J. Ahn, J. Doe, and A. Smith, “Locomo: A benchmark for long-context memory in llms,” inProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, 2024, snap Research

work page 2024

[7] [7]

Recursive summarization for long-term dialogue memory in llms,

X. Wang, J. Smith, and A. Lee, “Recursive summarization for long-term dialogue memory in llms,”Neurocomputing, 2025

work page 2025

[8] [8]

Remem: Hybrid memory graphs for episodic recollection,

J. Shu and J. Smith, “Remem: Hybrid memory graphs for episodic recollection,”International Conference on Learning Representations, 2026, to appear

work page 2026

[9] [9]

A-mem: Dynamic zettelkasten-based memory graphs with agentic indexing,

J. Lee and J. Smith, “A-mem: Dynamic zettelkasten-based memory graphs with agentic indexing,”NeurIPS, 2025, placeholder entry

work page 2025

[10] [10]

Giarratano and G

J. Giarratano and G. Riley,Expert Systems: Principles and Program- ming. Thomson, 2005

work page 2005

[11] [11]

A truth maintenance system,

J. Doyle, “A truth maintenance system,”Artificial Intelligence, 1979

work page 1979

[12] [12]

Neural-symbolic learning and reasoning: A survey and interpretation,

A. d. Garcezet al., “Neural-symbolic learning and reasoning: A survey and interpretation,”Neurocomputing, 2019

work page 2019

[13] [13]

Knowledge graphs,

A. Hoganet al., “Knowledge graphs,”ACM Computing Surveys, 2021

work page 2021

[14] [14]

Riley,CLIPS User’s Guide, NASA Johnson Space Center, 2017

G. Riley,CLIPS User’s Guide, NASA Johnson Space Center, 2017

work page 2017

[15] [15]

Dense passage retrieval for open-domain question answering,

V . Karpukhinet al., “Dense passage retrieval for open-domain question answering,” inEMNLP, 2020

work page 2020

[16] [16]

Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering

G. Izacard and E. Grave, “Leveraging passage retrieval with gener- ative models for open domain question answering,”arXiv preprint arXiv:2007.01282, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2007

[17] [17]

Exploring the limits of transfer learning with a unified text-to-text transformer,

C. Raffelet al., “Exploring the limits of transfer learning with a unified text-to-text transformer,”Journal of Machine Learning Research, 2020

work page 2020

[18] [18]

Summeval: Re-evaluating summarization evaluation,

P. Labanet al., “Summeval: Re-evaluating summarization evaluation,” inEMNLP, 2021

work page 2021

[19] [19]

A review of relational machine learning for knowledge graphs,

M. Nickelet al., “A review of relational machine learning for knowledge graphs,”Proceedings of the IEEE, 2016

work page 2016

[20] [20]

Rete: A fast algorithm for the many patterns/many objects match problem,

C. L. Forgy, “Rete: A fast algorithm for the many patterns/many objects match problem,”Artificial Intelligence, 1982

work page 1982

[21] [21]

An assumption-based tms,

J. de Kleer, “An assumption-based tms,”Artificial Intelligence, 1986

work page 1986

[22] [22]

Neural-symbolic learning and reasoning: A survey and interpretation,

T. R. Besoldet al., “Neural-symbolic learning and reasoning: A survey and interpretation,”Frontiers in Artificial Intelligence and Applications, 2017

work page 2017

[23] [23]

The neuro-symbolic concept learner,

J. Maoet al., “The neuro-symbolic concept learner,” inICLR, 2019

work page 2019

[24] [24]

Human memory: A proposed system and its control processes,

R. C. Atkinson and R. M. Shiffrin, “Human memory: A proposed system and its control processes,” inPsychology of Learning and Motivation, 1968

work page 1968

[25] [25]

Working memory,

A. D. Baddeley and G. Hitch, “Working memory,”Psychology of Learning and Motivation, 1974

work page 1974

[26] [26]

Ebbinghaus,Memory: A Contribution to Experimental Psychology

H. Ebbinghaus,Memory: A Contribution to Experimental Psychology. Leipzig: Duncker & Humblot, 1885, forgetting and decay phenomena; cite for time-based pruning rationale. [Online]. Available: https://archive.org/details/memorycontribut00ebbigoog

work page

[27] [27]

Generative agents: Interactive simulacra of human behavior,

J. S. Parket al., “Generative agents: Interactive simulacra of human behavior,” inCHI, 2023

work page 2023

[28] [28]

Reflexion: Language Agents with Verbal Reinforcement Learning

N. Shinnet al., “Reflexion: Language agents with verbal reinforcement learning,”arXiv preprint arXiv:2303.11366, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[29] [29]

Voyager: An Open-Ended Embodied Agent with Large Language Models

G. Wanget al., “V oyager: An open-ended embodied agent in minecraft,” arXiv preprint arXiv:2305.16291, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[30] [30]

The semantic web,

T. Berners-Leeet al., “The semantic web,”Scientific American, 2001

work page 2001

[31] [31]

Patterns for building LLM-based systems & products,

E. Yan, “Patterns for building LLM-based systems & products,” July 2023, accessed: 2026-05-20. [Online]. Available: https://eugeneyan.com/writing/llm-patterns/

work page 2023

[32] [32]

Nexa: Enterprise agentic ai platform,

M. R. Team, “Nexa: Enterprise agentic ai platform,” https://www.asknexa.ai, 2025, accessed 2026-05-17

work page 2025

[33] [33]

Evaluating very long-term conversational memory of LLM agents,

A. Maharana, D.-H. Lee, S. Tulyakov, M. Bansal, F. Barbieri, and Y . Fang, “Evaluating very long-term conversational memory of LLM agents,” inProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), L.-W. Ku, A. Martins, and V . Srikumar, Eds. Bangkok, Thailand: Association for Computational Linguist...

work page 2024

[34] [34]

Longmemeval: A benchmark for long-term inter- active memory in llm assistants,

F. Wu and Coauthors, “Longmemeval: A benchmark for long-term inter- active memory in llm assistants,”arXiv preprint, 2024, [Online]. Avail- able: https://www.emergentmind.com/topics/longmemeval-benchmark

work page 2024

[35] [35]

Ama-bench: Evaluating long-horizon memory for agentic llms,

A.-B. Team, “Ama-bench: Evaluating long-horizon memory for agentic llms,”arXiv preprint arXiv:2602.22769, 2026

work page internal anchor Pith review arXiv 2026

[36] [36]

Memoryarena: Benchmarking agent memory in interdependent multi-session loops,

A. D. Team, “Memoryarena: Benchmarking agent memory in interdependent multi-session loops,” [Online]. Available: https://memoryarena.github.io, 2026

work page 2026

[37] [37]

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

M. Research and D. Team, “Mem0: Building production-ready ai agents with scalable long-term memory,”arXiv preprint arXiv:2504.19413, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025

[38] [38]

MemGPT: Towards LLMs as Operating Systems

M. R. Team, “Memgpt: Towards llms as operating systems,”arXiv preprint arXiv:2310.08560, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[39] [39]

Lightweight and cognitive agentic memory for efficient long- term interaction,

L. Team, “Lightweight and cognitive agentic memory for efficient long- term interaction,”arXiv preprint arXiv:2511.01448, 2025

work page arXiv 2025

[40] [40]

Telemem: Contradiction-aware llm consolidation for long- term agent memory,

T. Team, “Telemem: Contradiction-aware llm consolidation for long- term agent memory,”arXiv preprint, 2026

work page 2026

[41] [41]

Semantic anchoring for structured retrieval in llm agents,

S. A. Team, “Semantic anchoring for structured retrieval in llm agents,” arXiv preprint arXiv:2508.12630, 2025

work page arXiv 2025

[42] [42]

Amac: Interpretable admission control for agentic memory systems,

A. Team, “Amac: Interpretable admission control for agentic memory systems,”arXiv preprint arXiv:2603.04549, 2026. Page 7 of 7

work page arXiv 2026