MeMo: Towards Language Models with Associative Memory Mechanisms

Andrea Favalli; Cristina Giannone; Davide Venditti; Elena Sofia Ruzzetti; Fabio Massimo Zanzotto; Federico Ranaldi; Giancarlo A. Xompero; Leonardo Ranaldi; Raniero Romagnoli

arxiv: 2502.12851 · v1 · pith:3PY77BBFnew · submitted 2025-02-18 · 💻 cs.CL · cs.AI

MeMo: Towards Language Models with Associative Memory Mechanisms

Fabio Massimo Zanzotto , Elena Sofia Ruzzetti , Giancarlo A. Xompero , Leonardo Ranaldi , Davide Venditti , Federico Ranaldi , Cristina Giannone , Andrea Favalli

show 1 more author

Raniero Romagnoli

This is my paper

classification 💻 cs.CL cs.AI

keywords memoarchitecturelanguagememorizationassociativelearningmodelsability

0 comments

read the original abstract

Memorization is a fundamental ability of Transformer-based Large Language Models, achieved through learning. In this paper, we propose a paradigm shift by designing an architecture to memorize text directly, bearing in mind the principle that memorization precedes learning. We introduce MeMo, a novel architecture for language modeling that explicitly memorizes sequences of tokens in layered associative memories. By design, MeMo offers transparency and the possibility of model editing, including forgetting texts. We experimented with the MeMo architecture, showing the memorization power of the one-layer and the multi-layer configurations.

This paper has not been read by Pith yet.

MeMo: Towards Language Models with Associative Memory Mechanisms

discussion (0)