pith:QOPZW2B2
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
Engram introduces conditional memory as a new sparsity axis that lets large language models perform direct O(1) knowledge lookups instead of computing retrieval.
arxiv:2601.07372 v1 · 2026-01-12 · cs.CL · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{QOPZW2B2YFYM2D6YOCSNSC5YUO}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Scaling Engram to 27B parameters achieves superior performance over a strictly iso-parameter and iso-FLOPs MoE baseline, with notable gains in reasoning (BBH +5.0, ARC-Challenge +3.7) and long-context retrieval (Multi-Query NIAH: 84.2 to 97.0).
The U-shaped scaling law for sparsity allocation between MoE computation and Engram memory generalizes beyond the tested model sizes and tasks, and the observed mechanistic benefits (relieving early layers, freeing attention) are causally due to the memory module rather than confounding factors in the experimental setup.
Engram adds conditional memory via scalable lookup to LLMs, outperforming iso-parameter MoE baselines on reasoning and long-context tasks by following a U-shaped scaling law for allocating between computation and memory.
Receipt and verification
| Field | Value |
|---|---|
| First computed | 2026-05-17T23:38:49.048549Z |
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) |
| Schema | pith-number/v1.0 |
Canonical hash
839f9b683ac170cd0fd870a4d90bb8a3a891d73cca8c323b9994b23576dabaee
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/QOPZW2B2YFYM2D6YOCSNSC5YUO \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 839f9b683ac170cd0fd870a4d90bb8a3a891d73cca8c323b9994b23576dabaee
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "d190569143980c41c63739ea5ab93edafb302492a5bf8dd424afdb714a6040b2",
    "cross_cats_sorted": [
      "cs.AI"
    ],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-01-12T09:54:49Z",
    "title_canon_sha256": "f356246ba4b44007608f772ef51593fefac15a455acd4eda5c5b20e0d67de9a2"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2601.07372",
    "kind": "arxiv",
    "version": 1
  }
}
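Offline verification sketch

The shell one-liner above fetches the record over the network. The same check can be sketched offline against the canonical record JSON shown here. This is a minimal sketch that assumes the canonicalization is exactly what the one-liner implies: keys sorted, compact `,` and `:` separators, non-ASCII left unescaped, UTF-8 encoding, then SHA-256. The function name `canonical_sha256` is hypothetical and not part of any Pith tooling.

```python
import hashlib
import json


def canonical_sha256(record: dict) -> str:
    """Hash a record the way the verification one-liner does (assumed scheme)."""
    # Sort keys and drop all optional whitespace so the byte string is stable
    # regardless of how the JSON was originally formatted.
    canonical = json.dumps(
        record,
        sort_keys=True,
        separators=(",", ":"),
        ensure_ascii=False,
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Canonical record copied from this page.
record = {
    "metadata": {
        "abstract_canon_sha256": "d190569143980c41c63739ea5ab93edafb302492a5bf8dd424afdb714a6040b2",
        "cross_cats_sorted": ["cs.AI"],
        "license": "http://creativecommons.org/licenses/by/4.0/",
        "primary_cat": "cs.CL",
        "submitted_at": "2026-01-12T09:54:49Z",
        "title_canon_sha256": "f356246ba4b44007608f772ef51593fefac15a455acd4eda5c5b20e0d67de9a2",
    },
    "schema_version": "1.0",
    "source": {"id": "2601.07372", "kind": "arxiv", "version": 1},
}

digest = canonical_sha256(record)
print(digest)
```

Compare the printed digest with the canonical hash listed on this page. A mismatch means either the record was altered or the assumed canonicalization differs from the one the builder used.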