A benchmark for procedural memory retrieval in language agents. CoRR, abs/2511.21730

Ishant Kohar, Aswanth Krishnan · arXiv 2511.21730

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

LMEB: Long-horizon Memory Embedding Benchmark

cs.CL · 2026-03-13 · unverdicted · novelty 7.0

The LMEB benchmark shows that embedding models' performance on traditional retrieval does not transfer to long-horizon memory tasks, that larger models do not always perform better, and that LMEB measures capabilities orthogonal to MTEB.

citing papers explorer

Showing 1 of 1 citing paper.

LMEB: Long-horizon Memory Embedding Benchmark · cs.CL · 2026-03-13 · unverdicted · none · ref 16
