LongMemEval benchmarks long-term memory in chat assistants, revealing 30% accuracy drops across sustained interactions and proposing indexing-retrieval-reading optimizations that boost performance.
Long time no see! open-domain conversation with long-term persona memory
3 Pith papers cite this work. Polarity classification is still indexing.
Verdicts: UNVERDICTED 3
Representative citing papers
ToxPrune prunes toxic subwords from BPE tokenizers in LLMs to mitigate toxic dialogue responses and improve diversity on both toxic and non-toxic models.
Proposes the HACD-H framework, which integrates emotional adaptation, relational organization, memory, and personality into a dynamical system, and reports empirical patterns from a 14,700-turn dataset linking social intelligence to reduced social cognitive energy.
citing papers explorer
- Human-AI Coevolution Dynamics: A Formal Theory of Social Intelligence Emergence Through Long-Term Interaction
Proposes the HACD-H framework, which integrates emotional adaptation, relational organization, memory, and personality into a dynamical system, and reports empirical patterns from a 14,700-turn dataset linking social intelligence to reduced social cognitive energy.