Explaining context length scaling and bounds for language models.arXiv preprint arXiv:2502.01481, 2025

Jingzhe Shi, Qinwei Ma, Hongyi Liu, Hang Zhao, Jeng- Neng Hwang, Lei Li · 2025 · arXiv 2502.01481

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

When Stored Evidence Stops Being Usable: Scale-Conditioned Evaluation of Agent Memory

cs.AI · 2026-05-08 · unverdicted · novelty 7.0

A new evaluation protocol shows agent memory reliability degrades variably with added irrelevant sessions depending on agent, memory interface, and scale.

AgenticAI-DialogGen: Topic-Guided Conversation Generation for Fine-Tuning and Evaluating Short- and Long-Term Memories of LLMs

cs.CL · 2026-04-14 · unverdicted · novelty 6.0

AgenticAI-DialogGen uses LLM agents to generate persona-grounded, topic-guided conversations and QA pairs encoding short- and long-term memory, producing the TGC dataset that improves LLM performance on memory tasks.

Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization

cs.AI · 2026-05-02 · unverdicted · novelty 5.0

SCM-GRPO grounds multi-hop fact verification in structural causal models and applies GRPO reinforcement learning to optimize reasoning chain length, outperforming baselines on HoVer and EX-FEVER.

RAM: Recover Any 3D Human Motion in-the-Wild

cs.CV · 2026-03-20 · unverdicted · novelty 4.0

RAM outperforms prior methods on PoseTrack and 3DPW for zero-shot multi-person 3D motion tracking and reconstruction by fusing semantic tracking, memory-augmented pose estimation, and predictive fusion.

citing papers explorer

Showing 4 of 4 citing papers.

When Stored Evidence Stops Being Usable: Scale-Conditioned Evaluation of Agent Memory cs.AI · 2026-05-08 · unverdicted · none · ref 25
A new evaluation protocol shows agent memory reliability degrades variably with added irrelevant sessions depending on agent, memory interface, and scale.
AgenticAI-DialogGen: Topic-Guided Conversation Generation for Fine-Tuning and Evaluating Short- and Long-Term Memories of LLMs cs.CL · 2026-04-14 · unverdicted · none · ref 3
AgenticAI-DialogGen uses LLM agents to generate persona-grounded, topic-guided conversations and QA pairs encoding short- and long-term memory, producing the TGC dataset that improves LLM performance on memory tasks.
Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization cs.AI · 2026-05-02 · unverdicted · none · ref 51
SCM-GRPO grounds multi-hop fact verification in structural causal models and applies GRPO reinforcement learning to optimize reasoning chain length, outperforming baselines on HoVer and EX-FEVER.
RAM: Recover Any 3D Human Motion in-the-Wild cs.CV · 2026-03-20 · unverdicted · none · ref 73
RAM outperforms prior methods on PoseTrack and 3DPW for zero-shot multi-person 3D motion tracking and reconstruction by fusing semantic tracking, memory-augmented pose estimation, and predictive fusion.

Explaining context length scaling and bounds for language models.arXiv preprint arXiv:2502.01481, 2025

fields

years

verdicts

representative citing papers

citing papers explorer