H i A gent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model

Association for Computational Linguistics · 2025 · DOI 10.18653/v1/2025.acl-long.1575

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

open at publisher browse 8 citing papers

citation-role summary

background 2

citation-polarity summary

background 1 unclear 1

representative citing papers

ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory

cs.AI · 2025-09-29 · conditional · novelty 7.0

ReasoningBank distills generalizable reasoning strategies from agent successes and failures to enable self-evolution, with memory-aware test-time scaling amplifying gains over raw-trajectory or success-only memory on web and software benchmarks.

SKILL.nb: Selective Formalization and Gated Execution for Durable Agent Workflows

cs.AI · 2026-06-06 · unverdicted · novelty 6.0

SKILL.nb uses selective formalization and gate-conditioned execution in auditable notebooks to improve durability of agent workflows, achieving 53.7% success on WebArena-Verified with 91.7% retention across re-executions.

Enhancing Software Engineering Through Closed-Loop Memory Optimization

cs.SE · 2026-06-04 · unverdicted · novelty 6.0

MemOp is a closed-loop memory augmentation framework for SE agents that defines memory utility via downstream task impact and reports gains of up to 5.25% success rate, 4.63% resolve efficiency, and 9.79% cost reduction.

SPIKE: An Adaptive Dual Controller Framework for Cost-Efficient Long-Horizon Game Agents

cs.CV · 2026-05-18 · unverdicted · novelty 6.0

SPIKE dual-controller framework raises success rates 5-9 points and cuts tokens 55% in StarDojo agents by reusing strategic plans across stable segments and escalating only at detected events.

MemRouter: Memory-as-Embedding Routing for Long-Term Conversational Agents

cs.CL · 2026-05-01 · unverdicted · novelty 6.0

A lightweight supervised router using frozen-LLM embeddings for memory admission decisions outperforms LLM-based memory managers in both F1 score and latency on the LoCoMo benchmark.

Memory in the LLM Era: Modular Architectures and Strategies in a Unified Framework

cs.CL · 2026-04-02 · unverdicted · novelty 6.0

A unified framework for LLM agent memory is benchmarked, with a new hybrid method outperforming state-of-the-art on standard tasks.

When Does Overlap Help? OSU-Mem and a Cell-Conditional Analysis of Trajectory Memory for LLM Agents

cs.IR · 2026-06-19 · unverdicted · novelty 5.0

OSU-Mem shows overlapping memory helps retrieval when evidence shares tools or entities but hurts when steps are heterogeneous, with benefits on synthetic benchmarks vanishing on mixed real ones due to query mixing.

MINTEval: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems

cs.CL · 2026-05-18 · unverdicted · novelty 5.0

MINTEval benchmark shows current memory-augmented systems average 27.9% accuracy on long-horizon interference tasks, limited by retrieval and memory construction with degradation from intervening updates.

citing papers explorer

Showing 1 of 1 citing paper after filters.

ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory cs.AI · 2025-09-29 · conditional · none · ref 78
ReasoningBank distills generalizable reasoning strategies from agent successes and failures to enable self-evolution, with memory-aware test-time scaling amplifying gains over raw-trajectory or success-only memory on web and software benchmarks.

H i A gent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer