AEL uses a fast-timescale bandit for memory policy selection and slow-timescale LLM reflection for causal insights, achieving a Sharpe ratio of 2.13 on a 208-episode portfolio benchmark while showing that added mechanisms degrade performance.
Hybrid self-evolving structured memory for gui agents
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3representative citing papers
MementoGUI introduces a modular memory-control framework with working and episodic memory operators that improves long-horizon GUI agent performance over history-replay and text-only baselines.
Proposes ATMem as active task-driving state memory and STR-GRPO RL to improve GUI agent reliability on long-horizon mobile tasks over passive record storage.
citing papers explorer
-
AEL: Agent Evolving Learning for Open-Ended Environments
AEL uses a fast-timescale bandit for memory policy selection and slow-timescale LLM reflection for causal insights, achieving a Sharpe ratio of 2.13 on a 208-episode portfolio benchmark while showing that added mechanisms degrade performance.
-
MementoGUI: Learning Agentic Multimodal Memory Control for Long-Horizon GUI Agents
MementoGUI introduces a modular memory-control framework with working and episodic memory operators that improves long-horizon GUI agent performance over history-replay and text-only baselines.
-
What Memory Do GUI Agents Really Need? From Passive Records to Active Task-Driving States
Proposes ATMem as active task-driving state memory and STR-GRPO RL to improve GUI agent reliability on long-horizon mobile tasks over passive record storage.