2.Update Rule.The utility is updated via the linear EMA rule with learning rateα∈(0,1): Qt+1 = (1−α)Q t +αr t

Stationary Reward

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory

cs.CL · 2026-01-06 · unverdicted · novelty 5.0

MemRL enables self-evolving AI agents through reinforcement learning on episodic memory with a two-phase retrieval process that filters noise and selects high-utility strategies based on environmental feedback.

citing papers explorer

Showing 1 of 1 citing paper.

MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory cs.CL · 2026-01-06 · unverdicted · none · ref 3
MemRL enables self-evolving AI agents through reinforcement learning on episodic memory with a two-phase retrieval process that filters noise and selects high-utility strategies based on environmental feedback.

2.Update Rule.The utility is updated via the linear EMA rule with learning rateα∈(0,1): Qt+1 = (1−α)Q t +αr t

fields

years

verdicts

representative citing papers

citing papers explorer