The Annals of Mathematical Statistics , volume=

A Stochastic Approximation Method , author=

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

MemQ: Integrating Q-Learning into Self-Evolving Memory Agents over Provenance DAGs

cs.AI · 2026-05-08 · unverdicted · novelty 7.0 · 3 refs

MemQ improves LLM agent performance by using eligibility traces over provenance DAGs to assign credit to dependent memories, achieving top success rates on six benchmarks with largest gains on complex multi-step tasks.

Equilibrium Selection in Multi-Agent Policy Gradients via Opponent-Aware Basin Entry

cs.LG · 2026-05-18 · unverdicted · novelty 6.0

Opponent-aware peer-learning corrections in finite-unroll Meta-MAPG increase entry probability into target stable-Nash basins relative to standard policy gradient, with annealing to recover local convergence.

citing papers explorer

Showing 2 of 2 citing papers.

MemQ: Integrating Q-Learning into Self-Evolving Memory Agents over Provenance DAGs cs.AI · 2026-05-08 · unverdicted · none · ref 22 · 3 links
MemQ improves LLM agent performance by using eligibility traces over provenance DAGs to assign credit to dependent memories, achieving top success rates on six benchmarks with largest gains on complex multi-step tasks.
Equilibrium Selection in Multi-Agent Policy Gradients via Opponent-Aware Basin Entry cs.LG · 2026-05-18 · unverdicted · none · ref 13
Opponent-aware peer-learning corrections in finite-unroll Meta-MAPG increase entry probability into target stable-Nash basins relative to standard policy gradient, with annealing to recover local convergence.

The Annals of Mathematical Statistics , volume=

fields

years

verdicts

representative citing papers

citing papers explorer