← back to paper
arxiv: 2605.00425 · 2 revisions
AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning