AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

8 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

years

2026: 6 · 2025: 2

verdicts

UNVERDICTED 8

roles

background 1

polarities

background 1

representative citing papers

AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

AstraFlow decouples RL components into autonomous dataflow services to natively support multi-policy agentic LLM training, elastic scaling, and cross-region execution with 2.7x speedup on math, code, search, and AgentBench workloads.

TRACE: Capability-Targeted Agentic Training

cs.AI · 2026-04-07 · unverdicted · novelty 6.0

TRACE identifies capability gaps from agent trajectory contrasts, synthesizes per-capability RL training environments, and routes LoRA adapters at inference to improve performance on customer service and tool-use benchmarks.

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

cs.AI · 2026-05-07 · unverdicted · novelty 5.0 · 3 refs

Skill1 trains a single RL policy to co-evolve skill selection, utilization, and distillation in language model agents from one task-outcome reward, using low-frequency trends to credit selection and high-frequency variation to credit distillation, outperforming baselines on ALFWorld and WebShop.
