These works target higher benchmark scores under a single reward-weighted objective but do not decompose the observed improvements into capability vs

12 A Related Work RL for LLM agents, tool use · 2025

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Does RL Expand the Capability Boundary of LLM Agents? A PASS@(k,T) Analysis

cs.LG · 2026-04-16 · unverdicted · novelty 7.0

RL expands the capability boundary of LLM agents on compositional tool-use tasks, shown by non-converging pass curves at large k with increasing T, while SFT regresses it and the effect is absent on simpler tasks.

citing papers explorer


Does RL Expand the Capability Boundary of LLM Agents? A PASS@(k,T) Analysis

cs.LG · 2026-04-16 · unverdicted · none · ref 21
RL expands the capability boundary of LLM agents on compositional tool-use tasks, shown by non-converging pass curves at large k with increasing T, while SFT regresses it and the effect is absent on simpler tasks.

