m0_i2": 4}, 20) - Parallel: results = await asyncio.gather( launch_subagent({

async def launch_subagent(targets: dict, num_steps: int, context: str = "") -> str Launch a subagent to craft specific targets (shares your inventory)

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Recursive Agent Optimization

cs.LG · 2026-05-07 · unverdicted · novelty 5.0

RAO uses RL to train recursive agents that delegate sub-tasks to self-copies, yielding better training efficiency, generalization to harder tasks, scaling beyond context windows, and lower wall-clock time.

citing papers explorer

Showing 1 of 1 citing paper.

Recursive Agent Optimization cs.LG · 2026-05-07 · unverdicted · none · ref 9
RAO uses RL to train recursive agents that delegate sub-tasks to self-copies, yielding better training efficiency, generalization to harder tasks, scaling beyond context windows, and lower wall-clock time.

m0_i2": 4}, 20) - Parallel: results = await asyncio.gather( launch_subagent({

fields

years

verdicts

representative citing papers

citing papers explorer