pith:YF6Q6WFE
Recursive Agent Optimization
Reinforcement learning trains agents to recursively delegate sub-tasks to copies of themselves for divide-and-conquer scaling.
arxiv:2605.06639 v1 · 2026-05-07 · cs.LG · cs.AI · cs.CL · cs.MA
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{YF6Q6WFESKNLMY7FPAWQJZGPWV}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
recursive agents trained in this way enjoy better training efficiency, can scale to tasks that go beyond the model's context window, generalize to tasks much harder than the ones the agent was trained on, and can enjoy reduced wall-clock time compared to single-agent systems.
That reinforcement learning can reliably teach agents effective delegation and communication rules without introducing prohibitive overhead, infinite recursion risks, or communication failures that undermine the divide-and-conquer benefit.
RAO uses RL to train recursive agents that delegate sub-tasks to self-copies, yielding better training efficiency, generalization to harder tasks, scaling beyond context windows, and lower wall-clock time.
References
Receipt and verification
| First computed | 2026-05-18T15:04:06.583108Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
c17d0f58a4929ab663e5782d04e4cfb57d59e3b59f6cb043e21ff81a7d72f48c
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/YF6Q6WFESKNLMY7FPAWQJZGPWV \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: c17d0f58a4929ab663e5782d04e4cfb57d59e3b59f6cb043e21ff81a7d72f48c
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "b8fd2ec1394c6f4d7e5f57b9f7439374eeef98be31d1a639ed097d815751a76f",
"cross_cats_sorted": [
"cs.AI",
"cs.CL",
"cs.MA"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.LG",
"submitted_at": "2026-05-07T17:49:09Z",
"title_canon_sha256": "bdb824664eb729b97f9636399a078ea2d309fcee715c7fadf02788e627016269"
},
"schema_version": "1.0",
"source": {
"id": "2605.06639",
"kind": "arxiv",
"version": 1
}
}