pith:UCH5PWOF
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation
A multi-agent system divides code generation among three agents to reach higher accuracy with lower token cost than single models.
arxiv:2312.13010 v3 · 2023-12-20 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{UCH5PWOF7QIHGFDOPYMVRW2FTO}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
AgentCoder (GPT-4) achieves 96.3% and 91.8% pass@1 in HumanEval and MBPP datasets with an overall token overhead of 56.9K and 66.3K, while state-of-the-art obtains only 90.2% and 78.9% pass@1 with an overall token overhead of 138.2K and 206.5K.
That iterative feedback from the test executor agent reliably improves code quality without introducing new errors or causing the programmer agent to overfit to the generated tests.
A three-agent loop of code generation, test creation, and execution feedback lifts pass@1 to 96.3% on HumanEval and 91.8% on MBPP for GPT-4 while using roughly half the tokens of prior state-of-the-art.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:53.608051Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
a08fd7d9c5fc1073146e7e1958db459bb4659d19bbbd7e45013838433479baa2
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/UCH5PWOF7QIHGFDOPYMVRW2FTO \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: a08fd7d9c5fc1073146e7e1958db459bb4659d19bbbd7e45013838433479baa2
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "f82e5695b3ec0d0a1c717f6d512cde914ec019994f33804d782a92a621d4e1c0",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CL",
"submitted_at": "2023-12-20T13:22:41Z",
"title_canon_sha256": "5a6bba019c25f78396c3860172ae82837bc1075766dc395859feffa808ad737f"
},
"schema_version": "1.0",
"source": {
"id": "2312.13010",
"kind": "arxiv",
"version": 3
}
}