pith:2GB75OA7
Agent Laboratory: Using LLM Agents as Research Assistants
Agent Laboratory lets LLM agents carry out the full research process from idea to code repository and report.
arxiv:2501.04227 v2 · 2025-01-08 · cs.HC · cs.AI · cs.CL · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{2GB75OA7A6ATDAEEV63PIS5JCB}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more
Record completeness
Claims
Agent Laboratory driven by o1-preview generates the best research outcomes; the generated machine learning code is able to achieve state-of-the-art performance compared to existing methods; human involvement significantly improves overall quality; and it achieves an 84% decrease in research expenses compared to previous autonomous research methods.
That the human evaluators invited to assess the outputs provide unbiased, reproducible judgments and that the SOTA comparisons use current, fairly matched baselines without post-hoc selection of tasks or metrics.
Agent Laboratory is an autonomous LLM framework that completes end-to-end research from idea to report and code, with human feedback improving quality and cutting expenses by 84% while reaching competitive ML performance.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:15.177657Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
d183feb81f0781318084afb6f44ba9105750a156f2ea9504bd3b7b2c132f5642
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: d183feb81f0781318084afb6f44ba9105750a156f2ea9504bd3b7b2c132f5642
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "60134504e062ed509777419fbe85f761cfbfe5d36590d086ae3fb354f16fa935",
"cross_cats_sorted": [
"cs.AI",
"cs.CL",
"cs.LG"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.HC",
"submitted_at": "2025-01-08T01:58:42Z",
"title_canon_sha256": "2eea69674a838fac30e8504e6339ed62c38737ea128b881f59b4deb534b50845"
},
"schema_version": "1.0",
"source": {
"id": "2501.04227",
"kind": "arxiv",
"version": 2
}
}