{"record_type":"pith_number_record","schema_url":"https://pith.science/schemas/pith-number/v1.json","pith_number":"pith:2025:2GB75OA7A6ATDAEEV63PIS5JCB","short_pith_number":"pith:2GB75OA7","schema_version":"1.0","canonical_sha256":"d183feb81f0781318084afb6f44ba9105750a156f2ea9504bd3b7b2c132f5642","source":{"kind":"arxiv","id":"2501.04227","version":2},"attestation_state":"computed","paper":{"title":"Agent Laboratory: Using LLM Agents as Research Assistants","license":"http://creativecommons.org/licenses/by/4.0/","headline":"Agent Laboratory lets LLM agents carry out the full research process from idea to code repository and report.","cross_cats":["cs.AI","cs.CL","cs.LG"],"primary_cat":"cs.HC","authors_text":"Emad Barsoum, Jialian Wu, Jiang Liu, Michael Moor, Samuel Schmidgall, Xiaodong Yu, Ximeng Sun, Yusheng Su, Ze Wang, Zicheng Liu","submitted_at":"2025-01-08T01:58:42Z","abstract_excerpt":"Historically, scientific discovery has been a lengthy and costly process, demanding substantial time and resources from initial conception to final results. To accelerate scientific discovery, reduce research costs, and improve research quality, we introduce Agent Laboratory, an autonomous LLM-based framework capable of completing the entire research process. This framework accepts a human-provided research idea and progresses through three stages--literature review, experimentation, and report writing to produce comprehensive research outputs, including a code repository and a research report"},"verification_status":{"content_addressed":true,"pith_receipt":true,"author_attested":false,"weak_author_claims":0,"strong_author_claims":0,"externally_anchored":false,"storage_verified":false,"citation_signatures":0,"replication_records":0,"graph_snapshot":true,"references_resolved":true,"formal_links_present":true},"canonical_record":{"source":{"id":"2501.04227","kind":"arxiv","version":2},"metadata":{"license":"http://creativecommons.org/licenses/by/4.0/","primary_cat":"cs.HC","submitted_at":"2025-01-08T01:58:42Z","cross_cats_sorted":["cs.AI","cs.CL","cs.LG"],"title_canon_sha256":"2eea69674a838fac30e8504e6339ed62c38737ea128b881f59b4deb534b50845","abstract_canon_sha256":"60134504e062ed509777419fbe85f761cfbfe5d36590d086ae3fb354f16fa935"},"schema_version":"1.0"},"receipt":{"kind":"pith_receipt","key_id":"pith-v1-2026-05","algorithm":"ed25519","signed_at":"2026-05-17T23:38:15.178207Z","signature_b64":"Vl4hFhsidrBvrrOAjdIdf6+Aw5QzlZxZQ6ePxNxoCQqnbjcx+siorcwgiacGE3ZsTw032MLAOiZ3w/tDQBWMBg==","signed_message":"canonical_sha256_bytes","builder_version":"pith-number-builder-2026-05-17-v1","receipt_version":"0.3","canonical_sha256":"d183feb81f0781318084afb6f44ba9105750a156f2ea9504bd3b7b2c132f5642","last_reissued_at":"2026-05-17T23:38:15.177657Z","signature_status":"signed_v1","first_computed_at":"2026-05-17T23:38:15.177657Z","public_key_fingerprint":"8d4b5ee74e4693bcd1df2446408b0d54"},"graph_snapshot":{"paper":{"title":"Agent Laboratory: Using LLM Agents as Research Assistants","license":"http://creativecommons.org/licenses/by/4.0/","headline":"Agent Laboratory lets LLM agents carry out the full research process from idea to code repository and report.","cross_cats":["cs.AI","cs.CL","cs.LG"],"primary_cat":"cs.HC","authors_text":"Emad Barsoum, Jialian Wu, Jiang Liu, Michael Moor, Samuel Schmidgall, Xiaodong Yu, Ximeng Sun, Yusheng Su, Ze Wang, Zicheng Liu","submitted_at":"2025-01-08T01:58:42Z","abstract_excerpt":"Historically, scientific discovery has been a lengthy and costly process, demanding substantial time and resources from initial conception to final results. To accelerate scientific discovery, reduce research costs, and improve research quality, we introduce Agent Laboratory, an autonomous LLM-based framework capable of completing the entire research process. This framework accepts a human-provided research idea and progresses through three stages--literature review, experimentation, and report writing to produce comprehensive research outputs, including a code repository and a research report"},"claims":{"count":4,"items":[{"kind":"strongest_claim","text":"Agent Laboratory driven by o1-preview generates the best research outcomes; the generated machine learning code is able to achieve state-of-the-art performance compared to existing methods; human involvement significantly improves overall quality; and it achieves an 84% decrease in research expenses compared to previous autonomous research methods.","source":"verdict.strongest_claim","status":"machine_extracted","claim_id":"C1","attestation":"unclaimed"},{"kind":"weakest_assumption","text":"That the human evaluators invited to assess the outputs provide unbiased, reproducible judgments and that the SOTA comparisons use current, fairly matched baselines without post-hoc selection of tasks or metrics.","source":"verdict.weakest_assumption","status":"machine_extracted","claim_id":"C2","attestation":"unclaimed"},{"kind":"one_line_summary","text":"Agent Laboratory is an autonomous LLM framework that completes end-to-end research from idea to report and code, with human feedback improving quality and cutting expenses by 84% while reaching competitive ML performance.","source":"verdict.one_line_summary","status":"machine_extracted","claim_id":"C3","attestation":"unclaimed"},{"kind":"headline","text":"Agent Laboratory lets LLM agents carry out the full research process from idea to code repository and report.","source":"verdict.pith_extraction.headline","status":"machine_extracted","claim_id":"C4","attestation":"unclaimed"}],"snapshot_sha256":"f741f1c1c64402d0ccfad8a34c0307e0d3cd5cb7121634fe6c6308abffad8c5c"},"source":{"id":"2501.04227","kind":"arxiv","version":2},"verdict":{"id":"7ae86f5b-3942-4b95-8577-59240bbe2a77","model_set":{"reader":"grok-4.3"},"created_at":"2026-05-17T04:00:55.190729Z","strongest_claim":"Agent Laboratory driven by o1-preview generates the best research outcomes; the generated machine learning code is able to achieve state-of-the-art performance compared to existing methods; human involvement significantly improves overall quality; and it achieves an 84% decrease in research expenses compared to previous autonomous research methods.","one_line_summary":"Agent Laboratory is an autonomous LLM framework that completes end-to-end research from idea to report and code, with human feedback improving quality and cutting expenses by 84% while reaching competitive ML performance.","pipeline_version":"pith-pipeline@v0.9.0","weakest_assumption":"That the human evaluators invited to assess the outputs provide unbiased, reproducible judgments and that the SOTA comparisons use current, fairly matched baselines without post-hoc selection of tasks or metrics.","pith_extraction_headline":"Agent Laboratory lets LLM agents carry out the full research process from idea to code repository and report."},"references":{"count":16,"sample":[{"doi":"","year":2024,"title":"Agentclinic: A multimodal agent benchmark to evaluate ai in simulated clinical environments","work_id":"418a0992-6f06-45f9-958d-cfdfc51c72af","ref_index":1,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":null,"title":"Results, and 8. Discussion. Just create the scaffolding as compilable latex. Your title should start with Research Report: (title here) where title here is a title you choose. For author write Agent L","work_id":"aba8cca0-24cc-4bf5-9d01-4cb1c9ac1a9e","ref_index":2,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":null,"title":"This is not the place to critique the paper; the authors should generally agree with a well-written summary","work_id":"973affa8-48ee-42c5-8aed-54ee3b29f3d0","ref_index":3,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":null,"title":"Think of the things where a response from the author can change your opinion, clarify a confusion or address a limitation","work_id":"ec0635cf-b5d6-4eee-bea2-5b6afc0ba770","ref_index":4,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":null,"title":"Limitations: Have the authors adequately addressed the limitations and potential negative societal impact of their work? If not, please include constructive suggestions for improvement. In general, au","work_id":"5af2c341-8b04-4d89-9c0a-a6a77d543ef1","ref_index":5,"cited_arxiv_id":"","is_internal_anchor":false}],"resolved_work":16,"snapshot_sha256":"5cb5ef159b48654057dfd418c42fd081cef5102f5accd9c214de6df2230a61cd","internal_anchors":0},"formal_canon":{"evidence_count":3,"snapshot_sha256":"10f192cd2985e917a0a2472b95c5759eaf34540ad65df0cf8e34766a71742e98"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"},"aliases":[{"alias_kind":"arxiv","alias_value":"2501.04227","created_at":"2026-05-17T23:38:15.177744+00:00"},{"alias_kind":"arxiv_version","alias_value":"2501.04227v2","created_at":"2026-05-17T23:38:15.177744+00:00"},{"alias_kind":"doi","alias_value":"10.48550/arxiv.2501.04227","created_at":"2026-05-17T23:38:15.177744+00:00"},{"alias_kind":"pith_short_12","alias_value":"2GB75OA7A6AT","created_at":"2026-05-18T12:33:37.589309+00:00"},{"alias_kind":"pith_short_16","alias_value":"2GB75OA7A6ATDAEE","created_at":"2026-05-18T12:33:37.589309+00:00"},{"alias_kind":"pith_short_8","alias_value":"2GB75OA7","created_at":"2026-05-18T12:33:37.589309+00:00"}],"events":[],"event_summary":{},"paper_claims":[],"inbound_citations":{"count":24,"internal_anchor_count":24,"sample":[{"citing_arxiv_id":"2506.10060","citing_title":"Textual Bayes: Quantifying Prompt Uncertainty in LLM-Based Systems","ref_index":53,"is_internal_anchor":true},{"citing_arxiv_id":"2506.18841","citing_title":"LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning","ref_index":29,"is_internal_anchor":true},{"citing_arxiv_id":"2506.22653","citing_title":"URSA: The Universal Research and Scientific Agent","ref_index":18,"is_internal_anchor":true},{"citing_arxiv_id":"2507.11810","citing_title":"Evolving Roles of LLMs in Scientific Innovation: Assistant, Collaborator, Scientist, and Evaluator","ref_index":153,"is_internal_anchor":true},{"citing_arxiv_id":"2510.13896","citing_title":"GenCellAgent: Generalizable, Training-Free Cellular Image Segmentation via Large Language Model Agents","ref_index":38,"is_internal_anchor":true},{"citing_arxiv_id":"2512.06879","citing_title":"WisPaper: Your AI Scholar Search Engine","ref_index":17,"is_internal_anchor":true},{"citing_arxiv_id":"2601.14289","citing_title":"RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension","ref_index":4,"is_internal_anchor":true},{"citing_arxiv_id":"2601.15895","citing_title":"Co-Constructing Alignment: A Participatory Approach to Situate AI Values","ref_index":54,"is_internal_anchor":true},{"citing_arxiv_id":"2602.10154","citing_title":"PRISM-XR: Empowering Privacy-Aware XR Collaboration with Multimodal Large Language Models","ref_index":49,"is_internal_anchor":true},{"citing_arxiv_id":"2604.02360","citing_title":"Fighting AI with AI: AI-Agent Augmented DNS Blocking of LLM Services during Student Evaluations","ref_index":12,"is_internal_anchor":true},{"citing_arxiv_id":"2504.19678","citing_title":"From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review","ref_index":18,"is_internal_anchor":true},{"citing_arxiv_id":"2605.06607","citing_title":"AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents","ref_index":26,"is_internal_anchor":true},{"citing_arxiv_id":"2509.20328","citing_title":"Video models are zero-shot learners and reasoners","ref_index":6,"is_internal_anchor":true},{"citing_arxiv_id":"2604.03460","citing_title":"FermiLink: A Unified Agent Framework for Multidomain Autonomous Scientific Simulations","ref_index":14,"is_internal_anchor":true},{"citing_arxiv_id":"2605.10530","citing_title":"Personalized Deep Research: A User-Centric Framework, Dataset, and Hybrid Evaluation for Knowledge Discovery","ref_index":34,"is_internal_anchor":true},{"citing_arxiv_id":"2605.09915","citing_title":"Position: Academic Conferences are Potentially Facing Denominator Gaming Caused by Fully Automated Scientific Agents","ref_index":22,"is_internal_anchor":true},{"citing_arxiv_id":"2605.06607","citing_title":"AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents","ref_index":28,"is_internal_anchor":true},{"citing_arxiv_id":"2604.23136","citing_title":"How Researchers Navigate Accountability, Transparency, and Trust When Using AI Tools in Early-Stage Research: A Think-Aloud Study","ref_index":62,"is_internal_anchor":true},{"citing_arxiv_id":"2605.01489","citing_title":"SciResearcher: Scaling Deep Research Agents for Frontier Scientific Reasoning","ref_index":31,"is_internal_anchor":true},{"citing_arxiv_id":"2605.04097","citing_title":"CTM-AI: A Blueprint for General AI Inspired by a Model of Consciousness","ref_index":20,"is_internal_anchor":true},{"citing_arxiv_id":"2604.22861","citing_title":"IntrAgent: An LLM Agent for Content-Grounded Information Retrieval through Literature Review","ref_index":7,"is_internal_anchor":true},{"citing_arxiv_id":"2502.18864","citing_title":"Towards an AI co-scientist","ref_index":293,"is_internal_anchor":true},{"citing_arxiv_id":"2605.06607","citing_title":"AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents","ref_index":28,"is_internal_anchor":true},{"citing_arxiv_id":"2604.24198","citing_title":"Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis","ref_index":47,"is_internal_anchor":true}]},"formal_canon":{"evidence_count":3,"sample":[],"anchors":[]},"links":{"html":"https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB","json":"https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB.json","graph_json":"https://pith.science/api/pith-number/2GB75OA7A6ATDAEEV63PIS5JCB/graph.json","events_json":"https://pith.science/api/pith-number/2GB75OA7A6ATDAEEV63PIS5JCB/events.json","paper":"https://pith.science/paper/2GB75OA7"},"agent_actions":{"view_html":"https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB","download_json":"https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB.json","view_paper":"https://pith.science/paper/2GB75OA7","resolve_alias":"https://pith.science/api/pith-number/resolve?arxiv=2501.04227&json=true","fetch_graph":"https://pith.science/api/pith-number/2GB75OA7A6ATDAEEV63PIS5JCB/graph.json","fetch_events":"https://pith.science/api/pith-number/2GB75OA7A6ATDAEEV63PIS5JCB/events.json","actions":{"anchor_timestamp":"https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB/action/timestamp_anchor","attest_storage":"https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB/action/storage_attestation","attest_author":"https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB/action/author_attestation","sign_citation":"https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB/action/citation_signature","submit_replication":"https://pith.science/pith/2GB75OA7A6ATDAEEV63PIS5JCB/action/replication_record"}},"created_at":"2026-05-17T23:38:15.177744+00:00","updated_at":"2026-05-17T23:38:15.177744+00:00"}