{"record_type":"pith_number_record","schema_url":"https://pith.science/schemas/pith-number/v1.json","pith_number":"pith:2023:CRLXGX6HLNAR5TGXA74634JOUB","short_pith_number":"pith:CRLXGX6H","schema_version":"1.0","canonical_sha256":"1457735fc75b411eccd707f9edf12ea06da375736e27cd84892fb3461752f2e3","source":{"kind":"arxiv","id":"2304.05376","version":5},"attestation_state":"computed","paper":{"title":"ChemCrow: Augmenting large-language models with chemistry tools","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"An LLM agent augmented with 18 chemistry tools autonomously plans and executes real syntheses.","cross_cats":["stat.ML"],"primary_cat":"physics.chem-ph","authors_text":"Andres M Bran, Andrew D White, Carlo Baldassari, Oliver Schilter, Philippe Schwaller, Sam Cox","submitted_at":"2023-04-11T17:41:13Z","abstract_excerpt":"Over the last decades, excellent computational chemistry tools have been developed. Integrating them into a single platform with enhanced accessibility could help reaching their full potential by overcoming steep learning curves. Recently, large-language models (LLMs) have shown strong performance in tasks across domains, but struggle with chemistry-related problems. Moreover, these models lack access to external knowledge sources, limiting their usefulness in scientific applications. In this study, we introduce ChemCrow, an LLM chemistry agent designed to accomplish tasks across organic synth"},"verification_status":{"content_addressed":true,"pith_receipt":true,"author_attested":false,"weak_author_claims":0,"strong_author_claims":0,"externally_anchored":false,"storage_verified":false,"citation_signatures":0,"replication_records":0,"graph_snapshot":true,"references_resolved":true,"formal_links_present":true},"canonical_record":{"source":{"id":"2304.05376","kind":"arxiv","version":5},"metadata":{"license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","primary_cat":"physics.chem-ph","submitted_at":"2023-04-11T17:41:13Z","cross_cats_sorted":["stat.ML"],"title_canon_sha256":"ae2a785a80cf17a151605a196b32c1d49db79270a7ba3453336ff86074d41507","abstract_canon_sha256":"dae4249d73fd6c7db0bfb592f86e05254a3e52b455d53f46513c9435b6d3eb76"},"schema_version":"1.0"},"receipt":{"kind":"pith_receipt","key_id":"pith-v1-2026-05","algorithm":"ed25519","signed_at":"2026-05-17T23:38:50.481646Z","signature_b64":"RPWCqdLkTpQcbj50LIlHK/GRtvujwKZ4SgkhfOuiNfuzbH/HFe8QFLPdCgSrrVBIuP591Q/n/uSRKXRqrDzgAQ==","signed_message":"canonical_sha256_bytes","builder_version":"pith-number-builder-2026-05-17-v1","receipt_version":"0.3","canonical_sha256":"1457735fc75b411eccd707f9edf12ea06da375736e27cd84892fb3461752f2e3","last_reissued_at":"2026-05-17T23:38:50.480970Z","signature_status":"signed_v1","first_computed_at":"2026-05-17T23:38:50.480970Z","public_key_fingerprint":"8d4b5ee74e4693bcd1df2446408b0d54"},"graph_snapshot":{"paper":{"title":"ChemCrow: Augmenting large-language models with chemistry tools","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"An LLM agent augmented with 18 chemistry tools autonomously plans and executes real syntheses.","cross_cats":["stat.ML"],"primary_cat":"physics.chem-ph","authors_text":"Andres M Bran, Andrew D White, Carlo Baldassari, Oliver Schilter, Philippe Schwaller, Sam Cox","submitted_at":"2023-04-11T17:41:13Z","abstract_excerpt":"Over the last decades, excellent computational chemistry tools have been developed. Integrating them into a single platform with enhanced accessibility could help reaching their full potential by overcoming steep learning curves. Recently, large-language models (LLMs) have shown strong performance in tasks across domains, but struggle with chemistry-related problems. Moreover, these models lack access to external knowledge sources, limiting their usefulness in scientific applications. In this study, we introduce ChemCrow, an LLM chemistry agent designed to accomplish tasks across organic synth"},"claims":{"count":4,"items":[{"kind":"strongest_claim","text":"Our agent autonomously planned and executed the syntheses of an insect repellent, three organocatalysts, and guided the discovery of a novel chromophore. Our evaluation, including both LLM and expert assessments, demonstrates ChemCrow's effectiveness in automating a diverse set of chemical tasks.","source":"verdict.strongest_claim","status":"machine_extracted","claim_id":"C1","attestation":"unclaimed"},{"kind":"weakest_assumption","text":"That the base large language model can reliably interpret tool outputs, avoid hallucinated chemistry, and produce valid multi-step plans without human correction or post-hoc filtering.","source":"verdict.weakest_assumption","status":"machine_extracted","claim_id":"C2","attestation":"unclaimed"},{"kind":"one_line_summary","text":"ChemCrow augments LLMs with 18 expert chemistry tools to autonomously plan and execute syntheses and guide molecular discoveries in organic synthesis, drug discovery, and materials design.","source":"verdict.one_line_summary","status":"machine_extracted","claim_id":"C3","attestation":"unclaimed"},{"kind":"headline","text":"An LLM agent augmented with 18 chemistry tools autonomously plans and executes real syntheses.","source":"verdict.pith_extraction.headline","status":"machine_extracted","claim_id":"C4","attestation":"unclaimed"}],"snapshot_sha256":"ade9d878fb72db3660108804e39a41b575c250ab665543a842662d956b8e76ae"},"source":{"id":"2304.05376","kind":"arxiv","version":5},"verdict":{"id":"ebd84408-7e96-4040-9ce7-126c9d56326d","model_set":{"reader":"grok-4.3"},"created_at":"2026-05-15T19:01:47.404607Z","strongest_claim":"Our agent autonomously planned and executed the syntheses of an insect repellent, three organocatalysts, and guided the discovery of a novel chromophore. Our evaluation, including both LLM and expert assessments, demonstrates ChemCrow's effectiveness in automating a diverse set of chemical tasks.","one_line_summary":"ChemCrow augments LLMs with 18 expert chemistry tools to autonomously plan and execute syntheses and guide molecular discoveries in organic synthesis, drug discovery, and materials design.","pipeline_version":"pith-pipeline@v0.9.0","weakest_assumption":"That the base large language model can reliably interpret tool outputs, avoid hallucinated chemistry, and produce valid multi-step plans without human correction or post-hoc filtering.","pith_extraction_headline":"An LLM agent augmented with 18 chemistry tools autonomously plans and executes real syntheses."},"references":{"count":118,"sample":[{"doi":"","year":2018,"title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding","work_id":"ed240a10-5b19-406c-baa5-30803f465785","ref_index":1,"cited_arxiv_id":"1810.04805","is_internal_anchor":true},{"doi":"","year":2020,"title":"D.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A., et al","work_id":"013cd642-ff3f-4b54-8bb1-47e827bbd8ec","ref_index":2,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2021,"title":"On the Opportunities and Risks of Foundation Models","work_id":"a18039e9-928d-47c9-a836-32656a71bf71","ref_index":3,"cited_arxiv_id":"2108.07258","is_internal_anchor":true},{"doi":"","year":2022,"title":"PaLM: Scaling Language Modeling with Pathways","work_id":"a94f3ef7-2c49-4445-93fe-6ec16aafd966","ref_index":4,"cited_arxiv_id":"2204.02311","is_internal_anchor":true},{"doi":"","year":2023,"title":"Sparks of Artificial General Intelligence: Early experiments with GPT-4","work_id":"a23cfe92-7f7c-424b-98d4-b386a83002fb","ref_index":5,"cited_arxiv_id":"2303.12712","is_internal_anchor":true}],"resolved_work":118,"snapshot_sha256":"a22d6927e3754deb45688b76416f3fc84fd4eb0ac1af6ee8f61668c1e674bf03","internal_anchors":13},"formal_canon":{"evidence_count":2,"snapshot_sha256":"afccc77099260e7bc07f3ed9197598956037d0c4babe2c66da342e79691215a5"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"},"aliases":[{"alias_kind":"arxiv","alias_value":"2304.05376","created_at":"2026-05-17T23:38:50.481052+00:00"},{"alias_kind":"arxiv_version","alias_value":"2304.05376v5","created_at":"2026-05-17T23:38:50.481052+00:00"},{"alias_kind":"doi","alias_value":"10.48550/arxiv.2304.05376","created_at":"2026-05-17T23:38:50.481052+00:00"},{"alias_kind":"pith_short_12","alias_value":"CRLXGX6HLNAR","created_at":"2026-05-18T12:33:33.725879+00:00"},{"alias_kind":"pith_short_16","alias_value":"CRLXGX6HLNAR5TGX","created_at":"2026-05-18T12:33:33.725879+00:00"},{"alias_kind":"pith_short_8","alias_value":"CRLXGX6H","created_at":"2026-05-18T12:33:33.725879+00:00"}],"events":[],"event_summary":{},"paper_claims":[],"inbound_citations":{"count":38,"internal_anchor_count":38,"sample":[{"citing_arxiv_id":"2605.23204","citing_title":"AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery","ref_index":102,"is_internal_anchor":true},{"citing_arxiv_id":"2503.21460","citing_title":"Large Language Model Agent: A Survey on Methodology, Applications and Challenges","ref_index":269,"is_internal_anchor":true},{"citing_arxiv_id":"2605.22287","citing_title":"SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules","ref_index":6,"is_internal_anchor":true},{"citing_arxiv_id":"2605.22343","citing_title":"Sibyl-AutoResearch: Autonomous Research Needs Self-Evolving Trial-and-Error Harnesses, Not Paper Generators","ref_index":7,"is_internal_anchor":true},{"citing_arxiv_id":"2507.21035","citing_title":"GenoMAS: A Multi-Agent Framework for Scientific Discovery via Code-Driven Gene Expression Analysis","ref_index":11,"is_internal_anchor":true},{"citing_arxiv_id":"2508.16112","citing_title":"IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra","ref_index":4,"is_internal_anchor":true},{"citing_arxiv_id":"2604.12253","citing_title":"A Scoping Review of Large Language Model-Based Pedagogical Agents","ref_index":10,"is_internal_anchor":true},{"citing_arxiv_id":"2605.16616","citing_title":"MLReplicate: Benchmarking Autonomous Research Systems for Machine Learning Reproducibility","ref_index":4,"is_internal_anchor":true},{"citing_arxiv_id":"2605.18747","citing_title":"Code as Agent Harness","ref_index":61,"is_internal_anchor":true},{"citing_arxiv_id":"2605.09366","citing_title":"Towards a Virtual Neuroscientist: Autonomous Neuroimaging Analysis via Multi-Agent Collaboration","ref_index":14,"is_internal_anchor":true},{"citing_arxiv_id":"2506.06921","citing_title":"Teaching Astronomy with Large Language Models","ref_index":12,"is_internal_anchor":true},{"citing_arxiv_id":"2509.20374","citing_title":"CFDLLMBench: A Benchmark Suite for Evaluating Large Language Models in Computational Fluid Dynamics","ref_index":11,"is_internal_anchor":true},{"citing_arxiv_id":"2601.12538","citing_title":"Agentic Reasoning for Large Language Models","ref_index":33,"is_internal_anchor":true},{"citing_arxiv_id":"2512.02393","citing_title":"Process-Centric Analysis of Agentic Software Systems","ref_index":10,"is_internal_anchor":true},{"citing_arxiv_id":"2512.06879","citing_title":"WisPaper: Your AI Scholar Search Engine","ref_index":5,"is_internal_anchor":true},{"citing_arxiv_id":"2602.16708","citing_title":"Formal Policy Enforcement for Real-World Agentic Systems","ref_index":7,"is_internal_anchor":true},{"citing_arxiv_id":"2305.18323","citing_title":"ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models","ref_index":8,"is_internal_anchor":true},{"citing_arxiv_id":"2605.12784","citing_title":"ToolMol: Evolutionary Agentic Framework for Multi-objective Drug Discovery","ref_index":3,"is_internal_anchor":true},{"citing_arxiv_id":"2308.11432","citing_title":"A Survey on Large Language Model based Autonomous Agents","ref_index":76,"is_internal_anchor":true},{"citing_arxiv_id":"2511.20857","citing_title":"Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory","ref_index":294,"is_internal_anchor":true},{"citing_arxiv_id":"2605.06607","citing_title":"AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents","ref_index":3,"is_internal_anchor":true},{"citing_arxiv_id":"2605.12784","citing_title":"ToolMol: Evolutionary Agentic Framework for Multi-objective Drug Discovery","ref_index":3,"is_internal_anchor":true},{"citing_arxiv_id":"2605.13762","citing_title":"EconAI: Dynamic Persona Evolution and Memory-Aware Agents in Evolving Economic Environments","ref_index":17,"is_internal_anchor":true},{"citing_arxiv_id":"2410.09024","citing_title":"AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents","ref_index":5,"is_internal_anchor":true},{"citing_arxiv_id":"2604.03361","citing_title":"The limits of bio-molecular modeling with large language models : a cross-scale evaluation","ref_index":14,"is_internal_anchor":true}]},"formal_canon":{"evidence_count":2,"sample":[],"anchors":[]},"links":{"html":"https://pith.science/pith/CRLXGX6HLNAR5TGXA74634JOUB","json":"https://pith.science/pith/CRLXGX6HLNAR5TGXA74634JOUB.json","graph_json":"https://pith.science/api/pith-number/CRLXGX6HLNAR5TGXA74634JOUB/graph.json","events_json":"https://pith.science/api/pith-number/CRLXGX6HLNAR5TGXA74634JOUB/events.json","paper":"https://pith.science/paper/CRLXGX6H"},"agent_actions":{"view_html":"https://pith.science/pith/CRLXGX6HLNAR5TGXA74634JOUB","download_json":"https://pith.science/pith/CRLXGX6HLNAR5TGXA74634JOUB.json","view_paper":"https://pith.science/paper/CRLXGX6H","resolve_alias":"https://pith.science/api/pith-number/resolve?arxiv=2304.05376&json=true","fetch_graph":"https://pith.science/api/pith-number/CRLXGX6HLNAR5TGXA74634JOUB/graph.json","fetch_events":"https://pith.science/api/pith-number/CRLXGX6HLNAR5TGXA74634JOUB/events.json","actions":{"anchor_timestamp":"https://pith.science/pith/CRLXGX6HLNAR5TGXA74634JOUB/action/timestamp_anchor","attest_storage":"https://pith.science/pith/CRLXGX6HLNAR5TGXA74634JOUB/action/storage_attestation","attest_author":"https://pith.science/pith/CRLXGX6HLNAR5TGXA74634JOUB/action/author_attestation","sign_citation":"https://pith.science/pith/CRLXGX6HLNAR5TGXA74634JOUB/action/citation_signature","submit_replication":"https://pith.science/pith/CRLXGX6HLNAR5TGXA74634JOUB/action/replication_record"}},"created_at":"2026-05-17T23:38:50.481052+00:00","updated_at":"2026-05-17T23:38:50.481052+00:00"}