From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Physics
read the original abstract
While large language models (LLMs) have transformed AI agents into proficient executors of computational materials science, performing a hundred simulations does not make a researcher. What distinguishes research from routine execution is the progressive accumulation of knowledge - learning which approaches fail, recognizing patterns across systems, and applying understanding to new problems. However, the prevailing paradigm in AI-driven computational science treats each execution in isolation, largely discarding hard-won insights between runs. Here we present QMatSuite, an open-source platform closing this gap. Agents record findings with full provenance, retrieve knowledge before new calculations, and in dedicated reflection sessions correct erroneous findings and synthesize observations into cross-compound patterns. In benchmarks on a six-step quantum-mechanical simulation workflow, accumulated knowledge reduces reasoning overhead by 67% and improves accuracy from 47% to 3% deviation from literature - and when transferred to an unfamiliar material, achieves 1% deviation with zero pipeline failures.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Grounded autonomous research: a fault-tolerant LLM pipeline from corpus to manuscript in frontier computational physics
An LLM pipeline with fresh-context sessions and literature calibration produces a publication-grade manuscript with three substantive findings on altermagnetic piezomagnetism from a corpus of 11,083 papers.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.