pith. sign in

arxiv: 2603.13191 · v2 · pith:7U4TFAJInew · submitted 2026-03-13 · ⚛️ physics.comp-ph · cond-mat.mtrl-sci· cs.AI

From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Physics

classification ⚛️ physics.comp-ph cond-mat.mtrl-scics.AI
keywords knowledgecomputationalagentsai-drivendeviationexecutionfindingspatterns
0
0 comments X
read the original abstract

While large language models (LLMs) have transformed AI agents into proficient executors of computational materials science, performing a hundred simulations does not make a researcher. What distinguishes research from routine execution is the progressive accumulation of knowledge - learning which approaches fail, recognizing patterns across systems, and applying understanding to new problems. However, the prevailing paradigm in AI-driven computational science treats each execution in isolation, largely discarding hard-won insights between runs. Here we present QMatSuite, an open-source platform closing this gap. Agents record findings with full provenance, retrieve knowledge before new calculations, and in dedicated reflection sessions correct erroneous findings and synthesize observations into cross-compound patterns. In benchmarks on a six-step quantum-mechanical simulation workflow, accumulated knowledge reduces reasoning overhead by 67% and improves accuracy from 47% to 3% deviation from literature - and when transferred to an unfamiliar material, achieves 1% deviation with zero pipeline failures.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Grounded autonomous research: a fault-tolerant LLM pipeline from corpus to manuscript in frontier computational physics

    cs.AI 2026-07 unverdicted novelty 6.0

    An LLM pipeline with fresh-context sessions and literature calibration produces a publication-grade manuscript with three substantive findings on altermagnetic piezomagnetism from a corpus of 11,083 papers.