AutoMat benchmark shows current LLM coding agents achieve at most 54.1% success when reproducing computational materials science claims from papers.
Computer physics communications , volume=
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2representative citing papers
Molecular dynamics of a ramp-shoulder fluid shows water-like anomalies arising from cooperative radial restructuring and amorphous radial frustration rather than shell competition alone.
citing papers explorer
-
Can Coding Agents Reproduce Findings in Computational Materials Science?
AutoMat benchmark shows current LLM coding agents achieve at most 54.1% success when reproducing computational materials science claims from papers.
-
Amorphous Radial Frustration and Water-Like Anomalies in a Ramp-Shoulder Fluid
Molecular dynamics of a ramp-shoulder fluid shows water-like anomalies arising from cooperative radial restructuring and amorphous radial frustration rather than shell competition alone.