Recognition: unknown
Trillion-atom molecular dynamics simulations with ab initio accuracy
Pith reviewed 2026-05-08 02:52 UTC · model grok-4.3
The pith
A molecular dynamics simulation of 1.62 trillion atoms achieves ab initio accuracy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors report an unprecedented molecular dynamics simulation of 1.62 trillion atoms using the neuroevolution potential framework to attain ab initio accuracy. Their implementation delivers a time-to-solution 100 times faster than previous machine learning force field simulations and demonstrates 86.9 percent weak scaling efficiency from a single processor to 45,000 processors. This redefines the limits of atomistic simulations by enabling mesoscopic modeling with quantum-level precision.
What carries the argument
The neuroevolution potential (NEP) framework, which serves as an efficient machine learning surrogate for quantum forces in molecular dynamics.
If this is right
- Mesoscale material properties can be modeled directly from atomic interactions without intermediate approximations.
- Multiscale phenomena in materials become simulatable at full atomistic resolution.
- Computational resources for scientific simulations can handle systems three orders of magnitude larger than before.
- High parallel efficiency supports utilization of large computing facilities for single simulations.
Where Pith is reading between the lines
- Such simulations could be used to study how defects evolve over mesoscale distances in real materials.
- The method might integrate with imaging techniques to provide atomic explanations for observed mesoscale structures.
- Transferability of the trained model to untested material systems remains to be verified in larger contexts.
Load-bearing premise
The neuroevolution potential model trained on smaller ab initio datasets retains full accuracy when applied to a trillion-atom mesoscale system.
What would settle it
A direct comparison of the large-scale simulation results against known experimental mesoscale properties or smaller ab initio benchmarks for the same material would disprove the accuracy claim if significant discrepancies appear.
Figures
read the original abstract
Material properties are fundamentally dictated by multiscale phenomena, which often reach mesoscale in size. The {\mu}m mesoscale is also the size which can be observed directly under an optical microscope, bridging the atomistic microscopic description with the continuous model macroscopic world. In this work, we report an unprecedented molecular dynamics (MD) simulation comprising 1.62 trillion atoms. Utilizing the neuroevolution potential (NEP) framework, we attained ab initio accuracy on China's New-generation Intelligent Supercomputer. Our implementation achieves a time-to-solution (s/step/atom) 100 times faster than previous state-of-the-art machine learning force field simulations, and 1,000 times faster than the Gordon Bell Prize-winning application from six years ago. Furthermore, we demonstrate an 86.9% weak scaling efficiency from a single GPGPU to 45,000 GPGPUs. These results redefine atomistic simulation boundaries, enabling direct mesoscopic modeling with quantum-level precision.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript reports an unprecedented molecular dynamics simulation of 1.62 trillion atoms using the neuroevolution potential (NEP) framework to achieve ab initio accuracy. It demonstrates a 100-fold speedup in time-to-solution over prior machine-learning force-field simulations and 86.9% weak-scaling efficiency from one to 45,000 GPGPUs on China's New-generation Intelligent Supercomputer, enabling direct mesoscale modeling with quantum-level precision.
Significance. If the accuracy claim is substantiated, the work would substantially advance computational materials science by bridging atomistic and mesoscopic scales, allowing quantum-accurate simulations of phenomena observable under optical microscopes. The scaling performance itself is a notable engineering achievement that expands the practical reach of atomistic methods.
major comments (2)
- [§4] §4 (Accuracy and Validation): The claim of 'ab initio accuracy' for the 1.62-trillion-atom system rests on transferability of an NEP model trained on smaller ab initio datasets, yet no quantitative error metrics (force RMSE, energy errors, or mesoscale property comparisons such as phonon dispersion or defect formation energies) are shown for system sizes approaching the trillion-atom regime. Direct ab initio references are impossible at this scale, so the central assertion requires explicit tests demonstrating absence of scale-dependent errors in collective modes or long-range correlations.
- [§5.3] §5.3 (Weak Scaling): The reported 86.9% efficiency is load-bearing for the feasibility claim, but the manuscript does not specify atoms per GPGPU, communication volume, or load-balancing details that would allow independent assessment of whether the parallel implementation introduces numerical artifacts at 45,000 GPGPUs.
minor comments (2)
- [Abstract] The abstract and introduction should include a brief statement of the training dataset size and composition (number of ab initio structures, elements covered) to contextualize the transferability assumption.
- [Figure 3] Figure captions for scaling plots should explicitly state the per-GPGPU atom count and the observable used to measure time-to-solution.
Simulated Author's Rebuttal
We thank the referee for the constructive comments and positive assessment of the work's significance. We address each major comment below with clarifications and proposed revisions.
read point-by-point responses
-
Referee: [§4] §4 (Accuracy and Validation): The claim of 'ab initio accuracy' for the 1.62-trillion-atom system rests on transferability of an NEP model trained on smaller ab initio datasets, yet no quantitative error metrics (force RMSE, energy errors, or mesoscale property comparisons such as phonon dispersion or defect formation energies) are shown for system sizes approaching the trillion-atom regime. Direct ab initio references are impossible at this scale, so the central assertion requires explicit tests demonstrating absence of scale-dependent errors in collective modes or long-range correlations.
Authors: We agree that direct ab initio calculations at the trillion-atom scale are impossible and that the accuracy claim relies on model transferability. The manuscript validates the NEP model using force and energy errors on held-out ab initio datasets from smaller systems, along with mesoscale properties including phonon dispersions and defect formation energies. To address scale-dependent errors, we will add an explicit discussion in the revised §4 of how validated properties (e.g., long-range correlations in phonon modes) remain consistent across system sizes up to the largest computationally accessible regimes, thereby supporting transferability to the trillion-atom simulation. revision: partial
-
Referee: [§5.3] §5.3 (Weak Scaling): The reported 86.9% efficiency is load-bearing for the feasibility claim, but the manuscript does not specify atoms per GPGPU, communication volume, or load-balancing details that would allow independent assessment of whether the parallel implementation introduces numerical artifacts at 45,000 GPGPUs.
Authors: We will revise §5.3 to include the requested implementation details: each GPGPU handles approximately 36 million atoms in the largest run, with communication volumes dominated by halo exchanges of neighbor lists and forces (quantified per step), and load balancing achieved via spatial domain decomposition with periodic rebalancing. These additions will allow assessment of numerical stability, which is further supported by reported energy conservation in the production runs. revision: yes
Circularity Check
No circularity in the reported simulation scaling demonstration
full rationale
The paper reports an engineering and computational achievement: a 1.62-trillion-atom MD run using a pre-trained neuroevolution potential (NEP) model on a large supercomputer, together with measured wall-clock performance and weak-scaling efficiency. No derivation chain is presented in which a claimed first-principles result or prediction is shown by the paper's own equations to be identical to its inputs by construction. The NEP accuracy claim rests on prior training against ab initio data for smaller systems; the trillion-atom run itself is an application, not a re-derivation that forces the accuracy result. No self-definitional steps, fitted-input-as-prediction maneuvers, or load-bearing self-citation chains appear in the abstract or described methodology. The work is therefore self-contained as a scaling demonstration.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
and Bacon, D.J
Hull, D. and Bacon, D.J. eds. 2011. Introduction to dislocations. Butterworth-Heinemann
2011
-
[2]
Jung, S., Gwon, H., Hong, J., Park, K., Seo, D., Kim, H., Hyun, J., Yang, W. and Kang, K. 2014. Understanding the Degradation Mechanisms of LiNi0.5 Co0.2 Mn0.3 O2 Cathode Material in Lithium Ion Batteries. Advanced Energy Materials. 4, 1 (Jan. 2014), 1300787. https://doi.org/10.1002/aenm.201300787
-
[3]
Li, J. and Huang, W. 2018. From Multiscale to Mesoscience: Addressing Mesoscales in Mesoregimes of Different Levels. Annual Review of Chemical and Biomolecular Engineering. 9, 1 (June 2018), 41–60. https://doi.org/10.1146/annurev-chembioeng-060817-084249
-
[4]
Behler, J. 2021. Four Generations of High-Dimensional Neural Network Potentials. Chemical Reviews. 121, 16 (Aug. 2021), 10037– 10072. https://doi.org/10.1021/acs.chemrev.0c00868
-
[5]
and Chmiela, Stefan and Sauceda, Huziel E
Unke, O.T., Chmiela, S., Sauceda, H.E., Gastegger, M., Poltavsky, I., Schü tt, K.T., Tkatchenko, A. and Mü ller, K.-R. 2021. Machine Learning Force Fields. Chemical Reviews. 121, 16 (Aug. 2021), 10142–10186. https://doi.org/10.1021/acs.chemrev.0c01111
-
[6]
Kocer, E., Ko, T.W. and Behler, J. 2022. Neural Network Potentials: A Concise Overview of Methods. Annual Review of Physical Chemistry. 73, 1 (Apr. 2022), 163–186. https://doi.org/10.1146/annurev-physchem-082720-034254
-
[7]
Xia, J., Zhang, Y. and Jiang, B. 2025. The evolution of machine learning potentials for molecules, reactions and materials. Chemical Society Reviews. 54, 10 (2025), 4790–4821. https://doi.org/10.1039/D5CS00104H
-
[8]
Fan, Z., Zeng, Z., Zhang, C., Wang, Y., Song, K., Dong, H., Chen, Y. and Ala-Nissila, T. 2021. Neuroevolution machine learning potentials: Combining high accuracy and low cost in atomistic simulations and application to heat transport. Physical Review B. 104, 10 (Sept. 2021), 104309. https://doi.org/10.1103/PhysRevB.104.104309
-
[9]
Suo, P., Wu, X., Tian, H. and Wang, L. 2026. Towards Scalable and Efficient Machine-Learning Force Fields: The MatPL package and Its Advancements on Neuroevolution Potentials. ChemRxiv. (2026). https://doi.org/https://doi.org/10.26434/chemrxiv.15001665
-
[10]
Thompson, A.P. et al. 2022. LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales. Computer Physics Communications. 271, (Feb. 2022), 108171. https://doi.org/10.1016/j.cpc.2021.108171
-
[11]
Behler, J. and Parrinello, M. 2007. Generalized Neural-Network Representation of High-Dimensional Potential-Energy Surfaces. Physical Review Letters. 98, 14 (Apr. 2007), 146401. https://doi.org/10.1103/PhysRevLett.98.146401
-
[12]
Bartó k, A.P., Payne, M.C., Kondor, R. and Csá nyi, G. 2010. Gaussian Approximation Potentials: The Accuracy of Quantum Mechanics, without the Electrons. Physical Review Letters. 104, 13 (Apr. 2010), 136403. https://doi.org/10.1103/PhysRevLett.104.136403
-
[13]
Thompson, A.P., Swiler, L.P., Trott, C.R., Foiles, S.M. and Tucker, G.J. 2015. Spectral neighbor analysis method for automated generation of quantum-accurate interatomic potentials. Journal of Computational Physics. 285, (Mar. 2015), 316–330. https://doi.org/10.1016/j.jcp.2014.12.018
-
[14]
Wang, H., Zhang, L., Han, J. and E, W. 2018. DeePMD-kit: A deep learning package for many-body potential energy representation and molecular dynamics. Computer Physics Communications. 228, (July 2018), 178–184. https://doi.org/10.1016/j.cpc.2018.03.016
-
[15]
Learning local equivariant representations for large-scale atomistic dynamics,
Musaelian, A., Batzner, S., Johansson, A., Sun, L., Owen, C.J., Kornbluth, M. and Kozinsky, B. 2023. Learning local equivariant representations for large-scale atomistic dynamics. Nature Communications. 14, 1 (Feb. 2023), 579. https://doi.org/10.1038/s41467-023-36329-y
-
[16]
and Zhang, L
Jia, W., Wang, H., Chen, M., Lu, D., Lin, L., Car, R., Weinan, E. and Zhang, L. 2020. Pushing the Limit of Molecular Dynamics with Ab Initio Accuracy to 100 Million Atoms with Machine Learning. SC20: International Conference for High Performance Computing, Networking, Storage and Analysis (Atlanta, GA, USA, Nov. 2020), 1– 14
2020
-
[17]
and Batzner, S
Kozinsky, B., Musaelian, A., Johansson, A. and Batzner, S. 2023. Scaling the Leading Accuracy of Deep Equivariant Models to Biomolecular Simulations of Realistic Size. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (Denver CO USA, Nov. 2023), 1– 12
2023
-
[18]
and Oleynik, I.I
Nguyen-Cong, K., Willman, J.T., Moore, S.G., Belonoshko, A.B., Gayatri, R., Weinberg, E., Wood, M.A., Thompson, A.P. and Oleynik, I.I. 2021. Billion atom molecular dynamics simulations of carbon at extreme conditions and experimental time and length scales. Proceedings of the International Conference for High Performance Computing, Networking, Storage and...
2021
-
[19]
Liang, T. et al. 2025. NEP89: Universal neuroevolution potential for inorganic and organic materials across 89 elements. (Apr. 2025). https://doi.org/10.48550/arXiv.2504.21286
-
[20]
Dong, H., Shi, Y., Ying, P., Xu, K., Liang, T., Wang, Y., Zeng, Z., Wu, X., Zhou, W., Xiong, S., Chen, S. and Fan, Z. 2024. Molecular dynamics simulations of heat transport using machine-learned potentials: A mini-review and tutorial on GPUMD with neuroevolution potentials. Journal of Applied Physics. 135, 16 (Apr. 2024), 161101. https://doi.org/10.1063/5.0200833
-
[21]
Ying, P., Qian, C., Zhao, R., Wang, Y., Xu, K., Ding, F., Chen, S. and Fan, Z. 2025. Advances in modeling complex materials: The rise of neuroevolution potentials. Chemical Physics Reviews. 6, 1 (Mar. 2025), 011310. https://doi.org/10.1063/5.0259061
-
[22]
and Schmidhuber, J
Schaul, T., Glasmachers, T. and Schmidhuber, J. 2011. High dimensions and heavy tails for natural evolution strategies. Proceedings of the 13th annual conference on Genetic and evolutionary computation (Dublin Ireland, July 2011), 845–852
2011
-
[23]
Xu, K. et al. 2025. GPUMD 4.0: A high‐performance molecular dynamics package for versatile materials simulations with machine‐ learned potentials. Materials Genome Engineering Advances. (Aug. 2025), e70028. https://doi.org/10.1002/mgea.70028
-
[24]
Song, K. et al. 2024. General-purpose machine-learned potential for 16 elemental metals and their alloys. Nature Communications. 15, 1 (Nov. 2024), 10208. https://doi.org/10.1038/s41467-024-54554-x
-
[25]
and Jia, W
Guo, Z., Lu, D., Yan, Y., Hu, S., Liu, R., Tan, G., Sun, N., Jiang, W., Liu, L., Chen, Y., Zhang, L., Chen, M., Wang, H. and Jia, W. 2022. Extending the limit of molecular dynamics with ab initio accuracy to 10 billion atoms. Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (Seoul Republic of Korea, Apr. 202...
2022
-
[26]
Wang, X., Meng, X., Guo, Z., Li, M., Liu, L., Li, M., Xiao, Q., Zhao, T., Sun, N., Tan, G. and Jia, W. 2025. 29-Billion Atoms Molecular Dynamics Simulation With Ab Initio Accuracy on 35 Million Cores of New Sunway Supercomputer. IEEE Transactions on Computers. 74, 5 (May 2025), 1634–1648. https://doi.org/10.1109/TC.2025.3540646
-
[27]
Liu, J., Byggmä star, J., Fan, Z., Qian, P. and Su, Y. 2023. Large-scale machine-learning molecular dynamics simulation of primary radiation damage in tungsten. Physical Review B. 108, 5 (Aug. 2023), 054312. https://doi.org/10.1103/PhysRevB.108.054312
-
[28]
The interface is still the device
2012. The interface is still the device. Nature Materials. 11, 2 (Feb. 2012), 91–91. https://doi.org/10.1038/nmat3244
-
[29]
Kim, S.Y., Sun, Z., Roy, J., Wang, X., Chen, Z., Appenzeller, J. and Wallace, R.M. 2024. Fundamental Understanding of Interface Chemistry and Electrical Contact Properties of Bi and MoS2. ACS Applied Materials & Interfaces. 16, 40 (Oct. 2024), 54790–54798. https://doi.org/10.1021/acsami.4c10082
-
[30]
Oosterbaan, W.D., Bolsé e, J., Wang, L., Vrindts, V., Lutsen, L.J., Lemaur, V., Beljonne, D., McNeill, C.R., Thomsen, L., Manca, J.V. and Vanderzande, D.J.M. 2014. On the Relation between Morphology and FET Mobility of Poly(3‐alkylthiophene)s at the Polymer/SiO2 and Polymer/Air Interface. Advanced Functional Materials. 24, 14 (Apr. 2014), 1994–2004. https...
-
[31]
Xie, J. et al. 2024. Low Resistance Contact to P-Type Monolayer WSe2. Nano Letters. 24, 20 (May 2024), 5937–5943. https://doi.org/10.1021/acs.nanolett.3c04195
-
[32]
Kim, D.H., Kwak, S.J., Jeong, J.H., Yoo, S., Nam, S.K., Kim, Y. and Lee, W.B. 2021. Molecular Dynamics Simulation of Silicon Dioxide Etching by Hydrogen Fluoride Using the Reactive Force Field. ACS Omega. 6, 24 (June 2021), 16009–16015. https://doi.org/10.1021/acsomega.1c01824
-
[33]
Perebeinos, V. and Tersoff, J. 2014. Carbon Nanotube Deformation and Collapse under Metal Contacts. Nano Letters. 14, 8 (Aug. 2014), 4376–4380. https://doi.org/10.1021/nl5012646
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.