pith. sign in

arxiv: 2509.11703 · v2 · submitted 2025-09-15 · ⚛️ physics.comp-ph · cond-mat.dis-nn· cond-mat.mtrl-sci

AiiDA-TrainsPot: Towards automated training of neural-network interatomic potentials

Pith reviewed 2026-05-18 17:04 UTC · model grok-4.3

classification ⚛️ physics.comp-ph cond-mat.dis-nncond-mat.mtrl-sci
keywords neural-network interatomic potentialsactive learningautomated workflowdensity functional theorycommittee disagreementcarbon allotropesalloy phase transitionsuncertainty quantification
0
0 comments X p. Extension

The pith

An automated workflow calibrates committee disagreement on the fly to select ab initio calculations reliably when training neural-network interatomic potentials.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces AiiDA-TrainsPot, a modular open-source workflow that orchestrates density-functional-theory calculations, data augmentation, and molecular dynamics to create neural-network interatomic potentials with less manual intervention. Its central active-learning strategy calibrates committee disagreement against reference ab initio errors during the process itself, producing uncertainty estimates that reduce both false positives and false negatives in deciding which new structures need expensive first-principles runs. Electronic-structure descriptors and dimensionality reduction are then used to check how well this calibrated criterion covers relevant configuration space in the examples shown. The workflow supports training from scratch or fine-tuning existing models and is demonstrated on carbon allotropes including amorphous phases and on phase transitions in WxMo1-xTe2 monolayers. A reader would care because the approach lowers the expertise barrier for producing accurate potentials that can be used in larger-scale simulations.

Core claim

AiiDA-TrainsPot automates the creation of neural-network interatomic potentials by linking DFT computations with active learning. The key advance is on-the-fly calibration of committee disagreement against ab initio reference errors, which yields reliable uncertainty estimates. This calibrated measure, validated through electronic-structure descriptors and dimensionality reduction, minimizes both false positives and false negatives when choosing structures for further first-principles evaluation across the tested carbon and alloy systems.

What carries the argument

On-the-fly calibration of committee disagreement against ab initio reference errors, which supplies reliable uncertainty estimates that guide active-learning decisions on which structures to compute from first principles.

If this is right

  • The same calibrated criterion supports both training potentials from scratch and fine-tuning foundation models.
  • The workflow successfully handles pristine, defective, and amorphous carbon structures as well as alloy phase transitions.
  • Modular design allows swapping in different neural-network interatomic potential backends without rewriting the active-learning loop.
  • On-the-fly calibration reduces the number of expensive ab initio calculations needed while maintaining coverage of relevant configurations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The calibration approach could be tested on systems outside the demonstrated carbon and dichalcogenide cases to check transferability of the uncertainty estimates.
  • Lower false-negative rates might allow safer use of the resulting potentials in long molecular-dynamics runs where rare events matter.
  • Because the workflow is built on an existing automation platform, it could be combined with high-throughput material-screening campaigns that already generate large structure databases.

Load-bearing premise

Electronic-structure descriptors combined with dimensionality reduction can reliably validate the calibrated committee disagreement criterion without introducing selection bias in the active-learning loop.

What would settle it

Apply the workflow to a new material system, collect the structures it selects versus those it skips, and check whether many skipped structures later show large actual errors when computed ab initio or whether many selected structures prove unnecessary.

Figures

Figures reproduced from arXiv: 2509.11703 by Antimo Marrazzo, Davide Bidoggia, Maria Peressi, Nataliia Manko.

Figure 1
Figure 1. Figure 1: FIG. 1 [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2 [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3 [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4 [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5 [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6. Energy as a function of interlayer separation in [PITH_FULL_IMAGE:figures/full_fig_p011_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7. Equations of state. Left panel: Binding energy per atom as a function of nearest-neighbor bond distance for various [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: FIG. 8. Phonon dispersion and density of states for graphene, [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗
read the original abstract

Crafting neural-network interatomic potentials (NNIPs) remains a complex task, demanding specialized expertise in both machine learning and electronic-structure calculations. Here, we introduce AiiDA-TrainsPot, an automated, open-source, and user-friendly workflow that streamlines the creation of accurate NNIPs by orchestrating density-functional-theory calculations, data augmentation strategies, and classical molecular dynamics. Our active-learning strategy leverages on-the-fly calibration of committee disagreement against ab initio reference errors to ensure reliable uncertainty estimates. We use electronic-structure descriptors and dimensionality reduction to analyze the efficiency of this calibrated criterion, and show that it minimizes both false positives and false negatives when deciding what to compute from first principles. AiiDA-TrainsPot has a modular design that supports multiple NNIP backends, enabling both the training of NNIPs from scratch and the fine-tuning of foundation models. We demonstrate its capabilities through automated training campaigns targeting pristine and defective carbon allotropes, including amorphous carbon, as well as structural phase transitions in monolayer $\mathrm{W_xMo_{1-x}Te_2}$ alloys.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The manuscript introduces AiiDA-TrainsPot, an automated open-source workflow built on AiiDA for training neural-network interatomic potentials (NNIPs). It integrates DFT calculations, data augmentation, and molecular dynamics, with a core active-learning loop that performs on-the-fly calibration of committee disagreement against ab initio reference errors to produce uncertainty estimates. Electronic-structure descriptors combined with dimensionality reduction are used to analyze the efficiency of this calibrated criterion on carbon allotropes (including amorphous carbon) and WxMo1-xTe2 alloy systems, with the claim that the approach minimizes both false positives and false negatives when selecting structures for first-principles evaluation. The workflow is modular and supports training from scratch as well as fine-tuning of foundation models.

Significance. If the on-the-fly calibration produces genuinely independent and reliable uncertainty estimates that generalize beyond the training data, the work would meaningfully advance automated, reproducible NNIP development for complex materials systems. The demonstrations on defective carbon and alloy phase transitions, together with support for multiple backends, would provide practical value for reducing expert intervention in potential training campaigns.

major comments (1)
  1. [Results section on descriptor-based analysis of the calibrated criterion] The section describing the analysis of the calibrated committee disagreement (via electronic-structure descriptors and dimensionality reduction) must explicitly state whether this validation is performed on a fully independent hold-out set of structures or on configurations already visited or selected by the active-learning procedure itself. If the reduced descriptor space is constructed from the same committee evaluations used for calibration, the reported minimization of false positives and false negatives risks selection bias and does not independently confirm the reliability of the uncertainty estimates.
minor comments (2)
  1. [Abstract] The abstract states that the calibrated criterion 'minimizes both false positives and false negatives' but provides no quantitative thresholds, error metrics, or cross-validation details; a brief clarification of the decision rule and evaluation protocol would improve clarity.
  2. [Methods] Ensure that the precise definition of the committee (number of models, training protocol, and disagreement metric) is given in a dedicated methods subsection or table so that the on-the-fly calibration procedure can be reproduced without ambiguity.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their careful reading of the manuscript and for the constructive comment on the descriptor-based analysis. We have revised the relevant section to explicitly address the nature of the dataset used and the implications for interpreting the results.

read point-by-point responses
  1. Referee: [Results section on descriptor-based analysis of the calibrated criterion] The section describing the analysis of the calibrated committee disagreement (via electronic-structure descriptors and dimensionality reduction) must explicitly state whether this validation is performed on a fully independent hold-out set of structures or on configurations already visited or selected by the active-learning procedure itself. If the reduced descriptor space is constructed from the same committee evaluations used for calibration, the reported minimization of false positives and false negatives risks selection bias and does not independently confirm the reliability of the uncertainty estimates.

    Authors: We thank the referee for this observation. The analysis presented in the Results section was performed on configurations visited or selected during the active-learning procedure itself, as the on-the-fly calibration of committee disagreement is an integral part of the workflow and occurs on the structures encountered in each training campaign. The reduced descriptor space is therefore constructed from the same set of committee evaluations used for calibration. We agree that this constitutes an in-sample analysis and carries a risk of selection bias; it does not provide an independent confirmation of the uncertainty estimates on fully unseen data. The intent of the section is to demonstrate the practical efficiency of the calibrated criterion in minimizing false positives and false negatives within the automated campaigns on carbon allotropes and the alloy systems, rather than to claim external validation. In the revised manuscript we have added explicit statements clarifying the in-sample nature of the analysis and discussing its limitations for assessing generalization. We believe this improves transparency without altering the reported findings. revision: yes

Circularity Check

0 steps flagged

No significant circularity in the presented workflow or validation chain.

full rationale

The abstract describes an on-the-fly calibration of committee disagreement against ab initio reference errors as part of the active-learning strategy, followed by separate analysis using electronic-structure descriptors and dimensionality reduction to assess efficiency on carbon allotropes and alloy systems. No equations or explicit statements in the provided text demonstrate that calibration parameters or decision thresholds are fitted directly to the same data used for the final NNIP training or that the descriptor-based validation reduces to the active-learning selections by construction. The workflow is presented as modular with demonstrations on specific targets, and the analysis functions as an external check rather than a self-referential loop. No load-bearing self-citations, imported uniqueness theorems, or ansatz smuggling are evident. The derivation chain remains self-contained against the described benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on standard DFT accuracy for reference data, the validity of committee disagreement as a proxy for error, and the effectiveness of electronic-structure descriptors for analysis; no new free parameters or invented entities are introduced in the abstract.

axioms (2)
  • domain assumption DFT calculations provide reliable reference errors for calibrating committee disagreement.
    Invoked when using ab initio data to calibrate the active learning criterion.
  • domain assumption Committee disagreement can be mapped to actual prediction errors via on-the-fly calibration.
    Central to the active-learning strategy described in the abstract.

pith-pipeline@v0.9.0 · 5739 in / 1427 out tokens · 45674 ms · 2026-05-18T17:04:35.760060+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Spin Dynamics from Atomistic Quantum Simulations

    cond-mat.mtrl-sci 2026-05 unverdicted novelty 6.0

    Kubo theory yields T1 and T2 from spin-lattice correlation functions; these are evaluated with ML-driven ab initio molecular dynamics and match experimental T1 values for the NV center in diamond.

Reference graph

Works this paper leans on

73 extracted references · 73 canonical work pages · cited by 1 Pith paper

  1. [1]

    open), cell parameters, atomic species and atomic positions

    Input structures AiiDA-TrainsPot can start from a small set of ini- tial atomistic structures{X } (0), determined by boundary conditions (periodic vs. open), cell parameters, atomic species and atomic positions. The number and diversity of input structures should reflect the target applications: for example, the study of temperature-dependent prop- erties...

  2. [2]

    Dataset augmentation In the dataset augmentation stage, additional struc- tures are generated by manipulating the initial set {X }(0). All manipulations can be controlled through cus- tomizable parameters to tailor the augmentation process according to specific user needs; we group them in the following categories: •Supercells: Initial structures are repl...

  3. [3]

    Ab Initio Labelling After the data augmentation stage, AiiDA-TrainsPot starts the active learning loop, which is represented by the orange circle in Fig. 1. Each structureX i in the augmented dataset is labeled through DFT calcula- tions to obtain high-fidelity reference values for energies, forces, and stress tensors. We use the compact notation LDF T (X...

  4. [4]

    Training neural-network interatomic potentials The labeled dataset Xi,L DF T (Xi) (1) is used to train a committee ofMNNIPs{Φ j}M j=1, each with identical architecture but initialized with different random seeds. Prior to training, all structures are systematically par- titioned into three subsets, ensuring representative sam- pling across different struc...

  5. [5]

    Exploration by molecular dynamics After a committee of NNIPs is trained, the workflow employs MD simulations to systematically explore the potential energy landscape. This exploration phase is critical for identifying configurations where the NNIPs might have insufficient accuracy, thus guiding the selec- tion of additional structures forab initiocalculat...

  6. [6]

    fast explo- ration

    Committee Evaluation This stage aims at identifying structures that are poorly predicted by the NNIPs; those are good candi- dates to be labeled withab initiocalculations and in- cluded in the training dataset. However, while Bayesian neural networks (NNs) come with a well-defined proba- bilistic uncertainty quantification, no such Bayesian error estimati...

  7. [7]

    fast exploration

    to 9,537 structures in the final iteration. For energy and stress tensor components, we observe a consistent decrease in prediction errors as the active learning pro- gresses. Interestingly, errors on forces increase from the first to the second iteration, and then decrease monoton- ically for all the following iterations, suggesting that the active learn...

  8. [8]

    fast”/“stringent

    or DFT (row 2). The third and fourth rows correspond to structures relaxed with DFT, with subsequent energy evalu- ation using the NNIP (row 3) or DFT (row 4). To assess the transferability of our NNIP to defective structures, we compute the formation energies of three representative point defects in graphene: the monova- cancy, divacancy, and Stone-Wales...

  9. [9]

    MD simulations explored tempera- tures ranging from 0 to 5000 K and pressures from−5 to 5 kbar

    for k-point grid (λ= 0.30 ˚A−1) and smearing (σcold = 0.0275Ry). MD simulations explored tempera- tures ranging from 0 to 5000 K and pressures from−5 to 5 kbar. Forrun B, dataset augmentation parameters were op- timized for near-equilibrium conditions:rattle fraction was reduced to 0.3, while strain ranges were increased tomax compressive strainof 0.3 and...

  10. [10]

    MaX - Materials De- sign at the Exascale

    in LAMMPS. Active learning thresholds on energy, forces, and stress tensor were set to 2 meV, 50 meV/ ˚A, 14 10 meV/˚A3, respectively, with a maximum of 1000 struc- tures selected per iteration. IV. DATA AVAILABILITY The training datasets and trained models are available on the Materials Cloud Archive [63]. V. CODE AVAILABILITY AiiDA-TrainsPot is availabl...

  11. [11]

    Car and M

    R. Car and M. Parrinello, Unified Approach for Molecu- lar Dynamics and Density-Functional Theory, Phys. Rev. Lett.55, 2471 (1985)

  12. [12]

    V. L. Deringer, M. A. Caro, and G. Cs´ anyi, Ma- chine Learning Interatomic Potentials as Emerging Tools for Materials Science, Advanced Materials31, 1902765 (2019)

  13. [13]

    Batzner, A

    S. Batzner, A. Musaelian, L. Sun, M. Geiger, J. P. Mailoa, M. Kornbluth, N. Molinari, T. E. Smidt, and B. Kozinsky, E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials, Nature Communications13, 2453 (2022)

  14. [14]

    Musaelian, S

    A. Musaelian, S. Batzner, A. Johansson, L. Sun, C. J. Owen, M. Kornbluth, and B. Kozinsky, Learning local equivariant representations for large-scale atomistic dy- namics, Nature Communications14, 579 (2023)

  15. [15]

    Batatia, D

    I. Batatia, D. P. Kovacs, G. N. C. Simm, C. Ortner, and G. Csanyi, MACE: Higher Order Equivariant Message Passing Neural Networks for Fast and Accurate Molec- ular Force Fields, Advances in Neural Information Pro- cessing Systems35, 11423 (2022)

  16. [16]

    D. P. Kov´ acs, J. H. Moore, N. J. Browning, I. Batatia, J. T. Horton, Y. Pu, V. Kapil, W. C. Witt, I.-B. Magd˘ au, D. J. Cole, and G. Cs´ anyi, MACE-OFF: Transferable Short Range Machine Learning Force Fields for Organic Molecules (2025), arXiv:2312.15211 [physics.chem-ph]

  17. [17]

    Mazitov, F

    A. Mazitov, F. Bigi, M. Kellner, P. Pegolo, D. Tisi, G. Fraux, S. Pozdnyakov, P. Loche, and M. Ceri- otti, PET-MAD, a lightweight universal interatomic potential for advanced materials modeling (2025), arXiv:2503.14118 [cond-mat.mtrl-sci]

  18. [18]

    Zhang, H

    Y. Zhang, H. Wang, W. Chen, J. Zeng, L. Zhang, H. Wang, and W. E, DP-GEN: A concurrent learning platform for the generation of reliable deep learning based potential energy models, Computer Physics Communica- tions253, 107206 (2020)

  19. [19]

    K. T. Sch¨ utt, P. Kessel, M. Gastegger, K. A. Nicoli, A. Tkatchenko, and K.-R. M¨ uller, SchNetPack: A Deep Learning Toolbox For Atomistic Systems, Journal of Chemical Theory and Computation15, 448–455 (2018)

  20. [20]

    Talirz, S

    L. Talirz, S. Kumbhar, E. Passaro, A. V. Yakutovich, V. Granata, F. Gargiulo, M. Borelli, M. Uhrin, S. P. Huber, S. Zoupanos, C. S. Adorf, C. W. Andersen, O. Sch¨ utt, C. A. Pignedoli, D. Passerone, J. VandeVon- dele, T. C. Schulthess, B. Smit, G. Pizzi, and N. Marzari, Materials Cloud, a platform for open computational sci- ence, Scientific Data7, 299 (2020)

  21. [21]

    A. Jain, S. P. Ong, G. Hautier, W. Chen, W. D. Richards, S. Dacek, S. Cholia, D. Gunter, D. Skinner, G. Ceder, and K. A. Persson, The Materials Project: A materials genome approach to accelerating materials innovation, APL Materials1, 011002 (2013)

  22. [22]

    Bergerhoff, R

    G. Bergerhoff, R. Hundt, R. Sievers, and I. D. Brown, The inorganic crystal structure data base, Journal of Chemical Information and Computer Sciences23, 66 (1983)

  23. [23]

    Graˇ zulis, D

    S. Graˇ zulis, D. Chateigner, R. T. Downs, A. F. T. Yokochi, M. Quir´ os, L. Lutterotti, E. Manakova, J. Butkus, P. Moeck, and A. Le Bail, Crystallogra- phy Open Database—an open-access collection of crystal structures, Journal of Applied Crystallography42, 726 (2009). 15

  24. [24]

    Graˇ zulis, A

    S. Graˇ zulis, A. Daˇ skeviˇ c, A. Merkys, D. Chateigner, L. Lutterotti, M. Quir´ os, N. R. Serebryanaya, P. Moeck, R. T. Downs, and A. Le Bail, Crystallography Open Database (COD): an open-access collection of crystal structures and platform for world-wide collaboration, Nucleic Acids Research40, D420 (2011)

  25. [25]

    Villars, N

    P. Villars, N. Onodera, and S. Iwata, The linus pauling file (lpf) and its application to materials design, Journal of Alloys and Compounds279, 1 (1998)

  26. [26]

    Villars, M

    P. Villars, M. Berndt, K. Brandenburg, K. Cenzual, J. Daams, F. Hulliger, T. Massalski, H. Okamoto, K. Os- aki, A. Prince, H. Putz, and S. Iwata, The pauling file, binaries edition, Journal of Alloys and Compounds367, 293 (2004)

  27. [27]

    Blokhin and P

    E. Blokhin and P. Villars, The PAULING FILE Project and Materials Platform for Data Science: From Big Data Toward Materials Genome, inHandbook of Materials Modeling : Methods: Theory and Modeling, edited by W. Andreoni and S. Yip (Springer International Pub- lishing, Cham, 2018) pp. 1–26

  28. [28]

    Prandini, A

    G. Prandini, A. Marrazzo, I. E. Castelli, N. Mounet, and N. Marzari, Precision and efficiency in solid-state pseu- dopotential calculations, npj Computational Materials4, 72 (2018)

  29. [29]

    P. E. Bl¨ ochl, Projector augmented-wave method, Phys. Rev. B50, 17953 (1994)

  30. [30]

    Dal Corso, Pseudopotentials periodic table: From H to Pu, Computational Materials Science95, 337 (2014)

    A. Dal Corso, Pseudopotentials periodic table: From H to Pu, Computational Materials Science95, 337 (2014)

  31. [31]

    G. d. M. Nascimento, F. J. d. Santos, M. Bercx, D. Gras- sano, G. Pizzi, and N. Marzari, Accurate and efficient protocols for high-throughput first-principles materi- als simulations (2025), arXiv:2504.03962 [cond-mat.mtrl- sci]

  32. [32]

    Qamar, M

    M. Qamar, M. Mrovec, Y. Lysogorskiy, A. Bochkarev, and R. Drautz, Atomic Cluster Expansion for Quantum- Accurate Large-Scale Simulations of Carbon, Journal of Chemical Theory and Computation19, 5151 (2023)

  33. [33]

    Grimme, Semiempirical GGA-type density functional constructed with a long-range dispersion correction, Journal of Computational Chemistry27, 1787 (2006)

    S. Grimme, Semiempirical GGA-type density functional constructed with a long-range dispersion correction, Journal of Computational Chemistry27, 1787 (2006)

  34. [34]

    Abdar, F

    M. Abdar, F. Pourpanah, S. Hussain, D. Rezazadegan, L. Liu, M. Ghavamzadeh, P. Fieguth, X. Cao, A. Khos- ravi, U. R. Acharya, V. Makarenkov, and S. Nahavandi, A review of uncertainty quantification in deep learning: Techniques, applications and challenges, Information Fu- sion76, 243 (2021)

  35. [35]

    Behler, Representing potential energy surfaces by high-dimensional neural network potentials, Journal of Physics: Condensed Matter26, 183001 (2014)

    J. Behler, Representing potential energy surfaces by high-dimensional neural network potentials, Journal of Physics: Condensed Matter26, 183001 (2014)

  36. [36]

    L. Chen, I. Sukuba, M. Probst, and A. Kaiser, Itera- tive training set refinement enables reactive molecular dynamicsviamachine learned forces, RSC Advances10, 4293 (2020)

  37. [37]

    Kahle and F

    L. Kahle and F. Zipoli, Quality of uncertainty estimates from neural network potential ensembles, Physical Re- view E105, 015311 (2022)

  38. [38]

    S. P. Huber, S. Zoupanos, M. Uhrin, L. Talirz, L. Kahle, R. H¨ auselmann, D. Gresch, T. M¨ uller, A. V. Yakutovich, C. W. Andersen, F. F. Ramirez, C. S. Adorf, F. Gargiulo, S. Kumbhar, E. Passaro, C. Johnston, A. Merkys, A. Ce- pellotti, N. Mounet, N. Marzari, B. Kozinsky, and G. Pizzi, AiiDA 1.0, a scalable computational infras- tructure for automated re...

  39. [39]

    Uhrin, S

    M. Uhrin, S. P. Huber, J. Yu, N. Marzari, and G. Pizzi, Workflows in AiiDA: Engineering a high-throughput, event-based engine for robust and modular computa- tional workflows, Computational Materials Science187, 110086 (2021)

  40. [40]

    Theaiida-lammpsplugin is available athttps:// github.com/aiidaplugins/aiida-lammps

  41. [41]

    A. H. Larsen, J. J. Mortensen, J. Blomqvist, I. E. Castelli, R. Christensen, M. Du lak, J. Friis, M. N. Groves, B. Hammer, C. Hargus, E. D. Hermes, P. C. Jennings, P. B. Jensen, J. Kermode, J. R. Kitchin, E. L. Kolsbjerg, J. Kubal, K. Kaasbjerg, S. Lysgaard, J. B. Maronsson, T. Maxson, T. Olsen, L. Pastewka, A. Pe- terson, C. Rostgaard, J. Schiøtz, O. Sch...

  42. [42]

    Collette,Python and HDF5(O’Reilly, 2013)

    A. Collette,Python and HDF5(O’Reilly, 2013)

  43. [43]

    Zadoks, A

    A. Zadoks, A. Marrazzo, and N. Marzari, Spectral op- erator representations, npj Computational Materials10, 278 (2024)

  44. [44]

    B. P. Pritchard, D. Altarawy, B. Didier, T. D. Gibson, and T. L. Windus, New Basis Set Exchange: An Open, Up-to-Date Resource for the Molecular Sciences Commu- nity, Journal of Chemical Information and Modeling59, 4814 (2019)

  45. [45]

    Feller, The role of databases in support of computa- tional chemistry calculations, Journal of Computational Chemistry17, 1571 (1996)

    D. Feller, The role of databases in support of computa- tional chemistry calculations, Journal of Computational Chemistry17, 1571 (1996)

  46. [46]

    K. L. Schuchardt, B. T. Didier, T. Elsethagen, L. Sun, V. Gurumoorthi, J. Chase, J. Li, and T. L. Windus, Ba- sis Set Exchange: A Community Database for Compu- tational Sciences, Journal of Chemical Information and Modeling47, 1045 (2007)

  47. [47]

    B. O. Roos, R. Lindh, P. Malmqvist, V. Veryazov, and P. Widmark, Main Group Atoms and Dimers Studied with a New Relativistic ANO Basis Set, The Journal of Physical Chemistry A108, 2851 (2004)

  48. [48]

    B. O. Roos, R. Lindh, P. Malmqvist, V. Veryazov, and P. Widmark, New Relativistic ANO Basis Sets for Tran- sition Metal Atoms, The Journal of Physical Chemistry A109, 6575 (2005)

  49. [49]

    B. O. Roos, R. Lindh, P. Malmqvist, V. Veryazov, and P. Widmark, New relativistic ANO basis sets for actinide atoms, Chemical Physics Letters409, 295 (2005)

  50. [50]

    B. O. Roos, R. Lindh, P. Malmqvist, V. Veryazov, P. Widmark, and A. C. Borin, New Relativistic Atomic Natural Orbital Basis Sets for Lanthanide Atoms with Applications to the Ce Diatom and LuF3, The Journal of Physical Chemistry A112, 11431 (2008)

  51. [51]

    J. P. Perdew, K. Burke, and M. Ernzerhof, Generalized Gradient Approximation Made Simple, Phys. Rev. Lett. 77, 3865 (1996)

  52. [52]

    T. W. Ko, J. A. Finkler, S. Goedecker, and J. Behler, A fourth-generation high-dimensional neural network po- tential with accurate electrostatics including non-local charge transfer, Nature Communications12, 398 (2021)

  53. [53]

    Banhart, J

    F. Banhart, J. Kotakoski, and A. V. Krasheninnikov, Structural Defects in Graphene, ACS Nano5, 26 (2011)

  54. [54]

    J. J. Palacios, J. Fern´ andez-Rossier, and L. Brey, Vacancy-induced magnetism in graphene and graphene ribbons, Physical Review B77, 195428 (2008). 16

  55. [55]

    Baroni, S

    S. Baroni, S. de Gironcoli, A. Dal Corso, and P. Gi- annozzi, Phonons and related crystal properties from density-functional perturbation theory, Rev. Mod. Phys. 73, 515 (2001)

  56. [56]

    Mounet, M

    N. Mounet, M. Gibertini, P. Schwaller, D. Campi, A. Merkys, A. Marrazzo, T. Sohier, I. E. Castelli, A. Ce- pellotti, G. Pizzi, and N. Marzari, Two-dimensional ma- terials from high-throughput computational exfoliation of experimentally known compounds (2020)

  57. [57]

    Laio and M

    A. Laio and M. Parrinello, Escaping free-energy min- ima, Proceedings of the National Academy of Sciences 99, 12562 (2002)

  58. [58]

    Bonomi, D

    M. Bonomi, D. Branduardi, G. Bussi, C. Camil- loni, D. Provasi, P. Raiteri, D. Donadio, F. Marinelli, F. Pietrucci, R. A. Broglia, and M. Parrinello, PLUMED: A portable plugin for free-energy calculations with molec- ular dynamics, Computer Physics Communications180, 1961 (2009)

  59. [59]

    G. A. Tribello, M. Bonomi, D. Branduardi, C. Camilloni, and G. Bussi, PLUMED 2: New feathers for an old bird, Computer Physics Communications185, 604 (2014)

  60. [60]

    The PLUMED consortium, Promoting transparency and reproducibility in enhanced molecular simulations, Na- ture Methods16, 670–673 (2019)

  61. [61]

    S. P. Huber, E. Bosoni, M. Bercx, J. Br¨ oder, A. De- gomme, V. Dikan, K. Eimre, E. Flage-Larsen, A. Gar- cia, L. Genovese, D. Gresch, C. Johnston, G. Petretto, S. Ponc´ e, G.-M. Rignanese, C. J. Sewell, B. Smit, V. Tse- plyaev, M. Uhrin, D. Wortmann, A. V. Yakutovich, A. Zadoks, P. Zarabadi-Poor, B. Zhu, N. Marzari, and G. Pizzi, Common workflows for com...

  62. [62]

    Timrov, N

    I. Timrov, N. Marzari, and M. Cococcioni, Hubbard parameters from density-functional perturbation theory, Phys. Rev. B98, 085127 (2018)

  63. [63]

    Timrov, N

    I. Timrov, N. Marzari, and M. Cococcioni, Self-consistent Hubbard parameters from density-functional perturba- tion theory in the ultrasoft and projector-augmented wave formulations, Phys. Rev. B103, 045141 (2021)

  64. [64]

    Timrov, N

    I. Timrov, N. Marzari, and M. Cococcioni, HP - A code for the calculation of Hubbard parameters using density- functional perturbation theory, Computer Physics Com- munications279, 108455 (2022)

  65. [65]

    Bastonero, C

    L. Bastonero, C. Malica, E. Macke, M. Bercx, S. Hu- ber, I. Timrov, and N. Marzari, First-principles Hubbard parameters with automated and reproducible workflows, npj Computational Materials11, 183 (2025)

  66. [66]

    Uhrin, A

    M. Uhrin, A. Zadoks, L. Binci, N. Marzari, and I. Tim- rov, Machine learning Hubbard parameters with equiv- ariant neural networks, npj Computational Materials11, 19 (2025)

  67. [67]

    Giannozzi, S

    P. Giannozzi, S. Baroni, N. Bonini, M. Calandra, R. Car, C. Cavazzoni, D. Ceresoli, G. L. Chiarotti, M. Cococ- cioni, I. Dabo, A. Dal Corso, S. de Gironcoli, S. Fabris, G. Fratesi, R. Gebauer, U. Gerstmann, C. Gougoussis, A. Kokalj, M. Lazzeri, L. Martin-Samos, N. Marzari, F. Mauri, R. Mazzarello, S. Paolini, A. Pasquarello, L. Paulatto, C. Sbraccia, S. S...

  68. [68]

    Giannozzi, O

    P. Giannozzi, O. Andreussi, T. Brumme, O. Bunau, M. Buongiorno Nardelli, M. Calandra, R. Car, C. Cavaz- zoni, D. Ceresoli, M. Cococcioni, N. Colonna, I. Carn- imeo, A. Dal Corso, S. de Gironcoli, P. Delugas, R. A. DiStasio, A. Ferretti, A. Floris, G. Fratesi, G. Fugallo, R. Gebauer, U. Gerstmann, F. Giustino, T. Gorni, J. Jia, M. Kawamura, H.-Y. Ko, A. Ko...

  69. [69]

    Giannozzi, O

    P. Giannozzi, O. Baseggio, P. Bonf` a, D. Brunato, R. Car, I. Carnimeo, C. Cavazzoni, S. de Gironcoli, P. Delugas, F. Ferrari Ruffino, A. Ferretti, N. Marzari, I. Timrov, A. Urru, and S. Baroni, Quantum ESPRESSO toward the exascale, The Journal of Chemical Physics152, 154105 (2020)

  70. [70]

    A. P. Thompson, H. M. Aktulga, R. Berger, D. S. Bolin- tineanu, W. M. Brown, P. S. Crozier, P. J. in ’t Veld, A. Kohlmeyer, S. G. Moore, T. D. Nguyen, R. Shan, M. J. Stevens, J. Tranchida, C. Trott, and S. J. Plimpton, LAMMPS—a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales, Computer Physics Commun...

  71. [71]

    Huber, M

    S. Huber, M. Bercx, N. H¨ ormann, M. Uhrin, G. Pizzi, and N. Marzari, Materials Cloud three-dimensional crystals database (MC3D) (2022)

  72. [72]

    Y. Zhou, W. A. Saidi, and K. A. Fichthorn, A Force Field for Describing the Polyvinylpyrrolidone-Mediated Solution-Phase Synthesis of Shape-Selective Ag Nanopar- ticles, The Journal of Physical Chemistry C118, 3366 (2014)

  73. [73]

    Bidoggia, N

    D. Bidoggia, N. Manko, M. Peressi, and A. Marrazzo, Automated training of neural-network interatomic poten- tials, 10.24435/materialscloud:8d-kj (2025)