AiiDA-TrainsPot: Towards automated training of neural-network interatomic potentials

arxiv: 2509.11703 · v2 · submitted 2025-09-15 · ⚛️ physics.comp-ph · cond-mat.dis-nn · cond-mat.mtrl-sci


Davide Bidoggia, Nataliia Manko, Maria Peressi, Antimo Marrazzo

Pith reviewed 2026-05-18 17:04 UTC · model grok-4.3


keywords neural-network interatomic potentials · active learning · automated workflow · density functional theory · committee disagreement · carbon allotropes · alloy phase transitions · uncertainty quantification


The pith

An automated workflow calibrates committee disagreement on the fly to select ab initio calculations reliably when training neural-network interatomic potentials.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces AiiDA-TrainsPot, a modular open-source workflow that orchestrates density-functional-theory calculations, data augmentation, and molecular dynamics to create neural-network interatomic potentials with less manual intervention. Its central active-learning strategy calibrates committee disagreement against reference ab initio errors during the process itself, producing uncertainty estimates that reduce both false positives and false negatives in deciding which new structures need expensive first-principles runs. Electronic-structure descriptors and dimensionality reduction are then used to check how well this calibrated criterion covers relevant configuration space in the examples shown. The workflow supports training from scratch or fine-tuning existing models and is demonstrated on carbon allotropes including amorphous phases and on phase transitions in $\mathrm{W_xMo_{1-x}Te_2}$ monolayers. A reader would care because the approach lowers the expertise barrier for producing accurate potentials that can be used in larger-scale simulations.

Core claim

AiiDA-TrainsPot automates the creation of neural-network interatomic potentials by linking DFT computations with active learning. The key advance is on-the-fly calibration of committee disagreement against ab initio reference errors, which yields reliable uncertainty estimates. This calibrated measure, validated through electronic-structure descriptors and dimensionality reduction, minimizes both false positives and false negatives when choosing structures for further first-principles evaluation across the tested carbon and alloy systems.

What carries the argument

On-the-fly calibration of committee disagreement against ab initio reference errors, which supplies reliable uncertainty estimates that guide active-learning decisions on which structures to compute from first principles.
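To make the mechanism concrete, here is a minimal Python sketch of committee disagreement followed by calibration against reference errors. It is an illustration only: the text above does not specify the calibration rule, so the single least-squares scale factor `alpha`, the function names, and the choice of the standard deviation as the disagreement metric are all assumptions, not the workflow's actual implementation.

```python
import numpy as np


def committee_disagreement(predictions):
    """Sample standard deviation across committee members.

    predictions: array of shape (n_models, n_structures).
    Returns one disagreement value per structure.
    """
    return np.std(np.asarray(predictions, dtype=float), axis=0, ddof=1)


def fit_calibration(disagreement, reference_error):
    """Least-squares scale factor alpha with error ~ alpha * disagreement.

    ASSUMPTION: a one-parameter linear map through the origin; the
    calibration used by the paper may differ.
    """
    d = np.asarray(disagreement, dtype=float)
    e = np.asarray(reference_error, dtype=float)
    return float(np.dot(d, e) / np.dot(d, d))


def select_for_labelling(disagreement, alpha, threshold):
    """Flag structures whose calibrated error estimate exceeds the threshold."""
    return alpha * np.asarray(disagreement, dtype=float) > threshold
```

In use, `fit_calibration` would be re-run each time newly labelled structures become available, so that the scale factor tracks the current committee. The calibrated estimate, not the raw disagreement, is then compared with the user's error threshold.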

If this is right

The same calibrated criterion supports both training potentials from scratch and fine-tuning foundation models.
The workflow successfully handles pristine, defective, and amorphous carbon structures as well as alloy phase transitions.
Modular design allows swapping in different neural-network interatomic potential backends without rewriting the active-learning loop.
On-the-fly calibration reduces the number of expensive ab initio calculations needed while maintaining coverage of relevant configurations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The calibration approach could be tested on systems outside the demonstrated carbon and dichalcogenide cases to check transferability of the uncertainty estimates.
Lower false-negative rates might allow safer use of the resulting potentials in long molecular-dynamics runs where rare events matter.
Because the workflow is built on an existing automation platform, it could be combined with high-throughput material-screening campaigns that already generate large structure databases.

Load-bearing premise

Electronic-structure descriptors combined with dimensionality reduction can reliably validate the calibrated committee disagreement criterion without introducing selection bias in the active-learning loop.

What would settle it

Apply the workflow to a new material system, collect the structures it selects versus those it skips, and check whether many skipped structures later show large actual errors when computed ab initio or whether many selected structures prove unnecessary.
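The check described above amounts to a confusion count over selected and skipped structures. The sketch below shows one way to tally it in Python; the function name and the rule that a structure "needed" labelling when its actual error exceeds the threshold are illustrative assumptions, not something taken from the paper.

```python
import numpy as np


def selection_confusion(selected, true_error, threshold):
    """Count correct and incorrect active-learning decisions.

    selected:   boolean flags, True where the workflow chose to label.
    true_error: actual error of each structure against ab initio data.
    threshold:  error above which labelling was genuinely needed
                (ASSUMPTION: same threshold as the selection rule).
    """
    selected = np.asarray(selected, dtype=bool)
    needed = np.asarray(true_error, dtype=float) > threshold
    return {
        "true_positive": int(np.sum(selected & needed)),
        "false_positive": int(np.sum(selected & ~needed)),   # labelled unnecessarily
        "false_negative": int(np.sum(~selected & needed)),   # skipped but inaccurate
        "true_negative": int(np.sum(~selected & ~needed)),
    }
```

A large `false_negative` count would indicate skipped structures with large actual errors, while a large `false_positive` count would indicate wasted first-principles calculations.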

Figures

Figures reproduced from arXiv: 2509.11703 by Antimo Marrazzo, Davide Bidoggia, Maria Peressi, Nataliia Manko.

**Figure 2.** [Image: figures/full_fig_p006_2.png]

**Figure 3.** [Image: figures/full_fig_p007_3.png]

**Figure 4.** [Image: figures/full_fig_p009_4.png]

**Figure 5.** [Image: figures/full_fig_p010_5.png]

**Figure 6.** Energy as a function of interlayer separation in ... [Image: figures/full_fig_p011_6.png]

**Figure 7.** Equations of state. Left panel: Binding energy per atom as a function of nearest-neighbor bond distance for various ... [Image: figures/full_fig_p012_7.png]

**Figure 8.** Phonon dispersion and density of states for graphene, ... [Image: figures/full_fig_p013_8.png]

Original abstract

Crafting neural-network interatomic potentials (NNIPs) remains a complex task, demanding specialized expertise in both machine learning and electronic-structure calculations. Here, we introduce AiiDA-TrainsPot, an automated, open-source, and user-friendly workflow that streamlines the creation of accurate NNIPs by orchestrating density-functional-theory calculations, data augmentation strategies, and classical molecular dynamics. Our active-learning strategy leverages on-the-fly calibration of committee disagreement against ab initio reference errors to ensure reliable uncertainty estimates. We use electronic-structure descriptors and dimensionality reduction to analyze the efficiency of this calibrated criterion, and show that it minimizes both false positives and false negatives when deciding what to compute from first principles. AiiDA-TrainsPot has a modular design that supports multiple NNIP backends, enabling both the training of NNIPs from scratch and the fine-tuning of foundation models. We demonstrate its capabilities through automated training campaigns targeting pristine and defective carbon allotropes, including amorphous carbon, as well as structural phase transitions in monolayer $\mathrm{W_xMo_{1-x}Te_2}$ alloys.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

AiiDA-TrainsPot gives a working automated pipeline for NNIP training with on-the-fly calibrated committee active learning on carbon and alloy systems, but the descriptor analysis for validating the uncertainty criterion needs clearer proof it avoids selection bias.


The paper introduces AiiDA-TrainsPot, an open-source workflow that ties together DFT calculations, data augmentation, molecular dynamics, and active learning to train neural-network interatomic potentials with less manual work. The active-learning piece calibrates committee disagreement against actual ab initio errors during the run, then uses that to decide which new structures to compute from first principles. They demonstrate it on pristine and defective carbon allotropes plus amorphous carbon, and on structural transitions in $\mathrm{W_xMo_{1-x}Te_2}$ monolayers.

The modular setup that lets users swap NNIP backends or fine-tune foundation models is a clear practical step forward from piecing together separate scripts. The use of electronic-structure descriptors and dimensionality reduction to inspect how the calibrated criterion performs is also a reasonable way to check coverage and selection quality. These elements together make the workflow more accessible for people who need custom potentials for defects or alloys but lack deep expertise in both ML and electronic structure.

The main soft spot sits in the validation of that calibrated criterion. If the descriptors or the reduced space are derived from structures already picked by the active-learning loop itself, or from the same initial committee runs used for calibration, then the reported drop in false positives and negatives could reflect selection effects rather than genuine improvement in uncertainty estimates. The abstract does not spell out an independent hold-out set or cross-validation scheme that keeps the analysis separate from the active selections, so that part of the reliability claim rests on an assumption that needs explicit checking in the full methods.

This work is aimed at computational materials researchers who want a ready orchestration layer for NNIP campaigns rather than a new theoretical method. Readers focused on 2D materials or carbon systems will find the concrete examples useful, and the open-source modular design could save time for groups building their own pipelines. The implementation and demonstrations are solid enough to justify sending it to peer review, where the circularity question on the validation can be resolved with more detail on the data splits.

Referee Report

1 major / 2 minor

Summary. The manuscript introduces AiiDA-TrainsPot, an automated open-source workflow built on AiiDA for training neural-network interatomic potentials (NNIPs). It integrates DFT calculations, data augmentation, and molecular dynamics, with a core active-learning loop that performs on-the-fly calibration of committee disagreement against ab initio reference errors to produce uncertainty estimates. Electronic-structure descriptors combined with dimensionality reduction are used to analyze the efficiency of this calibrated criterion on carbon allotropes (including amorphous carbon) and $\mathrm{W_xMo_{1-x}Te_2}$ alloy systems, with the claim that the approach minimizes both false positives and false negatives when selecting structures for first-principles evaluation. The workflow is modular and supports training from scratch as well as fine-tuning of foundation models.

Significance. If the on-the-fly calibration produces genuinely independent and reliable uncertainty estimates that generalize beyond the training data, the work would meaningfully advance automated, reproducible NNIP development for complex materials systems. The demonstrations on defective carbon and alloy phase transitions, together with support for multiple backends, would provide practical value for reducing expert intervention in potential training campaigns.

major comments (1)

[Results section on descriptor-based analysis of the calibrated criterion] The section describing the analysis of the calibrated committee disagreement (via electronic-structure descriptors and dimensionality reduction) must explicitly state whether this validation is performed on a fully independent hold-out set of structures or on configurations already visited or selected by the active-learning procedure itself. If the reduced descriptor space is constructed from the same committee evaluations used for calibration, the reported minimization of false positives and false negatives risks selection bias and does not independently confirm the reliability of the uncertainty estimates.
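The distinction the referee draws between in-sample and hold-out validation can be illustrated with a short Python sketch. It fits the calibration on one subset of labelled structures and scores the selection decisions on a disjoint subset. Everything here is hypothetical: the function name, the one-parameter calibration `alpha`, the random split, and the default 30% hold-out fraction are assumptions chosen for illustration and are not described in the manuscript.

```python
import numpy as np


def holdout_check(disagreement, reference_error, threshold,
                  holdout_fraction=0.3, seed=0):
    """Calibrate on one subset and evaluate decisions on a disjoint one."""
    d = np.asarray(disagreement, dtype=float)
    e = np.asarray(reference_error, dtype=float)

    # Random, non-overlapping split into hold-out and calibration indices.
    rng = np.random.default_rng(seed)
    idx = rng.permutation(d.size)
    n_hold = max(1, int(round(holdout_fraction * d.size)))
    hold, calib = idx[:n_hold], idx[n_hold:]

    # ASSUMPTION: linear calibration through the origin, fitted by least squares.
    alpha = np.dot(d[calib], e[calib]) / np.dot(d[calib], d[calib])

    flagged = alpha * d[hold] > threshold   # what the criterion would select
    needed = e[hold] > threshold            # what actually required labelling
    return {
        "alpha": float(alpha),
        "n_holdout": int(n_hold),
        "false_positive_fraction": float(np.mean(flagged & ~needed)),
        "false_negative_fraction": float(np.mean(~flagged & needed)),
    }
```

Because the hold-out structures play no part in fitting `alpha`, the two fractions returned are out-of-sample estimates. For the check to address the referee's concern fully, the hold-out structures would also need to be generated independently of the active-learning selections, which a random split of already-visited configurations does not guarantee.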

minor comments (2)

[Abstract] The abstract states that the calibrated criterion 'minimizes both false positives and false negatives' but provides no quantitative thresholds, error metrics, or cross-validation details; a brief clarification of the decision rule and evaluation protocol would improve clarity.
[Methods] Ensure that the precise definition of the committee (number of models, training protocol, and disagreement metric) is given in a dedicated methods subsection or table so that the on-the-fly calibration procedure can be reproduced without ambiguity.

Simulated Author's Rebuttal

1 response · 0 unresolved

We thank the referee for their careful reading of the manuscript and for the constructive comment on the descriptor-based analysis. We have revised the relevant section to explicitly address the nature of the dataset used and the implications for interpreting the results.


Referee: [Results section on descriptor-based analysis of the calibrated criterion] The section describing the analysis of the calibrated committee disagreement (via electronic-structure descriptors and dimensionality reduction) must explicitly state whether this validation is performed on a fully independent hold-out set of structures or on configurations already visited or selected by the active-learning procedure itself. If the reduced descriptor space is constructed from the same committee evaluations used for calibration, the reported minimization of false positives and false negatives risks selection bias and does not independently confirm the reliability of the uncertainty estimates.

Authors: We thank the referee for this observation. The analysis presented in the Results section was performed on configurations visited or selected during the active-learning procedure itself, as the on-the-fly calibration of committee disagreement is an integral part of the workflow and occurs on the structures encountered in each training campaign. The reduced descriptor space is therefore constructed from the same set of committee evaluations used for calibration. We agree that this constitutes an in-sample analysis and carries a risk of selection bias; it does not provide an independent confirmation of the uncertainty estimates on fully unseen data. The intent of the section is to demonstrate the practical efficiency of the calibrated criterion in minimizing false positives and false negatives within the automated campaigns on carbon allotropes and the alloy systems, rather than to claim external validation. In the revised manuscript we have added explicit statements clarifying the in-sample nature of the analysis and discussing its limitations for assessing generalization. We believe this improves transparency without altering the reported findings.

revision: yes

Circularity Check

0 steps flagged

No significant circularity in the presented workflow or validation chain.

full rationale

The abstract describes an on-the-fly calibration of committee disagreement against ab initio reference errors as part of the active-learning strategy, followed by separate analysis using electronic-structure descriptors and dimensionality reduction to assess efficiency on carbon allotropes and alloy systems. No equations or explicit statements in the provided text demonstrate that calibration parameters or decision thresholds are fitted directly to the same data used for the final NNIP training or that the descriptor-based validation reduces to the active-learning selections by construction. The workflow is presented as modular with demonstrations on specific targets, and the analysis functions as an external check rather than a self-referential loop. No load-bearing self-citations, imported uniqueness theorems, or ansatz smuggling are evident. The derivation chain remains self-contained against the described benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on standard DFT accuracy for reference data, the validity of committee disagreement as a proxy for error, and the effectiveness of electronic-structure descriptors for analysis; no new free parameters or invented entities are introduced in the abstract.

axioms (2)

domain assumption: DFT calculations provide reliable reference errors for calibrating committee disagreement.
Invoked when using ab initio data to calibrate the active learning criterion.
domain assumption: Committee disagreement can be mapped to actual prediction errors via on-the-fly calibration.
Central to the active-learning strategy described in the abstract.

pith-pipeline@v0.9.0 · 5739 in / 1427 out tokens · 45674 ms · 2026-05-18T17:04:35.760060+00:00 · methodology


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Spin Dynamics from Atomistic Quantum Simulations
cond-mat.mtrl-sci 2026-05 unverdicted novelty 6.0

Kubo theory yields T1 and T2 from spin-lattice correlation functions; these are evaluated with ML-driven ab initio molecular dynamics and match experimental T1 values for the NV center in diamond.

Reference graph

Works this paper leans on

73 extracted references · 73 canonical work pages · cited by 1 Pith paper

[1]

Input structures. AiiDA-TrainsPot can start from a small set of initial atomistic structures $\{X\}^{(0)}$, determined by boundary conditions (periodic vs. open), cell parameters, atomic species and atomic positions. The number and diversity of input structures should reflect the target applications: for example, the study of temperature-dependent properties...

[2]

Dataset augmentation. In the dataset augmentation stage, additional structures are generated by manipulating the initial set $\{X\}^{(0)}$. All manipulations can be controlled through customizable parameters to tailor the augmentation process according to specific user needs; we group them in the following categories: • Supercells: Initial structures are repl...

[3]

Ab Initio Labelling. After the data augmentation stage, AiiDA-TrainsPot starts the active learning loop, which is represented by the orange circle in Fig. 1. Each structure $X_i$ in the augmented dataset is labeled through DFT calculations to obtain high-fidelity reference values for energies, forces, and stress tensors. We use the compact notation L_DFT(X...

[4]

Training neural-network interatomic potentials. The labeled dataset $X_i, L_{\mathrm{DFT}}(X_i)$ (1) is used to train a committee of $M$ NNIPs $\{\Phi_j\}_{j=1}^{M}$, each with identical architecture but initialized with different random seeds. Prior to training, all structures are systematically partitioned into three subsets, ensuring representative sampling across different struc...

[5]

Exploration by molecular dynamics. After a committee of NNIPs is trained, the workflow employs MD simulations to systematically explore the potential energy landscape. This exploration phase is critical for identifying configurations where the NNIPs might have insufficient accuracy, thus guiding the selection of additional structures for ab initio calculat...

[6]

Committee Evaluation. This stage aims at identifying structures that are poorly predicted by the NNIPs; those are good candidates to be labeled with ab initio calculations and included in the training dataset. However, while Bayesian neural networks (NNs) come with a well-defined probabilistic uncertainty quantification, no such Bayesian error estimati...

[7]

to 9,537 structures in the final iteration. For energy and stress tensor components, we observe a consistent decrease in prediction errors as the active learning progresses. Interestingly, errors on forces increase from the first to the second iteration, and then decrease monotonically for all the following iterations, suggesting that the active learn...

[8]

or DFT (row 2). The third and fourth rows correspond to structures relaxed with DFT, with subsequent energy evaluation using the NNIP (row 3) or DFT (row 4). To assess the transferability of our NNIP to defective structures, we compute the formation energies of three representative point defects in graphene: the monovacancy, divacancy, and Stone-Wales...

[9]

for k-point grid ($\lambda = 0.30$ Å$^{-1}$) and smearing ($\sigma_{\mathrm{cold}} = 0.0275$ Ry). MD simulations explored temperatures ranging from 0 to 5000 K and pressures from −5 to 5 kbar. For run B, dataset augmentation parameters were optimized for near-equilibrium conditions: rattle fraction was reduced to 0.3, while strain ranges were increased to max compressive strain of 0.3 and...

[10]

in LAMMPS. Active learning thresholds on energy, forces, and stress tensor were set to 2 meV, 50 meV/Å, 10 meV/Å$^3$, respectively, with a maximum of 1000 structures selected per iteration. IV. DATA AVAILABILITY The training datasets and trained models are available on the Materials Cloud Archive [63]. V. CODE AVAILABILITY AiiDA-TrainsPot is availabl...
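The thresholds and per-iteration cap quoted in excerpt [10] suggest a selection step along the following lines. This Python sketch takes the stress threshold as 10 meV/Å³ and uses the other quoted values as defaults. How the three criteria are combined and how candidates are ranked when more than 1000 qualify are not stated in the excerpt, so the "any quantity above threshold" rule, the ranking by worst error-to-threshold ratio, and all names are assumptions.

```python
import numpy as np

# Values quoted in excerpt [10]: energy (meV), forces (meV/Å), stress (meV/Å^3).
THRESHOLDS = {"energy": 2.0, "forces": 50.0, "stress": 10.0}
MAX_SELECTED = 1000  # maximum number of structures selected per iteration


def select_structures(errors, thresholds=THRESHOLDS, max_selected=MAX_SELECTED):
    """Return indices of structures to label, worst offenders first.

    errors: dict mapping each quantity name to a 1D array holding one
            calibrated error estimate per structure.
    ASSUMPTION: a structure qualifies if ANY quantity exceeds its threshold,
    and candidates are ranked by their largest error/threshold ratio.
    """
    ratios = np.vstack([
        np.asarray(errors[name], dtype=float) / thresholds[name]
        for name in thresholds
    ])
    worst = ratios.max(axis=0)
    candidates = np.flatnonzero(worst > 1.0)
    order = np.argsort(worst[candidates])[::-1]
    return candidates[order][:max_selected]
```

Normalizing each error by its own threshold puts energy, force, and stress estimates on a common dimensionless scale, which is one simple way to rank structures across quantities with different units.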

work page 2025
[66]

Uhrin, A

M. Uhrin, A. Zadoks, L. Binci, N. Marzari, and I. Tim- rov, Machine learning Hubbard parameters with equiv- ariant neural networks, npj Computational Materials11, 19 (2025)

work page 2025
[67]

Giannozzi, S

P. Giannozzi, S. Baroni, N. Bonini, M. Calandra, R. Car, C. Cavazzoni, D. Ceresoli, G. L. Chiarotti, M. Cococ- cioni, I. Dabo, A. Dal Corso, S. de Gironcoli, S. Fabris, G. Fratesi, R. Gebauer, U. Gerstmann, C. Gougoussis, A. Kokalj, M. Lazzeri, L. Martin-Samos, N. Marzari, F. Mauri, R. Mazzarello, S. Paolini, A. Pasquarello, L. Paulatto, C. Sbraccia, S. S...

work page 2009
[68]

Giannozzi, O

P. Giannozzi, O. Andreussi, T. Brumme, O. Bunau, M. Buongiorno Nardelli, M. Calandra, R. Car, C. Cavaz- zoni, D. Ceresoli, M. Cococcioni, N. Colonna, I. Carn- imeo, A. Dal Corso, S. de Gironcoli, P. Delugas, R. A. DiStasio, A. Ferretti, A. Floris, G. Fratesi, G. Fugallo, R. Gebauer, U. Gerstmann, F. Giustino, T. Gorni, J. Jia, M. Kawamura, H.-Y. Ko, A. Ko...

work page 2017
[69]

Giannozzi, O

P. Giannozzi, O. Baseggio, P. Bonf` a, D. Brunato, R. Car, I. Carnimeo, C. Cavazzoni, S. de Gironcoli, P. Delugas, F. Ferrari Ruffino, A. Ferretti, N. Marzari, I. Timrov, A. Urru, and S. Baroni, Quantum ESPRESSO toward the exascale, The Journal of Chemical Physics152, 154105 (2020)

work page 2020
[70]

A. P. Thompson, H. M. Aktulga, R. Berger, D. S. Bolin- tineanu, W. M. Brown, P. S. Crozier, P. J. in ’t Veld, A. Kohlmeyer, S. G. Moore, T. D. Nguyen, R. Shan, M. J. Stevens, J. Tranchida, C. Trott, and S. J. Plimpton, LAMMPS—a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales, Computer Physics Commun...

work page 2022
[71]

Huber, M

S. Huber, M. Bercx, N. H¨ ormann, M. Uhrin, G. Pizzi, and N. Marzari, Materials Cloud three-dimensional crystals database (MC3D) (2022)

work page 2022
[72]

Y. Zhou, W. A. Saidi, and K. A. Fichthorn, A Force Field for Describing the Polyvinylpyrrolidone-Mediated Solution-Phase Synthesis of Shape-Selective Ag Nanopar- ticles, The Journal of Physical Chemistry C118, 3366 (2014)

work page 2014
[73]

Bidoggia, N

D. Bidoggia, N. Manko, M. Peressi, and A. Marrazzo, Automated training of neural-network interatomic poten- tials, 10.24435/materialscloud:8d-kj (2025)

work page doi:10.24435/materialscloud:8d-kj 2025

[1] Input structures. AiiDA-TrainsPot can start from a small set of initial atomistic structures $\{X\}^{(0)}$, determined by boundary conditions (periodic vs. open), cell parameters, atomic species and atomic positions. The number and diversity of input structures should reflect the target applications: for example, the study of temperature-dependent properties...

[2] Dataset augmentation. In the dataset augmentation stage, additional structures are generated by manipulating the initial set $\{X\}^{(0)}$. All manipulations can be controlled through customizable parameters to tailor the augmentation process according to specific user needs; we group them in the following categories: • Supercells: Initial structures are repl...

[3] Ab Initio Labelling. After the data augmentation stage, AiiDA-TrainsPot starts the active learning loop, which is represented by the orange circle in Fig. 1. Each structure $X_i$ in the augmented dataset is labeled through DFT calculations to obtain high-fidelity reference values for energies, forces, and stress tensors. We use the compact notation $L_{\mathrm{DFT}}$(X...

[4] Training neural-network interatomic potentials. The labeled dataset $\{X_i, L_{\mathrm{DFT}}(X_i)\}^{(1)}$ is used to train a committee of $M$ NNIPs $\{\Phi_j\}_{j=1}^{M}$, each with identical architecture but initialized with different random seeds. Prior to training, all structures are systematically partitioned into three subsets, ensuring representative sampling across different struc...

[5] Exploration by molecular dynamics. After a committee of NNIPs is trained, the workflow employs MD simulations to systematically explore the potential energy landscape. This exploration phase is critical for identifying configurations where the NNIPs might have insufficient accuracy, thus guiding the selection of additional structures for ab initio calculat...

[6] Committee Evaluation. This stage aims at identifying structures that are poorly predicted by the NNIPs; those are good candidates to be labeled with ab initio calculations and included in the training dataset. However, while Bayesian neural networks (NNs) come with a well-defined probabilistic uncertainty quantification, no such Bayesian error estimati...

[7] to 9,537 structures in the final iteration. For energy and stress tensor components, we observe a consistent decrease in prediction errors as the active learning progresses. Interestingly, errors on forces increase from the first to the second iteration, and then decrease monotonically for all the following iterations, suggesting that the active learn...

[8] or DFT (row 2). The third and fourth rows correspond to structures relaxed with DFT, with subsequent energy evaluation using the NNIP (row 3) or DFT (row 4). To assess the transferability of our NNIP to defective structures, we compute the formation energies of three representative point defects in graphene: the monovacancy, divacancy, and Stone-Wales...

[9] for k-point grid ($\lambda = 0.30$ Å$^{-1}$) and smearing ($\sigma_{\mathrm{cold}} = 0.0275$ Ry). MD simulations explored temperatures ranging from 0 to 5000 K and pressures from −5 to 5 kbar. For run B, dataset augmentation parameters were optimized for near-equilibrium conditions: rattle fraction was reduced to 0.3, while strain ranges were increased to max compressive strain of 0.3 and...

[10] in LAMMPS. Active learning thresholds on energy, forces, and stress tensor were set to 2 meV, 50 meV/Å, and 10 meV/Å$^3$, respectively, with a maximum of 1000 structures selected per iteration. IV. DATA AVAILABILITY The training datasets and trained models are available on the Materials Cloud Archive [63]. V. CODE AVAILABILITY AiiDA-TrainsPot is availabl...

[11] R. Car and M. Parrinello, Unified Approach for Molecular Dynamics and Density-Functional Theory, Phys. Rev. Lett. 55, 2471 (1985)

[12] V. L. Deringer, M. A. Caro, and G. Csányi, Machine Learning Interatomic Potentials as Emerging Tools for Materials Science, Advanced Materials 31, 1902765 (2019)

[13] S. Batzner, A. Musaelian, L. Sun, M. Geiger, J. P. Mailoa, M. Kornbluth, N. Molinari, T. E. Smidt, and B. Kozinsky, E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials, Nature Communications 13, 2453 (2022)

[14] A. Musaelian, S. Batzner, A. Johansson, L. Sun, C. J. Owen, M. Kornbluth, and B. Kozinsky, Learning local equivariant representations for large-scale atomistic dynamics, Nature Communications 14, 579 (2023)

[15] I. Batatia, D. P. Kovacs, G. N. C. Simm, C. Ortner, and G. Csanyi, MACE: Higher Order Equivariant Message Passing Neural Networks for Fast and Accurate Molecular Force Fields, Advances in Neural Information Processing Systems 35, 11423 (2022)

[16] D. P. Kovács, J. H. Moore, N. J. Browning, I. Batatia, J. T. Horton, Y. Pu, V. Kapil, W. C. Witt, I.-B. Magdău, D. J. Cole, and G. Csányi, MACE-OFF: Transferable Short Range Machine Learning Force Fields for Organic Molecules (2025), arXiv:2312.15211 [physics.chem-ph]

[17] A. Mazitov, F. Bigi, M. Kellner, P. Pegolo, D. Tisi, G. Fraux, S. Pozdnyakov, P. Loche, and M. Ceriotti, PET-MAD, a lightweight universal interatomic potential for advanced materials modeling (2025), arXiv:2503.14118 [cond-mat.mtrl-sci]

[18] Y. Zhang, H. Wang, W. Chen, J. Zeng, L. Zhang, H. Wang, and W. E, DP-GEN: A concurrent learning platform for the generation of reliable deep learning based potential energy models, Computer Physics Communications 253, 107206 (2020)

[19] K. T. Schütt, P. Kessel, M. Gastegger, K. A. Nicoli, A. Tkatchenko, and K.-R. Müller, SchNetPack: A Deep Learning Toolbox For Atomistic Systems, Journal of Chemical Theory and Computation 15, 448–455 (2018)

[20] L. Talirz, S. Kumbhar, E. Passaro, A. V. Yakutovich, V. Granata, F. Gargiulo, M. Borelli, M. Uhrin, S. P. Huber, S. Zoupanos, C. S. Adorf, C. W. Andersen, O. Schütt, C. A. Pignedoli, D. Passerone, J. VandeVondele, T. C. Schulthess, B. Smit, G. Pizzi, and N. Marzari, Materials Cloud, a platform for open computational science, Scientific Data 7, 299 (2020)

[21] A. Jain, S. P. Ong, G. Hautier, W. Chen, W. D. Richards, S. Dacek, S. Cholia, D. Gunter, D. Skinner, G. Ceder, and K. A. Persson, The Materials Project: A materials genome approach to accelerating materials innovation, APL Materials 1, 011002 (2013)

[22] G. Bergerhoff, R. Hundt, R. Sievers, and I. D. Brown, The inorganic crystal structure data base, Journal of Chemical Information and Computer Sciences 23, 66 (1983)

[23] S. Gražulis, D. Chateigner, R. T. Downs, A. F. T. Yokochi, M. Quirós, L. Lutterotti, E. Manakova, J. Butkus, P. Moeck, and A. Le Bail, Crystallography Open Database – an open-access collection of crystal structures, Journal of Applied Crystallography 42, 726 (2009)

[24] S. Gražulis, A. Daškevič, A. Merkys, D. Chateigner, L. Lutterotti, M. Quirós, N. R. Serebryanaya, P. Moeck, R. T. Downs, and A. Le Bail, Crystallography Open Database (COD): an open-access collection of crystal structures and platform for world-wide collaboration, Nucleic Acids Research 40, D420 (2011)

[25] P. Villars, N. Onodera, and S. Iwata, The Linus Pauling File (LPF) and its application to materials design, Journal of Alloys and Compounds 279, 1 (1998)

[26] P. Villars, M. Berndt, K. Brandenburg, K. Cenzual, J. Daams, F. Hulliger, T. Massalski, H. Okamoto, K. Osaki, A. Prince, H. Putz, and S. Iwata, The Pauling File, binaries edition, Journal of Alloys and Compounds 367, 293 (2004)

[27] E. Blokhin and P. Villars, The PAULING FILE Project and Materials Platform for Data Science: From Big Data Toward Materials Genome, in Handbook of Materials Modeling: Methods: Theory and Modeling, edited by W. Andreoni and S. Yip (Springer International Publishing, Cham, 2018) pp. 1–26

[28] G. Prandini, A. Marrazzo, I. E. Castelli, N. Mounet, and N. Marzari, Precision and efficiency in solid-state pseudopotential calculations, npj Computational Materials 4, 72 (2018)

[29] P. E. Blöchl, Projector augmented-wave method, Phys. Rev. B 50, 17953 (1994)

[30] A. Dal Corso, Pseudopotentials periodic table: From H to Pu, Computational Materials Science 95, 337 (2014)

[31] G. d. M. Nascimento, F. J. d. Santos, M. Bercx, D. Grassano, G. Pizzi, and N. Marzari, Accurate and efficient protocols for high-throughput first-principles materials simulations (2025), arXiv:2504.03962 [cond-mat.mtrl-sci]

[32] M. Qamar, M. Mrovec, Y. Lysogorskiy, A. Bochkarev, and R. Drautz, Atomic Cluster Expansion for Quantum-Accurate Large-Scale Simulations of Carbon, Journal of Chemical Theory and Computation 19, 5151 (2023)

[33] S. Grimme, Semiempirical GGA-type density functional constructed with a long-range dispersion correction, Journal of Computational Chemistry 27, 1787 (2006)

[34] M. Abdar, F. Pourpanah, S. Hussain, D. Rezazadegan, L. Liu, M. Ghavamzadeh, P. Fieguth, X. Cao, A. Khosravi, U. R. Acharya, V. Makarenkov, and S. Nahavandi, A review of uncertainty quantification in deep learning: Techniques, applications and challenges, Information Fusion 76, 243 (2021)

[35] J. Behler, Representing potential energy surfaces by high-dimensional neural network potentials, Journal of Physics: Condensed Matter 26, 183001 (2014)

[36] L. Chen, I. Sukuba, M. Probst, and A. Kaiser, Iterative training set refinement enables reactive molecular dynamics via machine learned forces, RSC Advances 10, 4293 (2020)

[37] L. Kahle and F. Zipoli, Quality of uncertainty estimates from neural network potential ensembles, Physical Review E 105, 015311 (2022)

[38] S. P. Huber, S. Zoupanos, M. Uhrin, L. Talirz, L. Kahle, R. Häuselmann, D. Gresch, T. Müller, A. V. Yakutovich, C. W. Andersen, F. F. Ramirez, C. S. Adorf, F. Gargiulo, S. Kumbhar, E. Passaro, C. Johnston, A. Merkys, A. Cepellotti, N. Mounet, N. Marzari, B. Kozinsky, and G. Pizzi, AiiDA 1.0, a scalable computational infrastructure for automated re...

[39] M. Uhrin, S. P. Huber, J. Yu, N. Marzari, and G. Pizzi, Workflows in AiiDA: Engineering a high-throughput, event-based engine for robust and modular computational workflows, Computational Materials Science 187, 110086 (2021)

[40] The aiida-lammps plugin is available at https://github.com/aiidaplugins/aiida-lammps

[41] A. H. Larsen, J. J. Mortensen, J. Blomqvist, I. E. Castelli, R. Christensen, M. Dułak, J. Friis, M. N. Groves, B. Hammer, C. Hargus, E. D. Hermes, P. C. Jennings, P. B. Jensen, J. Kermode, J. R. Kitchin, E. L. Kolsbjerg, J. Kubal, K. Kaasbjerg, S. Lysgaard, J. B. Maronsson, T. Maxson, T. Olsen, L. Pastewka, A. Peterson, C. Rostgaard, J. Schiøtz, O. Sch...

[42] A. Collette, Python and HDF5 (O’Reilly, 2013)

[43] A. Zadoks, A. Marrazzo, and N. Marzari, Spectral operator representations, npj Computational Materials 10, 278 (2024)

[44] B. P. Pritchard, D. Altarawy, B. Didier, T. D. Gibson, and T. L. Windus, New Basis Set Exchange: An Open, Up-to-Date Resource for the Molecular Sciences Community, Journal of Chemical Information and Modeling 59, 4814 (2019)

[45] D. Feller, The role of databases in support of computational chemistry calculations, Journal of Computational Chemistry 17, 1571 (1996)

[46] K. L. Schuchardt, B. T. Didier, T. Elsethagen, L. Sun, V. Gurumoorthi, J. Chase, J. Li, and T. L. Windus, Basis Set Exchange: A Community Database for Computational Sciences, Journal of Chemical Information and Modeling 47, 1045 (2007)

[47] B. O. Roos, R. Lindh, P. Malmqvist, V. Veryazov, and P. Widmark, Main Group Atoms and Dimers Studied with a New Relativistic ANO Basis Set, The Journal of Physical Chemistry A 108, 2851 (2004)

[48] B. O. Roos, R. Lindh, P. Malmqvist, V. Veryazov, and P. Widmark, New Relativistic ANO Basis Sets for Transition Metal Atoms, The Journal of Physical Chemistry A 109, 6575 (2005)

[49] B. O. Roos, R. Lindh, P. Malmqvist, V. Veryazov, and P. Widmark, New relativistic ANO basis sets for actinide atoms, Chemical Physics Letters 409, 295 (2005)

[50] B. O. Roos, R. Lindh, P. Malmqvist, V. Veryazov, P. Widmark, and A. C. Borin, New Relativistic Atomic Natural Orbital Basis Sets for Lanthanide Atoms with Applications to the Ce Diatom and LuF3, The Journal of Physical Chemistry A 112, 11431 (2008)

[51] J. P. Perdew, K. Burke, and M. Ernzerhof, Generalized Gradient Approximation Made Simple, Phys. Rev. Lett. 77, 3865 (1996)

[52] T. W. Ko, J. A. Finkler, S. Goedecker, and J. Behler, A fourth-generation high-dimensional neural network potential with accurate electrostatics including non-local charge transfer, Nature Communications 12, 398 (2021)

[53] F. Banhart, J. Kotakoski, and A. V. Krasheninnikov, Structural Defects in Graphene, ACS Nano 5, 26 (2011)

[54] J. J. Palacios, J. Fernández-Rossier, and L. Brey, Vacancy-induced magnetism in graphene and graphene ribbons, Physical Review B 77, 195428 (2008)

[55] S. Baroni, S. de Gironcoli, A. Dal Corso, and P. Giannozzi, Phonons and related crystal properties from density-functional perturbation theory, Rev. Mod. Phys. 73, 515 (2001)

[56] N. Mounet, M. Gibertini, P. Schwaller, D. Campi, A. Merkys, A. Marrazzo, T. Sohier, I. E. Castelli, A. Cepellotti, G. Pizzi, and N. Marzari, Two-dimensional materials from high-throughput computational exfoliation of experimentally known compounds (2020)

[57] A. Laio and M. Parrinello, Escaping free-energy minima, Proceedings of the National Academy of Sciences 99, 12562 (2002)

[58] M. Bonomi, D. Branduardi, G. Bussi, C. Camilloni, D. Provasi, P. Raiteri, D. Donadio, F. Marinelli, F. Pietrucci, R. A. Broglia, and M. Parrinello, PLUMED: A portable plugin for free-energy calculations with molecular dynamics, Computer Physics Communications 180, 1961 (2009)

[59] G. A. Tribello, M. Bonomi, D. Branduardi, C. Camilloni, and G. Bussi, PLUMED 2: New feathers for an old bird, Computer Physics Communications 185, 604 (2014)

[60] The PLUMED consortium, Promoting transparency and reproducibility in enhanced molecular simulations, Nature Methods 16, 670–673 (2019)

[61] S. P. Huber, E. Bosoni, M. Bercx, J. Bröder, A. Degomme, V. Dikan, K. Eimre, E. Flage-Larsen, A. Garcia, L. Genovese, D. Gresch, C. Johnston, G. Petretto, S. Poncé, G.-M. Rignanese, C. J. Sewell, B. Smit, V. Tseplyaev, M. Uhrin, D. Wortmann, A. V. Yakutovich, A. Zadoks, P. Zarabadi-Poor, B. Zhu, N. Marzari, and G. Pizzi, Common workflows for com...

[62] I. Timrov, N. Marzari, and M. Cococcioni, Hubbard parameters from density-functional perturbation theory, Phys. Rev. B 98, 085127 (2018)

[63] I. Timrov, N. Marzari, and M. Cococcioni, Self-consistent Hubbard parameters from density-functional perturbation theory in the ultrasoft and projector-augmented wave formulations, Phys. Rev. B 103, 045141 (2021)

[64] I. Timrov, N. Marzari, and M. Cococcioni, HP - A code for the calculation of Hubbard parameters using density-functional perturbation theory, Computer Physics Communications 279, 108455 (2022)

[65] L. Bastonero, C. Malica, E. Macke, M. Bercx, S. Huber, I. Timrov, and N. Marzari, First-principles Hubbard parameters with automated and reproducible workflows, npj Computational Materials 11, 183 (2025)

[66] M. Uhrin, A. Zadoks, L. Binci, N. Marzari, and I. Timrov, Machine learning Hubbard parameters with equivariant neural networks, npj Computational Materials 11, 19 (2025)

[67] P. Giannozzi, S. Baroni, N. Bonini, M. Calandra, R. Car, C. Cavazzoni, D. Ceresoli, G. L. Chiarotti, M. Cococcioni, I. Dabo, A. Dal Corso, S. de Gironcoli, S. Fabris, G. Fratesi, R. Gebauer, U. Gerstmann, C. Gougoussis, A. Kokalj, M. Lazzeri, L. Martin-Samos, N. Marzari, F. Mauri, R. Mazzarello, S. Paolini, A. Pasquarello, L. Paulatto, C. Sbraccia, S. S...

[68] P. Giannozzi, O. Andreussi, T. Brumme, O. Bunau, M. Buongiorno Nardelli, M. Calandra, R. Car, C. Cavazzoni, D. Ceresoli, M. Cococcioni, N. Colonna, I. Carnimeo, A. Dal Corso, S. de Gironcoli, P. Delugas, R. A. DiStasio, A. Ferretti, A. Floris, G. Fratesi, G. Fugallo, R. Gebauer, U. Gerstmann, F. Giustino, T. Gorni, J. Jia, M. Kawamura, H.-Y. Ko, A. Ko...

[69] P. Giannozzi, O. Baseggio, P. Bonfà, D. Brunato, R. Car, I. Carnimeo, C. Cavazzoni, S. de Gironcoli, P. Delugas, F. Ferrari Ruffino, A. Ferretti, N. Marzari, I. Timrov, A. Urru, and S. Baroni, Quantum ESPRESSO toward the exascale, The Journal of Chemical Physics 152, 154105 (2020)

[70] A. P. Thompson, H. M. Aktulga, R. Berger, D. S. Bolintineanu, W. M. Brown, P. S. Crozier, P. J. in ’t Veld, A. Kohlmeyer, S. G. Moore, T. D. Nguyen, R. Shan, M. J. Stevens, J. Tranchida, C. Trott, and S. J. Plimpton, LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales, Computer Physics Commun...

[71] S. Huber, M. Bercx, N. Hörmann, M. Uhrin, G. Pizzi, and N. Marzari, Materials Cloud three-dimensional crystals database (MC3D) (2022)

[72] Y. Zhou, W. A. Saidi, and K. A. Fichthorn, A Force Field for Describing the Polyvinylpyrrolidone-Mediated Solution-Phase Synthesis of Shape-Selective Ag Nanoparticles, The Journal of Physical Chemistry C 118, 3366 (2014)

[73] D. Bidoggia, N. Manko, M. Peressi, and A. Marrazzo, Automated training of neural-network interatomic potentials, 10.24435/materialscloud:8d-kj (2025)