pith. sign in

arxiv: 2605.19050 · v1 · pith:J5HAWMPWnew · submitted 2026-05-18 · 💻 cs.LG · physics.chem-ph· q-bio.QM

Generative Pseudo-Force Fields for Molecular Generation

Pith reviewed 2026-05-20 11:51 UTC · model grok-4.3

classification 💻 cs.LG physics.chem-phq-bio.QM
keywords molecular generationmachine learning force fieldsdiffusion modelspseudo-potentialsgenerative modelsQM9molecular conformationsdrug design
0
0 comments X

The pith

Generative pseudo-force fields allow training on Gaussian-perturbed equilibria to produce time-step-free molecular denoising via pseudo-forces.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes training machine learning force fields on a simple quadratic energy surface defined around known molecular equilibrium structures. Perturbations are created on the fly with Gaussian noise, eliminating the need for expensive quantum calculations on non-equilibrium geometries. The resulting pseudo-forces implicitly carry the noise magnitude information, so the model does not require explicit time-step inputs like standard diffusion models. This setup supports multiple sampling strategies and achieves high rates of chemically valid molecules even with very few model evaluations.

Core claim

By training on forces from a quadratic pseudo-potential relative to reference equilibria, the generative pseudo-force field acts as a time-step-agnostic variant of variance exploding diffusion models. The predicted forces serve as the score function for denoising, with their magnitudes encoding the current noise level, allowing use in standard samplers as well as adaptive and direct denoising methods that incorporate structural priors and constraints.

What carries the argument

Generative Pseudo-Force Field, a machine learning force field trained to predict forces derived from a quadratic pseudo-potential energy surface centered at reference equilibrium molecular structures.

If this is right

  • GPFF serves as a direct replacement for diffusion models in ancestral and Heun sampling algorithms.
  • Adaptive sampling variants become feasible due to the absence of time-step conditioning.
  • An MLFF-inspired direct denoising scheme can be applied for efficient generation.
  • Generation supports arbitrary structural priors and geometric constraints for targeted molecular design.
  • Validity reaches 100 percent at 256 neural function evaluations and exceeds 50 percent at 6 on the QM9 dataset.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Extending the approach to proteins or materials could leverage the same on-the-fly perturbation strategy for larger systems.
  • The method might integrate with existing machine learning force fields to add generative capabilities without retraining from scratch.
  • Real-time generation in a molecular editor points toward interactive tools for chemists exploring chemical space.
  • Testing on datasets with known unstable conformations could reveal limits of the pseudo-potential approximation.

Load-bearing premise

The magnitudes of forces from the quadratic pseudo-potential implicitly encode sufficient information about the noise level to generate valid and stable molecular conformations.

What would settle it

Compare validity rates of GPFF sampling against standard diffusion models on QM9 when the training forces are replaced with random magnitudes of the same scale to break the implicit noise encoding.

Figures

Figures reproduced from arXiv: 2605.19050 by Frank No\'e, Khaled Kahouli, Klaus-Robert M\"uller, Michael Plainer, Niklas Wolf Andreas Gebauer, Stefaan Simon Pierre Hessmann, Stefan Gugler.

Figure 1
Figure 1. Figure 1: Overview of generative pseudo-force fields (GPFFs). [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Comparison of GPFF (ours, solid) and DM baselines (dashed). Left: deterministic sampling, [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Alternative GPFF samplers and their similarity to QM9. Left: validity as a function of [PITH_FULL_IMAGE:figures/full_fig_p011_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Adaptive versus non-adaptive sampling. Panels compare non-adaptive DM (dashed), non [PITH_FULL_IMAGE:figures/full_fig_p012_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Each point represents a molecule mapped to the shape space spanned by rod, sphere, and [PITH_FULL_IMAGE:figures/full_fig_p013_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Iterative scaffold-guided design with shape-constrained denoising. Each row shows one stage [PITH_FULL_IMAGE:figures/full_fig_p014_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Adaptive sampling trajectories for ancestral, Heun, and stochastic Heun sampling with [PITH_FULL_IMAGE:figures/full_fig_p027_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Loss as a function of noise level σ for (a) GPFF and (b) DM, evaluated on preliminary models trained with uniformly sampled σ. Gray dots show individual test samples, the green line shows the smoothed mean loss, and the blue curve shows the fitted log-normal sampling density p(σ) used for the final training runs. sphere, and disc shapes (see Section 4.4). We train a lightweight generator to sample valid co… view at source ↗
Figure 9
Figure 9. Figure 9: Effect of atom index alignment during training on GPFF sampling. Validity as a function [PITH_FULL_IMAGE:figures/full_fig_p030_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Impact of time-step conditioning of GPFF models on sampling quality. Validity as a function [PITH_FULL_IMAGE:figures/full_fig_p030_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Impact of time-step conditioning of DMs on sampling quality. Validity as a function of [PITH_FULL_IMAGE:figures/full_fig_p031_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Distribution comparison to QM9 at 256 NFE. Rows correspond to direct denoising (GPFF), [PITH_FULL_IMAGE:figures/full_fig_p032_12.png] view at source ↗
read the original abstract

Generating stable molecular conformations typically forces a tradeoff between the physical realism of energy-based relaxation and the sampling efficiency of data-driven generative models. While machine learning force fields (MLFFs) can sample stable conformations by relaxing molecular geometries according to physical forces, they require costly ab-initio training data. Conversely, diffusion models (DMs) learn from equilibrium data alone but are dependent on noise schedules and time-step conditioning. In this work, we propose generative pseudo-force fields (GPFFs) to bridge these paradigms by training an MLFF on a quadratic pseudo-potential energy surface relative to reference equilibrium structures. Because no ab-initio calculations are required for the perturbed geometries, non-equilibrium training data can be generated on the fly by perturbing the equilibria with Gaussian noise. We show that GPFFs constitute a time-step-agnostic variant of variance exploding DMs: the score comes from the predicted pseudo-forces but because force magnitudes implicitly encode the noise level, no time-step conditioning is needed. Our GPFF can hence be used as a drop-in replacement in standard diffusion sampling (ancestral, Heun) but also facilitates more efficient, adaptive variants and an MLFF inspired direct denoising scheme. Our proposed sampling algorithms support arbitrary structural priors and geometric constraints. On QM9, GPFF has 100 % validity at 256 neural function evaluations (NFE) and over 50 % at just 6 NFE, outperforming diffusion baselines across all samplers. Combined with custom priors, we showcase the fast and accurate generation process of our method in a molecular editor for a drug design setting, where a molecule is generated in real time.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper introduces generative pseudo-force fields (GPFFs) for molecular conformation generation. It trains a neural network on pseudo-forces derived from a quadratic potential energy surface centered at reference equilibrium structures, with training data generated on-the-fly via Gaussian perturbations; no ab-initio calculations on non-equilibrium geometries are required. The authors frame GPFFs as a time-step-agnostic variant of variance-exploding diffusion models in which predicted force magnitudes implicitly encode noise level, enabling drop-in replacement in standard samplers (ancestral, Heun) as well as adaptive and direct-denoising schemes. On QM9 the method reports 100% validity at 256 neural function evaluations and >50% validity at 6 NFE while outperforming diffusion baselines; custom structural priors are demonstrated in a real-time molecular editor for drug-design use cases.

Significance. If the central performance claims and the implicit noise-encoding mechanism hold under rigorous verification, the work offers a practical bridge between machine-learning force fields and diffusion models that avoids costly ab-initio data for perturbed geometries and removes explicit time conditioning. The reported low-NFE validity and support for arbitrary priors could accelerate conformation sampling in computational chemistry and interactive molecular design. The on-the-fly data-generation strategy and the MLFF-inspired direct-denoising sampler are concrete strengths that merit further exploration.

major comments (3)
  1. [§3.2, Eq. (7)] §3.2 and Eq. (7): the assertion that force magnitudes implicitly encode noise level and thereby render explicit time-step conditioning unnecessary is load-bearing for the time-agnostic claim, yet the manuscript provides no quantitative analysis (e.g., correlation plots or ablation on magnitude regression error) showing that ||F_pred|| remains accurate across the full range of Gaussian perturbation variances used at training and inference.
  2. [Table 2] Table 2 (QM9 results): the headline figures of 100% validity at 256 NFE and >50% at 6 NFE lack reported error bars, the precise definition of 'validity' (chemical, geometric, or both), the number of independent runs, and the exact baseline implementations (including whether the diffusion models used identical network capacity and training data). These omissions prevent verification that the low-NFE gains are robust rather than artifacts of evaluation protocol.
  3. [§4.1] §4.1: the quadratic coefficient k is listed among free parameters, but no systematic study or cross-validation procedure is described for its selection; because k directly controls the magnitude distribution of the training targets, its arbitrary choice risks undermining the claim that magnitudes alone suffice for time-agnostic sampling without degrading validity below 50% at 6 NFE.
minor comments (2)
  1. [Figure 3] Figure 3 caption: the legend does not distinguish the GPFF direct-denoising curve from the adaptive-sampler curve; readers cannot immediately map the plotted lines to the algorithms described in §3.3.
  2. [§2.3] §2.3: the notation for the pseudo-potential V(x) = (k/2)||x - x_eq||^2 should explicitly state that x_eq is fixed per molecule and not updated during sampling, to avoid confusion with learned equilibrium structures in other MLFF literature.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their thorough review and constructive feedback on our manuscript. We address each of the major comments below and will incorporate revisions to strengthen the paper.

read point-by-point responses
  1. Referee: [§3.2, Eq. (7)] §3.2 and Eq. (7): the assertion that force magnitudes implicitly encode noise level and thereby render explicit time-step conditioning unnecessary is load-bearing for the time-agnostic claim, yet the manuscript provides no quantitative analysis (e.g., correlation plots or ablation on magnitude regression error) showing that ||F_pred|| remains accurate across the full range of Gaussian perturbation variances used at training and inference.

    Authors: We agree that empirical validation would strengthen the claim. The derivation in Section 3.2 and Equation (7) demonstrates that under the quadratic pseudo-potential, the expected force magnitude is proportional to the perturbation variance, allowing the network to implicitly learn the noise level from the force predictions. However, to address this concern, we will add quantitative analysis including correlation plots of predicted ||F|| versus true noise variance and an ablation on magnitude regression error across the training noise range in the revised manuscript. revision: yes

  2. Referee: [Table 2] Table 2 (QM9 results): the headline figures of 100% validity at 256 NFE and >50% at 6 NFE lack reported error bars, the precise definition of 'validity' (chemical, geometric, or both), the number of independent runs, and the exact baseline implementations (including whether the diffusion models used identical network capacity and training data). These omissions prevent verification that the low-NFE gains are robust rather than artifacts of evaluation protocol.

    Authors: We acknowledge these omissions and will revise Table 2 to include error bars computed over 5 independent runs, explicitly define validity as requiring both chemically valid molecules (via RDKit sanitization) and geometrically valid structures (bond lengths and angles within thresholds), report the number of runs, and detail that baselines were re-implemented with matching network architectures and the same QM9 training data for fair comparison. revision: yes

  3. Referee: [§4.1] §4.1: the quadratic coefficient k is listed among free parameters, but no systematic study or cross-validation procedure is described for its selection; because k directly controls the magnitude distribution of the training targets, its arbitrary choice risks undermining the claim that magnitudes alone suffice for time-agnostic sampling without degrading validity below 50% at 6 NFE.

    Authors: The value of k was chosen through initial experiments to ensure the pseudo-force magnitudes align with typical noise levels in the diffusion process. We will expand Section 4.1 to include a systematic sensitivity analysis and cross-validation procedure for k, demonstrating that the reported performance holds across a range of k values and does not critically depend on a specific choice. revision: yes

Circularity Check

1 steps flagged

GPFF time-step-agnostic property reduces to definitional encoding in quadratic pseudo-force targets

specific steps
  1. self definitional [Abstract]
    "We show that GPFFs constitute a time-step-agnostic variant of variance exploding DMs: the score comes from the predicted pseudo-forces but because force magnitudes implicitly encode the noise level, no time-step conditioning is needed."

    Training data are generated by perturbing equilibria with Gaussian noise and assigning targets F = -k(x - x_eq) from the quadratic pseudo-potential. Thus ||F|| = k * ||perturbation|| exactly, where the perturbation distance is the noise level. The implicit encoding and resulting time-agnostic sampling therefore hold by construction of the targets rather than as a non-trivial prediction or derivation.

full rationale

The paper's central framing—that GPFFs form a time-step-agnostic variant of variance-exploding diffusion models because force magnitudes implicitly encode noise level—is shown by direct inspection of the training construction rather than derived as an independent result. The quadratic pseudo-potential on Gaussian perturbations makes ||F|| proportional to perturbation size by definition, so the claimed absence of explicit t-conditioning follows tautologically. Empirical validity numbers on QM9 remain independent content and are not forced by this step, yielding partial rather than complete circularity. No load-bearing self-citations or other patterns appear in the provided text.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 1 invented entities

The central claim rests on the modeling choice of a quadratic pseudo-potential and the assumption that force magnitudes can stand in for explicit noise-level information; both are introduced without external benchmarks in the abstract.

free parameters (2)
  • quadratic potential coefficient
    Defines the strength of the restoring force toward each reference equilibrium structure; chosen to create the pseudo-energy surface.
  • Gaussian perturbation variance
    Controls the distribution of non-equilibrium training geometries generated on the fly.
axioms (1)
  • domain assumption A quadratic pseudo-potential around equilibrium geometries yields forces whose magnitudes implicitly encode the noise level.
    Invoked to justify removal of time-step conditioning in the generative process.
invented entities (1)
  • Generative Pseudo-Force Field (GPFF) no independent evidence
    purpose: Bridge between ML force fields and diffusion models for molecular sampling.
    Newly defined object whose properties are derived from the quadratic pseudo-potential construction.

pith-pipeline@v0.9.0 · 5862 in / 1549 out tokens · 115195 ms · 2026-05-20T11:51:26.546904+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

107 extracted references · 107 canonical work pages · 2 internal anchors

  1. [1]

    Hajduk and Jonathan Greer

    Philip J. Hajduk and Jonathan Greer. A decade of fragment-based drug design: strategic advances and lessons learned.Nature Reviews Drug Discovery, 6(3):211–219, 2007. doi: 10.1038/nrd2220

  2. [2]

    Novel mixed polyanions lithium-ion battery cathode materials predicted by high-throughput ab initio computations.Journal of Materials Chemistry, 21(43):17147–17153, 2011

    Geoffroy Hautier, Anubhav Jain, Hailong Chen, Charles Moore, Shyue Ping Ong, and Gerbrand Ceder. Novel mixed polyanions lithium-ion battery cathode materials predicted by high-throughput ab initio computations.Journal of Materials Chemistry, 21(43):17147–17153, 2011. doi: 10.1039/ c1jm12216a

  3. [3]

    Anatole von Lilienfeld

    Matthias Rupp, Alexandre Tkatchenko, Klaus-Robert M¨ uller, and O. Anatole von Lilienfeld. Fast and accurate modeling of molecular atomization energies with machine learning.Physical Review Letters, 108(5):058301, 2012

  4. [4]

    The chemical space project.Accounts of Chemical Research, 48(3):722–730, February 2015

    Jean-Louis Reymond. The chemical space project.Accounts of Chemical Research, 48(3):722–730, February 2015. ISSN 1520-4898. doi: 10.1021/ar500432k. URL http://dx.doi.org/10.1021/ ar500432k

  5. [5]

    Boltzmann generators: Sampling equilibrium states of many-body systems with deep learning.Science, 365(6457):eaaw1147, 2019

    Frank No´ e, Simon Olsson, Jonas K¨ ohler, and Hao Wu. Boltzmann generators: Sampling equilibrium states of many-body systems with deep learning.Science, 365(6457):eaaw1147, 2019. doi: 10.1126/science.aaw1147

  6. [6]

    Anatole von Lilienfeld, Klaus-Robert M¨ uller, and Alexandre Tkatchenko

    O. Anatole von Lilienfeld, Klaus-Robert M¨ uller, and Alexandre Tkatchenko. Exploring chemical compound space with quantum-based machine learning.Nature Reviews Chemistry, 4(7):347–358,

  7. [7]

    doi: 10.1038/s41570-020-0189-9

  8. [8]

    Stefan Gugler, Jon Paul Janet, and Heather J. Kulik. Enumeration of de novo inorganic complexes for chemical discovery and machine learning.Molecular Systems Design & Engineering, 5(1): 139–152, 2020. ISSN 2058-9689. doi: 10.1039/c9me00069k. URL http://dx.doi.org/10.1039/ C9ME00069K

  9. [9]

    Castelli, Juan Maria Garcia-Lastra, Peter Bjørn Jørgensen, Ole Winther, and Tejs Vegge

    Arghya Bhowmik, Ivano E. Castelli, Juan Maria Garcia-Lastra, Peter Bjørn Jørgensen, Ole Winther, and Tejs Vegge. A perspective on inverse design of battery interphases using multi-scale modelling, experiments and generative deep learning.Energy Storage Materials, 21:446–456, 2019. ISSN 2405-8297. doi: 10.1016/j.ensm.2019.06.011

  10. [10]

    Freeze, H

    Jessica G. Freeze, H. Ray Kelly, and Victor S. Batista. Search for catalysts by inverse design: Artificial intelligence, mountain climbers, and alchemists.Chemical Reviews, 119(11):6595–6612,

  11. [11]

    PMID: 31059236

    doi: 10.1021/acs.chemrev.8b00759. PMID: 31059236

  12. [12]

    Inverse-qspr for de novo design: A review.Molecular Informatics, 39(4):1900087, 2020

    Philippe Gantzer, Benoit Creton, and Carlos Nieto-Draghi. Inverse-qspr for de novo design: A review.Molecular Informatics, 39(4):1900087, 2020. doi: 10.1002/minf.201900087

  13. [13]

    Joshi, Niklas W

    Rajendra P. Joshi, Niklas W. A. Gebauer, Mridula Bontha, Mercedeh Khazaieli, Rhema M. James, James B. Brown, and Neeraj Kumar. 3D-Scaffold: A deep learning framework to generate 3d coordinates of drug-like molecules with desired scaffolds.Journal of Physical Chemistry B, 125(44):12166–12176, 2021. doi: 10.1021/acs.jpcb.1c06437

  14. [14]

    Miteva, Anne-Claude Camproux, and Bruno O

    St´ ephanie P´ erot, Olivier Sperandio, Maria A. Miteva, Anne-Claude Camproux, and Bruno O. Villoutreix. Druggable pockets and binding site centric chemical space: a paradigm shift in drug discovery.Drug Discovery Today, 15(15–16):656–667, August 2010. ISSN 1359-6446. doi: 10.1016/j.drudis.2010.05.015. URLhttp://dx.doi.org/10.1016/j.drudis.2010.05.015

  15. [15]

    Wiley, 3 edition, 2017

    Frank Jensen.Introduction to Computational Chemistry. Wiley, 3 edition, 2017. ISBN 978-1-118- 82599-0. 17

  16. [16]

    Cramer.Essentials of Computational Chemistry: Theories and Models

    Christopher J. Cramer.Essentials of Computational Chemistry: Theories and Models. Wiley, 2 edition, 2004. ISBN 978-0-470-09182-1

  17. [17]

    Bernhard Schlegel

    H. Bernhard Schlegel. Geometry optimization.WIREs Computational Molecular Science, 1 (5):790–809, May 2011. ISSN 1759-0884. doi: 10.1002/wcms.34. URL http://dx.doi.org/10. 1002/wcms.34

  18. [18]

    Inhomogeneous electron gas.Physical Review, 136(3B):B864,

    Pierre Hohenberg and Walter Kohn. Inhomogeneous electron gas.Physical Review, 136(3B):B864,

  19. [19]

    doi: 10.1103/PhysRev.136.B864

  20. [20]

    Machine Learning for Molecular Simulation

    Frank No´ e, Alexandre Tkatchenko, Klaus-Robert M¨ uller, and Cecilia Clementi. Machine learning for molecular simulation.Annual Review of Physical Chemistry, 71(1):361–390, 2020. doi: 10.1146/annurev-physchem-042018-052331

  21. [21]

    Chemical Reviews 121(16), 10142–10186 (2021) https: //doi.org/10.1021/acs.chemrev.0c01111

    Oliver T. Unke, Stefan Chmiela, Huziel E. Sauceda, Michael Gastegger, Igor Poltavsky, Kristof T. Sch¨ utt, Alexandre Tkatchenko, and Klaus-Robert M¨ uller. Machine learning force fields.Chemical Reviews, 121(16):10142–10186, 2021. doi: 10.1021/acs.chemrev.0c01111

  22. [22]

    Unke, Martin St¨ ohr, Stefan Ganscha, Thomas Unterthiner, Hartmut Maennel, Sergii Kashubin, Daniel Ahlin, Michael Gastegger, Leonardo Medrano Sandonas, Joshua T

    Oliver T. Unke, Martin St¨ ohr, Stefan Ganscha, Thomas Unterthiner, Hartmut Maennel, Sergii Kashubin, Daniel Ahlin, Michael Gastegger, Leonardo Medrano Sandonas, Joshua T. Berryman, Alexandre Tkatchenko, and Klaus-Robert M¨ uller. Biomolecular dynamics with machine-learned quantum-mechanical force fields trained on diverse chemical fragments.Science Advan...

  23. [23]

    Physics-inspired structural representations for molecules and materials.Chemical reviews, 121(16):9759–9815, 2021

    Felix Musil, Andrea Grisafi, Albert P Bart´ ok, Christoph Ortner, G´ abor Cs´ anyi, and Michele Ceriotti. Physics-inspired structural representations for molecules and materials.Chemical reviews, 121(16):9759–9815, 2021

  24. [24]

    Molecular similarity in machine learning of energies in chemical reaction networks, 2025

    Stefan Gugler and Markus Reiher. Molecular similarity in machine learning of energies in chemical reaction networks, 2025. URLhttps://arxiv.org/abs/2504.18742

  25. [25]

    and Poltavsky, Igor and Schütt, Kristof T

    Stefan Chmiela, Alexandre Tkatchenko, Huziel E. Sauceda, Igor Poltavsky, Kristof T. Sch¨ utt, and Klaus-Robert M¨ uller. Machine learning of accurate energy-conserving molecular force fields. Science Advances, 3(5):e1603015, 2017. doi: 10.1126/sciadv.1603015

  26. [26]

    Computer Physics Communications 240, 38–45 (2019) https://doi.org/10.1016/j.cpc.2019.02.007

    Stefan Chmiela, Huziel E. Sauceda, Igor Poltavsky, Klaus-Robert M¨ uller, and Alexandre Tkatchenko. sgdml: Constructing accurate and data efficient molecular force fields using machine learning.Computer Physics Communications, 240:38–45, 2019. doi: 10.1016/j.cpc.2019.02.007

  27. [27]

    Sauceda, Klaus-Robert M¨ uller, and Alexandre Tkatchenko

    Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert M¨ uller, and Alexandre Tkatchenko. Towards exact molecular dynamics simulations with machine-learned force fields.Nature Communications, 9(1):3887, 2018. doi: 10.1038/s41467-018-06169-2

  28. [28]

    Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons.Physical review letters, 104(13):136403, 2010

    Albert P Bart´ ok, Mike C Payne, Risi Kondor, and G´ abor Cs´ anyi. Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons.Physical review letters, 104(13):136403, 2010

  29. [29]

    Machine learning unifies the modeling of materials and molecules.Science advances, 3(12):e1701816, 2017

    Albert P Bart´ ok, Sandip De, Carl Poelking, Noam Bernstein, James R Kermode, G´ abor Cs´ anyi, and Michele Ceriotti. Machine learning unifies the modeling of materials and molecules.Science advances, 3(12):e1701816, 2017

  30. [30]

    Anatole von Lilienfeld

    Gr´ egoire Montavon, Matthias Rupp, Vivekanand Gobre, Alvaro Vazquez-Mayagoitia, Katja Hansen, Alexandre Tkatchenko, Klaus-Robert M¨ uller, and O. Anatole von Lilienfeld. Machine learning of molecular electronic properties in chemical compound space.New Journal of Physics, 15(9):095003, 2013. 18

  31. [31]

    Sch¨ utt, Pieter-Jan Kindermans, Huziel E

    Kristof T. Sch¨ utt, Pieter-Jan Kindermans, Huziel E. Sauceda, Stefan Chmiela, Alexandre Tkatchenko, and Klaus-Robert M¨ uller. SchNet: A continuous-filter convolutional neural network for modeling quantum interactions. In I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, editors,Advances in Neural Information Pr...

  32. [32]

    Sch¨ utt, Huziel E

    Kristof T. Sch¨ utt, Huziel E. Sauceda, Pieter-Jan Kindermans, Alexandre Tkatchenko, and Klaus- Robert M¨ uller. SchNet – A deep learning architecture for molecules and materials.The Journal of Chemical Physics, 148(24):241722, 2018. doi: 10.1063/1.5019779

  33. [33]

    Unke, Stefan Chmiela, Michael Gastegger, Kristof T

    Oliver T. Unke, Stefan Chmiela, Michael Gastegger, Kristof T. Sch¨ utt, Huziel E. Sauceda, and Klaus-Robert M¨ uller. Spookynet: Learning force fields with electronic degrees of freedom and nonlocal effects.Nature Communications, 12(1):7273, 2021. doi: 10.1038/s41467-021-27504-0

  34. [34]

    Unke and Markus Meuwly

    Oliver T. Unke and Markus Meuwly. PhysNet: a neural network for predicting energies, forces, dipole moments, and partial charges.Journal of Chemical Theory and Computation, 15(6): 3678–3693, 2019. doi: 10.1021/acs.jctc.9b00181

  35. [35]

    Sch¨ utt, Oliver T

    Kristof T. Sch¨ utt, Oliver T. Unke, and Michael Gastegger. Equivariant message passing for the prediction of tensorial properties and molecular spectra. In Marina Meila and Tong Zhang, editors,Proceedings of the 38th International Conference on Machine Learning, volume 139 ofProceedings of Machine Learning Research, pages 9377–9388. PMLR, 18–24 Jul 2021....

  36. [36]

    Keith, Valentin Vassilev-Galindo, Bingqing Cheng, Stefan Chmiela, Michael Gastegger, Klaus-Robert M¨ uller, and Alexandre Tkatchenko

    John A. Keith, Valentin Vassilev-Galindo, Bingqing Cheng, Stefan Chmiela, Michael Gastegger, Klaus-Robert M¨ uller, and Alexandre Tkatchenko. Combining machine learning and computational chemistry for predictive insights into chemical systems.Chemical Reviews, 121(16):9816–9872, 2021

  37. [37]

    Mailoa, Mordechai Kornbluth, Nicola Molinari, Tess E

    Simon Batzner, Albert Musaelian, Lixin Sun, Mario Geiger, Jonathan P. Mailoa, Mordechai Kornbluth, Nicola Molinari, Tess E. Smidt, and Boris Kozinsky. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials.Nature Communications, 13(1): 2453, May 2022. ISSN 2041-1723. doi: 10.1038/s41467-022-29939-5

  38. [38]

    A foundation model for atomistic materials chemistry.The Journal of chemical physics, 163(18), 2025

    Ilyes Batatia, Philipp Benner, Yuan Chiang, Alin M Elena, D´ avid P Kov´ acs, Janosh Riebesell, Xavier R Advincula, Mark Asta, Matthew Avaylon, William J Baldwin, et al. A foundation model for atomistic materials chemistry.The Journal of chemical physics, 163(18), 2025

  39. [39]

    Thorben Frank, Oliver T

    J. Thorben Frank, Oliver T. Unke, and Klaus-Robert M¨ uller. So3krates: Equivariant attention for interactions on arbitrary length-scales in molecular systems. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh, editors,Advances in Neural Information Processing Systems, volume 35, pages 29400–29413. Curran Asso- ciates, Inc., 2022. URL h...

  40. [40]

    Thorben Frank, Oliver T

    J. Thorben Frank, Oliver T. Unke, Klaus-Robert M¨ uller, and Stefan Chmiela. A euclidean transformer for fast and stable machine learned force fields.Nature Communications, 15(1):6539, 2024

  41. [41]

    Unke, Klaus-Robert M¨ uller, and Stefan Gugler

    Max Eissler, Tim Korjakow, Stefan Ganscha, Oliver T. Unke, Klaus-Robert M¨ uller, and Stefan Gugler. How simple can you go? an off-the-shelf transformer approach to molecular dynamics. The Journal of Chemical Physics, 164(9), 2026. 19

  42. [42]

    Thorben Frank, Stefan Chmiela, Klaus-Robert M¨ uller, and Oliver T

    J. Thorben Frank, Stefan Chmiela, Klaus-Robert M¨ uller, and Oliver T. Unke. Machine learning global atomic representations with euclidean fast attention.Nature Machine Intelligence, 8(3): 388–402, 2026

  43. [43]

    Klara Bonneau, Aldo S. Pasos-Trejo, Michael Plainer, Luca Sagresti, Jacopo Venturin, Iryna Zaporozhets, Alessandro Caruso, Edoardo Rolando, Andrea Guljas, Leon Klein, Maximilian Schebek, Filippo Albani, Raquel L´ opez-R´ ıos de Castro, Zakariya El Machachi, Lorenzo Giambagli, and Cecilia Clementi. Breaking the barriers of molecular dynamics with deep-lear...

  44. [44]

    Unke, Daniel Ahlin, Hartmut Maennel, Sergii Kashubin, and Klaus- Robert M¨ uller

    Stefan Ganscha, Oliver T. Unke, Daniel Ahlin, Hartmut Maennel, Sergii Kashubin, and Klaus- Robert M¨ uller. The qcml dataset, quantum chemistry reference data from 33.5 m dft and 14.7 b semi-empirical calculations.Scientific Data, 12(1):406, 2025

  45. [45]

    Smith, Roman Zubatyuk, Benjamin Nebgen, Nicholas Lubbers, Kipton Barros, Adrian E

    Justin S. Smith, Roman Zubatyuk, Benjamin Nebgen, Nicholas Lubbers, Kipton Barros, Adrian E. Roitberg, Olexandr Isayev, and Sergei Tretiak. The ANI-1ccx and ANI-1x data sets, coupled- cluster and density functional theory properties for molecules.Scientific Data, 7(1):134, 2020. doi: 10.1038/s41597-020-0473-z

  46. [46]

    Ernst, Alvaro Vazquez-Mayagoitia, Robert A

    Johannes Hoja, Leonardo Medrano Sandonas, Brian G. Ernst, Alvaro Vazquez-Mayagoitia, Robert A. DiStasio Jr., and Alexandre Tkatchenko. QM7-X, a comprehensive dataset of quantum- mechanical properties spanning the chemical space of small organic molecules.Scientific Data, 8 (1):43, Feb 2021. ISSN 2052-4463. doi: 10.1038/s41597-021-00812-2

  47. [47]

    Khaled Kahouli, Stefaan S. P. Hessmann, Klaus-Robert M¨ uller, Shinichi Nakajima, Stefan Gugler, and Niklas W. A. Gebauer. Molecular relaxation by reverse diffusion with time step prediction. Machine Learning: Science and Technology, 5(3):035038, 2024

  48. [48]

    Equivariant flows: Exact likelihood generative learning for symmetric densities

    Jonas K¨ ohler, Leon Klein, and Frank No´ e. Equivariant flows: Exact likelihood generative learning for symmetric densities. In Hal Daum´ e III and Aarti Singh, editors,Proceedings of the 37th International Conference on Machine Learning, volume 119 ofProceedings of Machine Learning Research, pages 5361–5370. PMLR, 13–18 Jul 2020. URL https://proceedings...

  49. [49]

    E(n) equivariant normalizing flows

    Victor Garcia Satorras, Emiel Hoogeboom, Fabian Fuchs, Ingmar Posner, and Max Welling. E(n) equivariant normalizing flows. In M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan, editors,Advances in Neural Information Processing Systems, volume 34, pages 4181–4192. Curran Associates, Inc., 2021. URL https://proceedings.neurips.cc/ p...

  50. [50]

    Leon Klein, Andrew Y. K. Foong, Tor Erlend Fjelde, Bruno Kacper Mlodozeniec, Marc Brockschmidt, Sebastian Nowozin, Frank No´ e, and Ryota Tomioka. Timewarp: Transferable acceleration of molecular dynamics by learning time-coarsened dynamics. InThirty-seventh Conference on Neural Information Processing Systems, 2023. URL https://openreview.net/ forum?id=EjMLpTgvKH

  51. [51]

    Veeling, Iryna Zaporozhets, Yaoyi Chen, Soojung Yang, Adam E

    Sarah Lewis, Tim Hempel, Jos´ e Jim´ enez-Luna, Michael Gastegger, Yu Xie, Andrew YK Foong, Victor Garc´ ıa Satorras, Osama Abdin, Bastiaan S. Veeling, Iryna Zaporozhets, Yaoyi Chen, Soojung Yang, Adam E. Foster, Arne Schneuing, Jigyasa Nigam, Federico Barbero, Vincent Stimper, Andrew Campbell, Jason Yim, Marten Lienen, Yu Shi, Shuxin Zheng, Hannes Schulz...

  52. [52]

    doi: 10.1126/science.adv9817. 20

  53. [53]

    Niklas W. A. Gebauer, Michael Gastegger, and Kristof T. Sch¨ utt. Symmetry-adapted generation of 3d point sets for the targeted discovery of molecules. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alch´ e-Buc, E. Fox, and R. Garnett, editors,Ad- vances in Neural Information Processing Systems, volume 32, pages 7566–7578. Curran Asso- ciates, Inc., 2...

  54. [54]

    Niklas W. A. Gebauer, Michael Gastegger, Stefaan S. P. Hessmann, Klaus-Robert M¨ uller, and Kristof T. Sch¨ utt. Inverse design of 3d molecular structures with conditional generative neural networks.Nature Communications, 13(1):973, 2022. ISSN 2041-1723. doi: 10.1038/ s41467-022-28526-y

  55. [55]

    Denoising diffusion probabilistic models

    Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models. In H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, editors,Advances in Neural Information Processing Systems, volume 33, pages 6840–6851. Curran Asso- ciates, Inc., 2020. URL https://proceedings.neurips.cc/paper_files/paper/2020/file/ 4c5bcfec8584af0d967f1...

  56. [56]

    Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole

    Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-based generative modeling through stochastic differential equations. In International Conference on Learning Representations, 2021. URL https://openreview.net/ forum?id=PxTIG12RRHS

  57. [57]

    Equivariant diffusion for molecule generation in 3D

    Emiel Hoogeboom, V´ ıctor Garcia Satorras, Cl´ ement Vignac, and Max Welling. Equivariant diffusion for molecule generation in 3D. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvari, Gang Niu, and Sivan Sabato, editors,Proceedings of the 39th International Conference on Machine Learning, volume 162 ofProceedings of Machine Learning Researc...

  58. [58]

    Diffusion-based molecule generation with informative prior bridges

    Lemeng Wu, Chengyue Gong, Xingchao Liu, Mao Ye, and Qiang Liu. Diffusion-based molecule generation with informative prior bridges. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh, editors,Advances in Neural Information Processing Systems, volume 35, pages 36533–36545. Curran Associates, Inc., 2022. URL https://proceedings.neurips.cc/p...

  59. [59]

    Mdm: Molecular diffusion model for 3d molecule generation.Proceedings of the AAAI Conference on Artificial Intelligence, 37(4):5105–5112, Jun

    Lei Huang, Hengtong Zhang, Tingyang Xu, and Ka-Chun Wong. Mdm: Molecular diffusion model for 3d molecule generation.Proceedings of the AAAI Conference on Artificial Intelligence, 37(4):5105–5112, Jun. 2023. doi: 10.1609/aaai.v37i4.25639. URL https://ojs.aaai.org/index. php/AAAI/article/view/25639

  60. [60]

    Powers, Ron O

    Minkai Xu, Alexander S. Powers, Ron O. Dror, Stefano Ermon, and Jure Leskovec. Geometric latent diffusion models for 3D molecule generation. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett, editors,Proceedings of the 40th International Conference on Machine Learning, volume 202 ofProceedings of Mac...

  61. [61]

    MolDiff: Addressing the atom-bond inconsistency problem in 3D molecule diffusion generation

    Xingang Peng, Jiaqi Guan, Qiang Liu, and Jianzhu Ma. MolDiff: Addressing the atom-bond inconsistency problem in 3D molecule diffusion generation. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett, editors,Proceedings of the 40th International Conference on Machine Learning, volume 202 ofProceedings o...

  62. [62]

    GeoDiff: A geometric diffusion model for molecular conformation generation

    Minkai Xu, Lantao Yu, Yang Song, Chence Shi, Stefano Ermon, and Jian Tang. GeoDiff: A geometric diffusion model for molecular conformation generation. InInternational Conference on Learning Representations, 2022. URLhttps://openreview.net/forum?id=PzcvxEMzvQC

  63. [63]

    Jaakkola

    Bowen Jing, Gabriele Corso, Jeffrey Chang, Regina Barzilay, and Tommi S. Jaakkola. Torsional diffusion for molecular conformer generation. In Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho, editors,Advances in Neural Information Processing Systems, 2022. URL https://openreview.net/forum?id=w6fj2r62r_H

  64. [64]

    Thorben Frank, Winfried Ripken, Gregor Lied, Klaus-Robert M¨ uller, Oliver T

    J. Thorben Frank, Winfried Ripken, Gregor Lied, Klaus-Robert M¨ uller, Oliver T. Unke, and Stefan Chmiela. Sampling 3d molecular conformers with diffusion transformers.Advances in Neural Information Processing Systems, 38:168881–168931, 2026

  65. [65]

    Nathaniel R. Bennett, Brian Coventry, Inna Goreshnik, Buwei Huang, Aza Allen, Dionne Vafeados, Ying Po Peng, Justas Dauparas, Minkyung Baek, Lance Stewart, Frank DiMaio, Steven De Munck, Savvas N. Savvides, and David Baker. Improving de novo protein binder design with deep learning. Nature Communications, 14(1), May 2023. ISSN 2041-1723. doi: 10.1038/s414...

  66. [66]

    Xie, T., Xue, S., Feng, Z., Hu, T., Sun, J., Li, Z., and Zhang, C

    Joseph L. Watson, David Juergens, Nathaniel R. Bennett, Brian L. Trippe, Jason Yim, Helen E. Eisenach, Woody Ahern, Andrew J. Borst, Robert J. Ragotte, Lukas F. Milles, Basile I. M. Wicky, Nikita Hanikel, Samuel J. Pellock, Alexis Courbet, William Sheffler, Jue Wang, Preetham Venkatesh, Isaac Sappington, Susana V´ azquez Torres, Anna Lauko, Valentin De Bo...

  67. [67]

    Progressive Distillation for Fast Sampling of Diffusion Models

    Tim Salimans and Jonathan Ho. Progressive distillation for fast sampling of diffusion models. arXiv preprint arXiv:2202.00512, 2022

  68. [68]

    Knowledge Distillation in Iterative Generative Models for Improved Sampling Speed

    Eric Luhman and Troy Luhman. Knowledge distillation in iterative generative models for improved sampling speed.arXiv preprint arXiv:2101.02388, 2021

  69. [69]

    One-step diffusion via shortcut models

    Kevin Frans, Danijar Hafner, Sergey Levine, and Pieter Abbeel. One-step diffusion via shortcut models. InProceedings of the Thirteenth International Conference on Learning Representations (ICLR), 2025

  70. [70]

    Improved denoising diffusion probabilistic models

    Alexander Quinn Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. In Marina Meila and Tong Zhang, editors,Proceedings of the 38th International Conference on Machine Learning, volume 139 ofProceedings of Machine Learning Research, pages 8162–8171. PMLR, 18–24 Jul 2021. URLhttps://proceedings.mlr.press/v139/nichol21a.html

  71. [71]

    Elucidating the design space of diffusion-based generative models

    Tero Karras, Miika Aittala, Timo Aila, and Samuli Laine. Elucidating the design space of diffusion-based generative models. In Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho, editors,Advances in Neural Information Processing Systems, 2022. URL https://openreview.net/forum?id=k7FuTOWMOc7

  72. [72]

    On the importance of noise scheduling for diffu- sion models

    Ting Chen. On the importance of noise scheduling for diffusion models.arXiv preprint arXiv:2301.10972, 2023

  73. [73]

    Common diffusion noise schedules and sample steps are flawed

    Shanchuan Lin, Bingchen Liu, Jiashi Li, and Xiao Yang. Common diffusion noise schedules and sample steps are flawed. InProceedings of the IEEE/CVF winter conference on applications of computer vision, pages 5404–5411, 2024. 22

  74. [74]

    Unke, Klaus-Robert M¨ uller, and Shinichi Nakajima

    Khaled Kahouli, Winfried Ripken, Stefan Gugler, Oliver T. Unke, Klaus-Robert M¨ uller, and Shinichi Nakajima. Disentangling total-variance and signal-to-noise-ratio improves diffusion models, 2025. URLhttps://arxiv.org/abs/2502.08598

  75. [75]

    Consistency models

    Yang Song, Prafulla Dhariwal, Mark Chen, and Ilya Sutskever. Consistency models. InProceedings of the 40th International Conference on Machine Learning, pages 32211–32252, 2023

  76. [76]

    Consistency trajectory models: Learning probability flow ODE trajectory of diffusion

    Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Yutong He, Yuki Mitsufuji, and Stefano Ermon. Consistency trajectory models: Learning probability flow ODE trajectory of diffusion. InThe Twelfth International Conference on Learning Representations, 2024. URLhttps://openreview.net/forum?id=ymjI8feDTD

  77. [77]

    How to build a consistency model: Learning flow maps via self-distillation.arXiv preprint arXiv:2505.18825, 2025

    Nicholas M. Boffi, Michael S. Albergo, and Eric Vanden-Eijnden. How to build a consistency model: Learning flow maps via self-distillation, 2025. URLhttps://arxiv.org/abs/2505.18825

  78. [78]

    Foster, Jos´ e Jim´ enez-Luna, Tim Hempel, Michael Gastegger, Yaoyi Chen, Iryna Zaporozhets, Cecilia Clementi, Christopher M

    Yu Xie, Ludwig Winkler, Lixin Sun, Sarah Lewis, Adam E. Foster, Jos´ e Jim´ enez-Luna, Tim Hempel, Michael Gastegger, Yaoyi Chen, Iryna Zaporozhets, Cecilia Clementi, Christopher M. Bishop, and Frank No´ e. Enhanced diffusion sampling: Efficient rare event sampling and free energy calculation with diffusion models, 2026. URLhttps://arxiv.org/abs/2602.16634

  79. [79]

    Generative modeling by estimating gradients of the data distribution

    Yang Song and Stefano Ermon. Generative modeling by estimating gradients of the data distribution. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alch´ e-Buc, E. Fox, and R. Gar- nett, editors,Advances in Neural Information Processing Systems, volume 32. Curran Asso- ciates, Inc., 2019. URL https://proceedings.neurips.cc/paper_files/paper/2019/file/ 3...

  80. [80]

    Dral, Matthias Rupp, and O

    Raghunathan Ramakrishnan, Pavlo O. Dral, Matthias Rupp, and O. Anatole von Lilienfeld. Quantum chemistry structures and properties of 134 kilo molecules.Scientific Data, 1(1):1–7, 2014

Showing first 80 references.