Unsupervised Thermodynamics of Molecular Diffusion Models: Action-Operator Semantics and Auditable Free-Energy Readout

Wenjie Xi

arxiv: 2606.30687 · v1 · pith:J45TQSIGnew · submitted 2026-06-28 · ⚛️ physics.chem-ph · cond-mat.stat-mech· cs.AI

Unsupervised Thermodynamics of Molecular Diffusion Models: Action-Operator Semantics and Auditable Free-Energy Readout

Wenjie Xi This is my paper

Pith reviewed 2026-07-01 06:50 UTC · model grok-4.3

classification ⚛️ physics.chem-ph cond-mat.stat-mechcs.AI

keywords diffusion modelsmolecular thermodynamicsalchemical free energyaction-operator frameworknoisy operator bridgefree energy estimationunsupervised thermodynamics

0 comments

The pith

An action-operator framework allows diffusion models to compute alchemical free-energy differences from their learned score fields.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a framework that assigns thermodynamic meaning to the representations learned by molecular diffusion models. It defines a molecular environment as a base action and an alchemical change as an operator, showing that diffusion noising creates noisy versions whose properties are captured by the model's outputs. This setup supports a noisy operator bridge that extracts free-energy differences directly from generated samples. Tests on alanine dipeptide recover action and operator shapes with physical biases, and on a ligand perturbation the method matches reference values closely even without phase-space overlap.

Core claim

By defining a fixed molecular environment as base action S_0(x) and an alchemical perturbation as operator O(x), standard diffusion noising induces effective noised actions and operators. The model's learned fields directly represent the gradients and alchemical derivatives of these quantities. This self-consistency enables a noisy operator bridge to read out free-energy differences ΔF from endpoint ensembles and per-frame evaluations, demonstrated on systems including alanine dipeptide and a C6-H to C6-F perturbation where it agrees with MBAR within 1 k_B T.

What carries the argument

The action-operator framework in which a base action S_0(x) represents the fixed environment and an operator O(x) the alchemical perturbation, with diffusion noising inducing versions whose derivatives are given by the model fields, enabling the noisy operator bridge for ΔF readout.

Load-bearing premise

That standard diffusion noising induces effective noised actions and operators whose gradients and alchemical derivatives are directly represented by the model's learned fields.

What would settle it

Recomputing the bridge estimate on the C6-H to C6-F ligand-pocket system and finding a deviation larger than 2 k_B T from the independent 19-state MBAR reference.

read the original abstract

Diffusion models are increasingly utilized for modeling molecular structures and conformational ensembles, yet the thermodynamic meaning of their learned representations and scores remains elusive. To resolve this ambiguity, we introduce a mathematically consistent action-operator framework natively compatible with diffusion models. By defining a fixed molecular environment as a base action $S_0(x)$ and an alchemical perturbation as an operator $O(x)$, standard diffusion noising induces effective noised actions and operators whose gradients and alchemical derivatives are directly represented by the model's learned fields. This rigorous self-consistency enables a ``noisy operator bridge'' capable of reading out free-energy differences ($\Delta F$) from endpoint ensembles and per-frame evaluations. In controlled experiments on alanine dipeptide systems, we show that incorporating physical inductive biases enables partial recovery of the base action and perturbation operator. When applied to a challenging C6-H to C6-F ligand-pocket nonbonded perturbation (185L/IND) with negligible phase-space overlap, our supervised bridge estimates the alchemical $\Delta F$ within approximately $1\ k_\mathrm{B}T$ of a stable 19-state MBAR reference. Finally, we demonstrate that endpoint coordinates and binary labels alone are sufficient to partially recover the operator shape and a centered free-energy scale without any force or action supervision. This work provides a rigorous path toward transforming generative molecular diffusion models from black-box coordinate samplers into auditable thermodynamic estimators.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper claims a new action-operator bridge lets diffusion models output alchemical free energies from endpoints alone, but the noising-to-operator derivative step is asserted more than derived.

read the letter

The main thing to know is that this work frames molecular diffusion models with a base action S0 and an alchemical operator O, then argues that ordinary noising turns the learned score into the gradient of the noised versions, allowing a supervised bridge to read out ΔF. They report that on the C6-H to C6-F perturbation in 185L/IND the estimate lands within about 1 kBT of a 19-state MBAR reference even with negligible overlap, and that endpoint coordinates plus binary labels alone recover part of the operator shape.

What is actually new is the specific construction of the noisy operator bridge and the claim that standard diffusion already supplies the necessary noised actions and operators without extra machinery. The unsupervised recovery result is also a distinct angle not directly in the cited diffusion or alchemical papers.

The numerical agreement on a hard test case is the strongest concrete output. The framework is presented as mathematically self-consistent, which is a clean way to link generative models to thermodynamics.

The soft spot is the load-bearing assumption that diffusion noising induces effective noised operators whose alchemical derivatives are exactly captured by the model's learned fields. The abstract states this follows directly from standard noising, but without visible derivation steps, error analysis, or controls on how the perturbation operator is noised (especially nonbonded terms), it is hard to judge whether the mapping is exact or approximate. If that step does not hold precisely, the endpoint-only readout could carry uncorrected bias. The reported agreement is encouraging but rests on unexamined details.

This is for computational chemists already running diffusion models who want thermodynamic readouts. A reader looking for new interpretive tools on generative models would find the ideas worth examining. It deserves peer review because the claim is specific enough to be checked and the application is relevant, even though the current evidence is preliminary.

Referee Report

2 major / 2 minor

Summary. The paper introduces an action-operator framework for diffusion models of molecular systems. It defines a base action S0(x) for a fixed environment and an alchemical perturbation as operator O(x); standard diffusion noising is claimed to induce noised versions whose gradients and derivatives are represented exactly by the model's learned score fields. This self-consistency is used to construct a 'noisy operator bridge' that reads out alchemical ΔF from endpoint ensembles and per-frame evaluations. Experiments on alanine dipeptide show partial recovery of S0 and O when physical inductive biases are included; on the challenging 185L/IND C6-H→C6-F nonbonded perturbation (negligible overlap), the supervised bridge recovers ΔF to ~1 kBT of a 19-state MBAR reference. Endpoint coordinates plus binary labels alone suffice for partial operator recovery without force supervision.

Significance. If the central mapping from diffusion noising to noised operators holds without circularity or hidden dependence on model parameters, the framework would convert black-box generative diffusion models into auditable thermodynamic estimators capable of free-energy calculations even in low-overlap regimes. The numerical agreement on a difficult ligand perturbation is potentially impactful for alchemical free-energy methods, but its value depends on independent verification of the derivation.

major comments (2)

[abstract (framework definition)] Abstract, framework-definition paragraph: the claim that 'standard diffusion noising induces effective noised actions and operators whose gradients and alchemical derivatives are directly represented by the model's learned fields' is asserted without an explicit derivation or equation showing how the noising kernel acts on the perturbation operator O(x) (especially for nonbonded parameter changes). This step is load-bearing for the entire noisy-operator bridge; without it, it is impossible to confirm that the learned score equals ∇(noised S0) and the corresponding alchemical derivative rather than an approximation that introduces uncorrectable bias.
[abstract (185L/IND experiment)] Abstract, 185L/IND result paragraph: the supervised bridge is reported to recover ΔF within ~1 kBT of the 19-state MBAR reference on a case with negligible phase-space overlap, yet no error analysis, variance estimates, or controls for operator-noising mismatch are provided. Because the central claim requires that the learned fields exactly encode the noised alchemical derivative, any deviation in how the nonbonded perturbation is noised would produce a systematic offset not removable by endpoint sampling alone; this must be demonstrated explicitly before the numerical agreement can be taken as support for the framework.

minor comments (2)

[abstract] Notation for the base action S0(x) and operator O(x) is introduced in the abstract but never defined with explicit functional forms or units; a short methods subsection giving the concrete expressions used for the alanine dipeptide and 185L/IND cases would improve reproducibility.
[alanine dipeptide experiments] The phrase 'physical inductive biases' is used to explain partial recovery of S0 and O, but the manuscript does not list which biases were added or how they were implemented in the diffusion training objective.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful reading and constructive comments. We address each major comment below.

read point-by-point responses

Referee: Abstract, framework-definition paragraph: the claim that 'standard diffusion noising induces effective noised actions and operators whose gradients and alchemical derivatives are directly represented by the model's learned fields' is asserted without an explicit derivation or equation showing how the noising kernel acts on the perturbation operator O(x) (especially for nonbonded parameter changes). This step is load-bearing for the entire noisy-operator bridge; without it, it is impossible to confirm that the learned score equals ∇(noised S0) and the corresponding alchemical derivative rather than an approximation that introduces uncorrectable bias.

Authors: We agree that an explicit derivation is required for the central claim. The full manuscript develops the action-operator framework in the Methods, but we will add a dedicated subsection with the step-by-step derivation of how the standard diffusion noising kernel acts on O(x), including the explicit expression for nonbonded alchemical parameter changes. This will confirm that the learned score equals the gradient of the noised action without approximation bias. revision: yes
Referee: Abstract, 185L/IND result paragraph: the supervised bridge is reported to recover ΔF within ~1 kBT of the 19-state MBAR reference on a case with negligible phase-space overlap, yet no error analysis, variance estimates, or controls for operator-noising mismatch are provided. Because the central claim requires that the learned fields exactly encode the noised alchemical derivative, any deviation in how the nonbonded perturbation is noised would produce a systematic offset not removable by endpoint sampling alone; this must be demonstrated explicitly before the numerical agreement can be taken as support for the framework.

Authors: We acknowledge the absence of error analysis and controls in the current version. In revision we will add bootstrap variance estimates on the ΔF readout and a control comparing noised-operator evaluations to direct computation on available frames. For the 185L/IND case we will also state the assumptions of the noising step explicitly. revision: partial

Circularity Check

1 steps flagged

Action-operator framework asserts learned fields represent noised gradients by definitional construction

specific steps

self definitional [Abstract, framework definition paragraph]
"By defining a fixed molecular environment as a base action $S_0(x)$ and an alchemical perturbation as an operator $O(x)$, standard diffusion noising induces effective noised actions and operators whose gradients and alchemical derivatives are directly represented by the model's learned fields. This rigorous self-consistency enables a ``noisy operator bridge'' capable of reading out free-energy differences ($\\Delta F$) from endpoint ensembles and per-frame evaluations."

The framework is introduced by defining S0 and O such that noising induces representations exactly matching the learned fields. The self-consistency and resulting bridge for ΔF readout therefore follow by construction from this definitional mapping rather than from a separate derivation or external thermodynamic principle.

full rationale

The paper's core claim rests on defining S0(x) and O(x) such that diffusion noising directly maps to the model's learned fields representing their gradients and alchemical derivatives. This self-consistency is invoked to justify the noisy operator bridge for ΔF readout. While the abstract presents this as enabling auditable thermodynamics, the equivalence between learned fields and noised operator derivatives is introduced as part of the framework definition rather than derived from independent equations or external benchmarks. The experimental recovery of ΔF to ~1 kBT of MBAR therefore inherits this construction. No self-citations or fitted-input predictions are quoted in the provided text, but the central derivation reduces to the definitional step.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The framework rests on the assumption that diffusion noising maps to effective noised actions and operators; no explicit free parameters or invented entities are named in the abstract, but physical inductive biases are mentioned as enabling recovery.

free parameters (1)

physical inductive biases
Incorporated to enable partial recovery of base action and perturbation operator in alanine dipeptide experiments.

axioms (1)

domain assumption Standard diffusion noising induces effective noised actions and operators whose gradients are represented by the model's learned fields
This premise is invoked to define the noisy operator bridge and enable free-energy readout.

pith-pipeline@v0.9.1-grok · 5788 in / 1329 out tokens · 30435 ms · 2026-07-01T06:50:11.376535+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

31 extracted references · 20 canonical work pages · 2 internal anchors

[1]

A., Maheswaranathan, N

Sohl-Dickstein, J., Weiss, E. A., Maheswaranathan, N. & Ganguli, S. Deep unsupervised learning using nonequilibrium thermodynamics. InProceedings of ICML(2015). URL https://proceedings.mlr.press/v37/sohl-dickstein15.html

2015
[2]

& Abbeel, P

Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic models. InAdvances in Neural Information Processing Systems(2020). URLhttps://papers.nips.cc/paper/ 2020/hash/4c5bcfec8584af0d967f1ab10179ca4b-Abstract.html

2020
[3]

In International Conference on Learning Representations(2021)

Song, Y.et al.Score-based generative modeling through stochastic differential equations. In International Conference on Learning Representations(2021). URLhttps://openreview. net/forum?id=PxTIG12RRHS

2021
[4]

URL https://doi.org/10.1021/acs.jctc.3c00702

Arts, M.et al.Two for one: Diffusion models and force fields for coarse-grained molecular dynamics.Journal of Chemical Theory and Computation19, 6151–6159 (2023). URL https://doi.org/10.1021/acs.jctc.3c00702

work page doi:10.1021/acs.jctc.3c00702 2023
[5]

Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models

Plainer, M., Wu, H., Klein, L., Günnemann, S. & Noé, F. Consistent sampling and sim- ulation: Molecular dynamics with energy-based diffusion models. InAdvances in Neural Information Processing Systems(2025). URLhttps://arxiv.org/abs/2506.17139

work page arXiv 2025
[6]

& Bereau, T

Mate, B., Fleuret, F. & Bereau, T. Neural thermodynamic integration: Free energies from energy-based diffusion models.Journal of Physical Chemistry Letters15, 11395–11404 (2024). URLhttps://doi.org/10.1021/acs.jpclett.4c01958

work page doi:10.1021/acs.jpclett.4c01958 2024
[7]

URLhttps://doi.org/10.1101/2025.11.28.690021

Sarma, S.et al.Can we extract physics-like energies from generative protein diffusion models?bioRxiv(2025). URLhttps://doi.org/10.1101/2025.11.28.690021

work page doi:10.1101/2025.11.28.690021 2025
[8]

Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models

Xie, Y.et al.Enhanced diffusion sampling: Efficient rare event sampling and free energy calculation with diffusion models. arXiv preprint (2026). URLhttps://arxiv.org/abs/ 2602.16634

work page internal anchor Pith review Pith/arXiv arXiv 2026
[9]

Autonomous Emergence of Hamiltonian in Deep Generative Models

Xi, W. & Chen, W.-Q. Autonomous emergence of hamiltonian in deep generative models. arXiv preprint (2026). URLhttps://arxiv.org/abs/2604.20821

work page internal anchor Pith review Pith/arXiv arXiv 2026
[10]

Zwanzig, R. W. High-temperature equation of state by a perturbation method. i. nonpolar gases.Journal of Chemical Physics22, 1420–1426 (1954). URLhttps://doi.org/10. 1063/1.1740409. 14

1954
[11]

Kirkwood, J. G. Statistical mechanics of fluid mixtures.Journal of Chemical Physics3, 300–313 (1935). URLhttps://doi.org/10.1063/1.1749657

work page doi:10.1063/1.1749657 1935
[12]

Bennett, C. H. Efficient estimation of free energy differences from monte carlo data. Journal of Computational Physics22, 245–268 (1976). URLhttps://doi.org/10.1016/ 0021-9991(76)90078-4

1976
[13]

Shirts, M. R. & Chodera, J. D. Statistically optimal analysis of samples from multiple equilibrium states.Journal of Chemical Physics129, 124105 (2008). URLhttps://doi. org/10.1063/1.2978177

work page doi:10.1063/1.2978177 2008
[14]

& Wong, W

Meng, X.-L. & Wong, W. H. Simulating ratios of normalizing constants via a simple identity: A theoretical exploration.Statistica Sinica6, 831–860 (1996). URLhttps://www3.stat. sinica.edu.tw/statistica/j6n4/j6n43/j6n43.htm

1996
[15]

Nonequilibrium equality for free energy differences.Physical Review Letters 78, 2690–2693 (1997)

Jarzynski, C. Nonequilibrium equality for free energy differences.Physical Review Letters 78, 2690–2693 (1997). URLhttps://doi.org/10.1103/PhysRevLett.78.2690

work page doi:10.1103/physrevlett.78.2690 1997
[16]

Crooks, G. E. Entropy production fluctuation theorem and the nonequilibrium work relation for free energy differences.Physical Review E60, 2721–2726 (1999). URLhttps://doi. org/10.1103/PhysRevE.60.2721

work page doi:10.1103/physreve.60.2721 1999
[17]

Targeted free energy perturbation.Physical Review E65, 046122 (2002)

Jarzynski, C. Targeted free energy perturbation.Physical Review E65, 046122 (2002). URLhttps://doi.org/10.1103/PhysRevE.65.046122

work page doi:10.1103/physreve.65.046122 2002
[18]

URLhttps://doi.org/10.1063/5.0018903

Wirnsberger, P.et al.Targeted free energy estimation via learned mappings.Journal of Chemical Physics153, 144112 (2020). URLhttps://doi.org/10.1063/5.0018903

work page doi:10.1063/5.0018903 2020
[19]

& Minh, D

Yoo, S., Kang, L. & Minh, D. D. L. Learned mappings for targeted free energy perturbation between peptide conformations.Journal of Chemical Physics159, 124104 (2023). URL https://doi.org/10.1063/5.0164662

work page doi:10.1063/5.0164662 2023
[20]

Noe, F., Olsson, S., Kohler, J. & Wu, H. Boltzmann generators: Sampling equilibrium states of many-body systems with deep learning.Science365, eaaw1147 (2019). URL https://doi.org/10.1126/science.aaw1147

work page doi:10.1126/science.aaw1147 2019
[21]

& Zhang, B

Ding, X. & Zhang, B. DeepBAR: A fast and exact method for binding free energy com- putation.Journal of Physical Chemistry Letters12, 2509–2515 (2021). URLhttps: //doi.org/10.1021/acs.jpclett.1c00189. 15

work page doi:10.1021/acs.jpclett.1c00189 2021
[22]

& Fleuret, F

Mate, B. & Fleuret, F. Learning interpolations between boltzmann densities.Transac- tions on Machine Learning Research(2023). URLhttps://openreview.net/forum?id= TH6YrEcbth

2023
[23]

& Bereau, T

Mate, B., Fleuret, F. & Bereau, T. Solvation free energies from neural thermodynamic integration.Journal of Chemical Physics162, 124107 (2025). URLhttps://doi.org/10. 1063/5.0251736

2025
[24]

OpenReview preprint (2025)

Erdogan, E.et al.FreeFlow: Latent flow matching for free energy difference estimation. OpenReview preprint (2025). URLhttps://openreview.net/forum?id=D2EdWRWEQo

2025
[25]

InAdvances in Neural Information Processing Systems(2025)

Du, Y.et al.FEAT: Free energy estimators with adaptive transport. InAdvances in Neural Information Processing Systems(2025). URLhttps://openreview.net/forum? id=GQXeLGYMda

2025
[26]

McGibbon, R. T. MD trajectories of ALA2 (fileset). figshare Dataset (2014). URLhttps: //doi.org/10.6084/m9.figshare.1026131.v8

work page doi:10.6084/m9.figshare.1026131.v8 2014
[27]

URLhttps:// doi.org/10.1371/journal.pcbi.1005659

Eastman, P.et al.OpenMM 7: Rapid development of high performance algorithms for molecular dynamics.PLoS Computational Biology13, e1005659 (2017). URLhttps:// doi.org/10.1371/journal.pcbi.1005659

work page doi:10.1371/journal.pcbi.1005659 2017
[28]

PDB ID 185L: Specificity of ligand binding in a buried non- polar cavity of T4 lysozyme: linkage of dynamics and structural plasticity (1995)

RCSB Protein Data Bank. PDB ID 185L: Specificity of ligand binding in a buried non- polar cavity of T4 lysozyme: linkage of dynamics and structural plasticity (1995). URL https://doi.org/10.2210/pdb185L/pdb

work page doi:10.2210/pdb185l/pdb 1995
[29]

& Matthews, B

Morton, A. & Matthews, B. W. Specificity of ligand binding in a buried nonpolar cavity of T4 lysozyme: linkage of dynamics and structural plasticity.Biochemistry34, 8576–8588 (1995). URLhttps://doi.org/10.1021/bi00027a007

work page doi:10.1021/bi00027a007 1995
[30]

L.et al.Escaping atom types in force fields using direct chemical perception

Mobley, D. L.et al.Escaping atom types in force fields using direct chemical perception. Journal of Chemical Theory and Computation14, 6076–6092 (2018). URLhttps://doi. org/10.1021/acs.jctc.8b00640

work page doi:10.1021/acs.jctc.8b00640 2018
[31]

OpenFF force field file openff-2.2.1.offxml (2024)

Open Force Field Initiative. OpenFF force field file openff-2.2.1.offxml (2024). URLhttps://github.com/openforcefield/openff-forcefields/blob/main/ openforcefields/offxml/openff-2.2.1.offxml. 16

2024

[1] [1]

A., Maheswaranathan, N

Sohl-Dickstein, J., Weiss, E. A., Maheswaranathan, N. & Ganguli, S. Deep unsupervised learning using nonequilibrium thermodynamics. InProceedings of ICML(2015). URL https://proceedings.mlr.press/v37/sohl-dickstein15.html

2015

[2] [2]

& Abbeel, P

Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic models. InAdvances in Neural Information Processing Systems(2020). URLhttps://papers.nips.cc/paper/ 2020/hash/4c5bcfec8584af0d967f1ab10179ca4b-Abstract.html

2020

[3] [3]

In International Conference on Learning Representations(2021)

Song, Y.et al.Score-based generative modeling through stochastic differential equations. In International Conference on Learning Representations(2021). URLhttps://openreview. net/forum?id=PxTIG12RRHS

2021

[4] [4]

URL https://doi.org/10.1021/acs.jctc.3c00702

Arts, M.et al.Two for one: Diffusion models and force fields for coarse-grained molecular dynamics.Journal of Chemical Theory and Computation19, 6151–6159 (2023). URL https://doi.org/10.1021/acs.jctc.3c00702

work page doi:10.1021/acs.jctc.3c00702 2023

[5] [5]

Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models

Plainer, M., Wu, H., Klein, L., Günnemann, S. & Noé, F. Consistent sampling and sim- ulation: Molecular dynamics with energy-based diffusion models. InAdvances in Neural Information Processing Systems(2025). URLhttps://arxiv.org/abs/2506.17139

work page arXiv 2025

[6] [6]

& Bereau, T

Mate, B., Fleuret, F. & Bereau, T. Neural thermodynamic integration: Free energies from energy-based diffusion models.Journal of Physical Chemistry Letters15, 11395–11404 (2024). URLhttps://doi.org/10.1021/acs.jpclett.4c01958

work page doi:10.1021/acs.jpclett.4c01958 2024

[7] [7]

URLhttps://doi.org/10.1101/2025.11.28.690021

Sarma, S.et al.Can we extract physics-like energies from generative protein diffusion models?bioRxiv(2025). URLhttps://doi.org/10.1101/2025.11.28.690021

work page doi:10.1101/2025.11.28.690021 2025

[8] [8]

Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models

Xie, Y.et al.Enhanced diffusion sampling: Efficient rare event sampling and free energy calculation with diffusion models. arXiv preprint (2026). URLhttps://arxiv.org/abs/ 2602.16634

work page internal anchor Pith review Pith/arXiv arXiv 2026

[9] [9]

Autonomous Emergence of Hamiltonian in Deep Generative Models

Xi, W. & Chen, W.-Q. Autonomous emergence of hamiltonian in deep generative models. arXiv preprint (2026). URLhttps://arxiv.org/abs/2604.20821

work page internal anchor Pith review Pith/arXiv arXiv 2026

[10] [10]

Zwanzig, R. W. High-temperature equation of state by a perturbation method. i. nonpolar gases.Journal of Chemical Physics22, 1420–1426 (1954). URLhttps://doi.org/10. 1063/1.1740409. 14

1954

[11] [11]

Kirkwood, J. G. Statistical mechanics of fluid mixtures.Journal of Chemical Physics3, 300–313 (1935). URLhttps://doi.org/10.1063/1.1749657

work page doi:10.1063/1.1749657 1935

[12] [12]

Bennett, C. H. Efficient estimation of free energy differences from monte carlo data. Journal of Computational Physics22, 245–268 (1976). URLhttps://doi.org/10.1016/ 0021-9991(76)90078-4

1976

[13] [13]

Shirts, M. R. & Chodera, J. D. Statistically optimal analysis of samples from multiple equilibrium states.Journal of Chemical Physics129, 124105 (2008). URLhttps://doi. org/10.1063/1.2978177

work page doi:10.1063/1.2978177 2008

[14] [14]

& Wong, W

Meng, X.-L. & Wong, W. H. Simulating ratios of normalizing constants via a simple identity: A theoretical exploration.Statistica Sinica6, 831–860 (1996). URLhttps://www3.stat. sinica.edu.tw/statistica/j6n4/j6n43/j6n43.htm

1996

[15] [15]

Nonequilibrium equality for free energy differences.Physical Review Letters 78, 2690–2693 (1997)

Jarzynski, C. Nonequilibrium equality for free energy differences.Physical Review Letters 78, 2690–2693 (1997). URLhttps://doi.org/10.1103/PhysRevLett.78.2690

work page doi:10.1103/physrevlett.78.2690 1997

[16] [16]

Crooks, G. E. Entropy production fluctuation theorem and the nonequilibrium work relation for free energy differences.Physical Review E60, 2721–2726 (1999). URLhttps://doi. org/10.1103/PhysRevE.60.2721

work page doi:10.1103/physreve.60.2721 1999

[17] [17]

Targeted free energy perturbation.Physical Review E65, 046122 (2002)

Jarzynski, C. Targeted free energy perturbation.Physical Review E65, 046122 (2002). URLhttps://doi.org/10.1103/PhysRevE.65.046122

work page doi:10.1103/physreve.65.046122 2002

[18] [18]

URLhttps://doi.org/10.1063/5.0018903

Wirnsberger, P.et al.Targeted free energy estimation via learned mappings.Journal of Chemical Physics153, 144112 (2020). URLhttps://doi.org/10.1063/5.0018903

work page doi:10.1063/5.0018903 2020

[19] [19]

& Minh, D

Yoo, S., Kang, L. & Minh, D. D. L. Learned mappings for targeted free energy perturbation between peptide conformations.Journal of Chemical Physics159, 124104 (2023). URL https://doi.org/10.1063/5.0164662

work page doi:10.1063/5.0164662 2023

[20] [20]

Noe, F., Olsson, S., Kohler, J. & Wu, H. Boltzmann generators: Sampling equilibrium states of many-body systems with deep learning.Science365, eaaw1147 (2019). URL https://doi.org/10.1126/science.aaw1147

work page doi:10.1126/science.aaw1147 2019

[21] [21]

& Zhang, B

Ding, X. & Zhang, B. DeepBAR: A fast and exact method for binding free energy com- putation.Journal of Physical Chemistry Letters12, 2509–2515 (2021). URLhttps: //doi.org/10.1021/acs.jpclett.1c00189. 15

work page doi:10.1021/acs.jpclett.1c00189 2021

[22] [22]

& Fleuret, F

Mate, B. & Fleuret, F. Learning interpolations between boltzmann densities.Transac- tions on Machine Learning Research(2023). URLhttps://openreview.net/forum?id= TH6YrEcbth

2023

[23] [23]

& Bereau, T

Mate, B., Fleuret, F. & Bereau, T. Solvation free energies from neural thermodynamic integration.Journal of Chemical Physics162, 124107 (2025). URLhttps://doi.org/10. 1063/5.0251736

2025

[24] [24]

OpenReview preprint (2025)

Erdogan, E.et al.FreeFlow: Latent flow matching for free energy difference estimation. OpenReview preprint (2025). URLhttps://openreview.net/forum?id=D2EdWRWEQo

2025

[25] [25]

InAdvances in Neural Information Processing Systems(2025)

Du, Y.et al.FEAT: Free energy estimators with adaptive transport. InAdvances in Neural Information Processing Systems(2025). URLhttps://openreview.net/forum? id=GQXeLGYMda

2025

[26] [26]

McGibbon, R. T. MD trajectories of ALA2 (fileset). figshare Dataset (2014). URLhttps: //doi.org/10.6084/m9.figshare.1026131.v8

work page doi:10.6084/m9.figshare.1026131.v8 2014

[27] [27]

URLhttps:// doi.org/10.1371/journal.pcbi.1005659

Eastman, P.et al.OpenMM 7: Rapid development of high performance algorithms for molecular dynamics.PLoS Computational Biology13, e1005659 (2017). URLhttps:// doi.org/10.1371/journal.pcbi.1005659

work page doi:10.1371/journal.pcbi.1005659 2017

[28] [28]

PDB ID 185L: Specificity of ligand binding in a buried non- polar cavity of T4 lysozyme: linkage of dynamics and structural plasticity (1995)

RCSB Protein Data Bank. PDB ID 185L: Specificity of ligand binding in a buried non- polar cavity of T4 lysozyme: linkage of dynamics and structural plasticity (1995). URL https://doi.org/10.2210/pdb185L/pdb

work page doi:10.2210/pdb185l/pdb 1995

[29] [29]

& Matthews, B

Morton, A. & Matthews, B. W. Specificity of ligand binding in a buried nonpolar cavity of T4 lysozyme: linkage of dynamics and structural plasticity.Biochemistry34, 8576–8588 (1995). URLhttps://doi.org/10.1021/bi00027a007

work page doi:10.1021/bi00027a007 1995

[30] [30]

L.et al.Escaping atom types in force fields using direct chemical perception

Mobley, D. L.et al.Escaping atom types in force fields using direct chemical perception. Journal of Chemical Theory and Computation14, 6076–6092 (2018). URLhttps://doi. org/10.1021/acs.jctc.8b00640

work page doi:10.1021/acs.jctc.8b00640 2018

[31] [31]

OpenFF force field file openff-2.2.1.offxml (2024)

Open Force Field Initiative. OpenFF force field file openff-2.2.1.offxml (2024). URLhttps://github.com/openforcefield/openff-forcefields/blob/main/ openforcefields/offxml/openff-2.2.1.offxml. 16

2024