Structure-Aware Variational Learning of a Class of Generalized Diffusions

Chun Liu; Qi Tang; Xiaofan Li; Yiwei Wang; Yubin Lu

arxiv: 2604.20188 · v1 · submitted 2026-04-22 · 💻 cs.LG · math.DS

Structure-Aware Variational Learning of a Class of Generalized Diffusions

Yubin Lu , Xiaofan Li , Chun Liu , Qi Tang , Yiwei Wang This is my paper

Pith reviewed 2026-05-10 00:10 UTC · model grok-4.3

classification 💻 cs.LG math.DS

keywords variational learningenergy-based modelsgeneralized diffusionsFokker-Planck equationDe Giorgi dissipationpotential function inferencestochastic processesstructure-aware learning

0 comments

The pith

An energy-based variational loss from the Fokker-Planck energy-dissipation law infers unknown potentials in generalized diffusions without direct PDE enforcement.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a learning method for the potential energy driving stochastic diffusion processes when data is incomplete and noisy. Classical methods that regress equations or velocities directly often falter under these conditions. By grounding the loss in the energy-dissipation structure rather than the PDE itself, the approach preserves the system's variational properties. Experiments across one to three dimensions show the resulting loss remains stable as observation times vary, noise increases, or training data becomes sparser or less diverse.

Core claim

Starting from the energy-dissipation law of the Fokker-Planck equation, the authors construct loss functions using the De Giorgi dissipation functional. These losses couple the free energy and the dissipation mechanism without explicitly enforcing the governing PDE, allowing structure-aware inference of the unknown potential function in generalized diffusion processes.

What carries the argument

The De Giorgi dissipation functional built from the energy-dissipation law associated with the Fokker-Planck equation, which acts as the variational loss that couples free energy and dissipation for learning the potential.

If this is right

The method recovers potentials accurately even with limited or noisy trajectory data.
Robustness improves across varying observation times and noise levels in 1D, 2D, and 3D settings.
The variational structure is preserved, enabling consistent coupling of energy and dissipation without PDE enforcement.
Performance holds with diverse and reduced amounts of training data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could extend to learning in other variational systems where energy-dissipation laws are known but full PDE solutions are intractable.
If the potential is learned this way, downstream simulations of the diffusion process should match observed statistics more reliably than direct regression methods.
Testing on real experimental data from chemistry or biology would reveal whether the robustness observed in numerics translates to physical systems.

Load-bearing premise

The energy-dissipation law from the Fokker-Planck equation and the De Giorgi functional can be turned into a loss that recovers the true potential without needing to enforce the PDE directly or have complete observations.

What would settle it

Generate synthetic trajectories from a known potential in a generalized diffusion, add high noise and remove some observations, then check if minimizing the proposed loss recovers a potential whose simulated trajectories match the original statistics; failure to do so would falsify the robustness claim.

Figures

Figures reproduced from arXiv: 2604.20188 by Chun Liu, Qi Tang, Xiaofan Li, Yiwei Wang, Yubin Lu.

**Figure 1.** Figure 1: Learning the potential ψ(x) = 1 4 x 4 − 1 2 x 2 using the loss function (24) for different values of the weighting parameter α. As shown in [PITH_FULL_IMAGE:figures/full_fig_p013_1.png] view at source ↗

**Figure 2.** Figure 2: Learning the quadruple-well potential ψ(x, y) = 1 4 x 4 − 1 2 x 2 + 1 4 y 4 − 1 2 y 2 using training data collected over different time intervals [Tb, Te]. Panel (a) corresponds to α = 1, and panel (b) corresponds to α = 0.5. In each panel, the subplots are arranged from left to right and top to bottom, showing the ground-truth potential and the learned potentials obtained using data from the time interval… view at source ↗

**Figure 3.** Figure 3: Learning the quadruple-well potential ψ(x, y) = 1 4 x 4− 1 2 x 2+ 1 4 y 4− 1 2 y 2 . Panel (a) shows the impact of the number of initial distributions Q for α = 1, while panel (b) illustrates the robustness of the learning results with respect to different noise levels for α = 0.5. Impact of Observational Noise In this experiment, we examine the robustness of the De Giorgi dissipation functional–based loss… view at source ↗

**Figure 4.** Figure 4: compares the learned potential using the loss function (24) for different values of the weighting parameter α = 0.25, 0.5, 0.75. Both α = 0.5 and α = 0.25 are better than α = 0.75. Moreover, when examining the contour details, α = 0.5 also shows some improvement over α = 0.25. Overall, α = 0.5 appears to be the optimal choice. − [PITH_FULL_IMAGE:figures/full_fig_p016_4.png] view at source ↗

**Figure 5.** Figure 5: Comparison between the learned potential and the true potential given by Eq. (26). − x − [PITH_FULL_IMAGE:figures/full_fig_p018_5.png] view at source ↗

**Figure 6.** Figure 6: Evolution of particle trajectories within the learned two-dimensional potential. The red solid points represent the particle positions at four distinct time points, simulated using the stochastic differential equation given in Eq. (1), with the dynamics driven by the learned potential. where (q1, q2, q3) = (x, y, z). The parameters are chosen as (c1, c2, c3) = (0.2, 0.3, 0.3), (A1, A2, A3) = (−2.0, −2.0, −… view at source ↗

**Figure 7.** Figure 7: Learning the three-dimensional potential (27) with observational noise level σ = 0.2. The three columns correspond to the density plot of the potential function in the z = 0, y = 0, and x = 0 coordinate planes respectively. The first row shows the ground-truth potential, and the second row shows the learned potential. 0 t 0 0 0 00 t [PITH_FULL_IMAGE:figures/full_fig_p020_7.png] view at source ↗

**Figure 8.** Figure 8: Learning the three-dimensional potential (27) with observational noise level σ = 0.2. 20 [PITH_FULL_IMAGE:figures/full_fig_p020_8.png] view at source ↗

**Figure 9.** Figure 9: presents the learning results obtained using the loss functions (24) with α = 0.5 and (25). It is evident that, under the presence of the environmental velocity perturbation (28), the energy-based loss (24) yields significantly more accurate reconstructions of the target potential than the PDE-based loss (25). This behavior can be explained by the structural difference between the two approaches. The PDE-b… view at source ↗

read the original abstract

Learning the underlying potential energy of stochastic gradient systems from partial and noisy observations is a fundamental problem arising in physics, chemistry, and data-driven modeling. Classical approaches often rely on direct regression of governing equations or velocity fields, which can be sensitive to noise and external perturbations and may fail when observations are incomplete. In this work, we propose a structure-aware, energy-based learning framework for inferring unknown potential functions in generalized diffusion processes, grounded in the energetic variational approach. Starting from the energy-dissipation law associated with the Fokker-Planck equation, we construct loss functions based on the De Giorgi dissipation functional, which consistently couple the free energy and the dissipation mechanism of the system. This formulation avoids explicit enforcement of the governing partial differential equation and preserves the underlying variational structure of the dynamics. Through numerical experiments in one, two, and three dimensions, we demonstrate that the proposed energy-based loss exhibits enhanced robustness with respect to observation time, noise level, and the diversity and amount of available training data. These results highlight the effectiveness of energy-dissipation principles as a reliable foundation for learning stochastic diffusion dynamics from data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper builds a loss from the De Giorgi functional on the energy-dissipation law to learn potentials in generalized diffusions without direct PDE enforcement, but the reported robustness rests on experiments whose details and controls are not shown.

read the letter

The main point is that the authors take the established energy-dissipation structure of the Fokker-Planck equation and turn the De Giorgi dissipation functional into a loss that couples free energy and dissipation for inferring unknown potentials from partial noisy data. This is presented as a structure-aware alternative to direct regression of velocities or equations, which often breaks under noise or incomplete observations. The claim is that the variational route gives better robustness across time, noise level, and data amount in 1D, 2D, and 3D tests. That construction is the actual novelty here; it does not collapse to prior regression baselines cited in the abstract. The grounding in the energetic variational approach is clean and keeps the underlying physics intact without forcing the PDE at every step. That part is done well and gives the method a coherent foundation for stochastic systems in physics or chemistry modeling. The experiments are described as demonstrating enhanced robustness, yet the abstract supplies no numbers, baselines, error bars, or data-generation rules, so the strength of that evidence cannot be checked from the given text. The stress-test concern about possible non-uniqueness also lands: with only partial trajectories the loss may allow multiple potentials to produce similar dissipation balances, and nothing in the summary shows checks for identifiability or recovery error under controlled perturbations. If those checks are absent from the full paper, the robustness claim stays unverified. This work is for readers already working on structure-preserving or energy-based methods for learning stochastic dynamics. Someone looking for a variational route to inverse problems in diffusions can extract the loss construction and try it, but they will still need to add their own validation. It deserves a serious referee because the core variational idea is internally consistent and draws on established principles, even though the empirical side requires more concrete support before the robustness advantage can be accepted.

Referee Report

2 major / 2 minor

Summary. The paper proposes a structure-aware variational learning framework for inferring unknown potential functions in generalized diffusion processes from partial and noisy observations. Grounded in the energetic variational approach, it starts from the energy-dissipation law of the Fokker-Planck equation and constructs loss functions via the De Giorgi dissipation functional to couple free energy and dissipation without explicit PDE enforcement. Numerical experiments in one, two, and three dimensions are used to claim enhanced robustness with respect to observation time, noise level, and training data diversity and volume.

Significance. If the robustness claims are substantiated with proper quantitative validation, the work would provide a principled, structure-preserving alternative to direct regression methods for learning stochastic dynamics, leveraging established energy-dissipation principles and De Giorgi functionals. This could be valuable for physics-informed machine learning in inverse problems involving diffusions. The approach avoids explicit PDE enforcement while retaining variational structure, which is a positive aspect, but the current lack of metrics limits assessment of its practical advantage.

major comments (2)

[Abstract and numerical experiments] Abstract and numerical experiments section: The claim that 'numerical experiments in one, two, and three dimensions... demonstrate that the proposed energy-based loss exhibits enhanced robustness' is load-bearing for the central contribution, yet the text supplies no quantitative metrics, baseline comparisons, error bars, data-generation protocols, or exclusion rules. This prevents independent verification of the data-to-claim link.
[Method (energy-dissipation construction)] Method construction (energy-dissipation law and De Giorgi functional): The loss may admit multiple potentials consistent with partial noisy observations without strict convexity or injectivity guarantees; different potentials could produce similar energy-dissipation balances on observed trajectories. No analysis of loss-landscape flatness, identifiability, or controlled recovery error under perturbations of the true potential is provided, which is required to support the inference claim.

minor comments (2)

[Abstract] The abstract would be clearer if it briefly indicated the explicit form of the constructed loss function or referenced the key variational equations.
[References] Ensure citations to prior literature on energetic variational approaches and applications of De Giorgi functionals to learning problems are complete and up-to-date.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed review of our manuscript. We address each major comment point by point below, indicating the revisions we will undertake to strengthen the presentation and support for our claims.

read point-by-point responses

Referee: [Abstract and numerical experiments] Abstract and numerical experiments section: The claim that 'numerical experiments in one, two, and three dimensions... demonstrate that the proposed energy-based loss exhibits enhanced robustness' is load-bearing for the central contribution, yet the text supplies no quantitative metrics, baseline comparisons, error bars, data-generation protocols, or exclusion rules. This prevents independent verification of the data-to-claim link.

Authors: We agree that the numerical experiments section requires more rigorous quantitative support to substantiate the robustness claims. In the revised manuscript, we will expand this section to include explicit quantitative metrics (e.g., mean squared error in recovered potentials with standard deviations over repeated trials), direct comparisons against baselines such as least-squares drift regression and physics-informed neural network methods, full descriptions of data-generation protocols (including ranges for observation times, noise variances, and training set sizes), and any data exclusion or preprocessing rules applied. These changes will enable independent verification and clearer evaluation of the method's advantages. revision: yes
Referee: [Method (energy-dissipation construction)] Method construction (energy-dissipation law and De Giorgi functional): The loss may admit multiple potentials consistent with partial noisy observations without strict convexity or injectivity guarantees; different potentials could produce similar energy-dissipation balances on observed trajectories. No analysis of loss-landscape flatness, identifiability, or controlled recovery error under perturbations of the true potential is provided, which is required to support the inference claim.

Authors: We acknowledge that the inverse problem is inherently ill-posed under partial and noisy observations, and that the loss may not guarantee unique recovery without additional assumptions. The De Giorgi-based construction is motivated by preserving the energy-dissipation structure rather than enforcing uniqueness a priori. In the revision, we will add a dedicated discussion of identifiability conditions (e.g., sufficient state-space coverage) together with new numerical experiments that report controlled recovery errors under perturbations of the ground-truth potential and basic diagnostics of loss-landscape behavior. While a full theoretical proof of strict convexity lies beyond the current scope, these empirical and conditional analyses will better support the inference claims. revision: partial

Circularity Check

0 steps flagged

No significant circularity; construction applies established variational principles

full rationale

The paper starts from the established energy-dissipation law of the Fokker-Planck equation and applies the De Giorgi dissipation functional to build an energy-based loss. This is a direct application of known structure from the energetic variational approach rather than any self-definitional loop, fitted input renamed as prediction, or load-bearing self-citation chain. No equations reduce the claimed robustness to tautology or prior author-specific uniqueness results. The numerical experiments in 1D/2D/3D serve as external validation of the loss properties and do not close a circular derivation.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The framework rests on the standard energetic variational approach and the energy-dissipation law of the Fokker-Planck equation; no free parameters, new entities, or ad-hoc axioms are introduced in the abstract.

axioms (1)

domain assumption Generalized diffusion processes obey the energy-dissipation law associated with the Fokker-Planck equation.
The method starts from this law to construct the loss functions, as stated in the abstract.

pith-pipeline@v0.9.0 · 5502 in / 1360 out tokens · 54734 ms · 2026-05-10T00:10:40.775149+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

49 extracted references · 49 canonical work pages

[1]

Ambrosio, N

L. Ambrosio, N. Gigli, and G. Savaré. Gradient ﬂows: in metric spaces and in the space of probability measures. Springer, 2005

work page 2005
[2]

V. I. Arnol’d. Mathematical methods of classical mechanics , volume 60. Springer Science & Business Media, 2013

work page 2013
[3]

Batlle, Y

P. Batlle, Y. Chen, B. Hosseini, H. Owhadi, and A. M. Stuart. Error analysis of kernel/GP methods for nonlinear and parametric PDEs. Journal of Computational Physics , 520:113488, 2025

work page 2025
[4]

X. Chen, L. Yang, J. Duan, and G. E. Karniadakis. Solving i nverse stochastic problems from discrete particle observations using the Fokker–Planck eq uation and physics-informed neural networks. SIAM Journal on Scientiﬁc Computing , 43(3):B811–B830, 2021

work page 2021
[5]

Y. Chen, B. Hosseini, H. Owhadi, and A. M. Stuart. Solving a nd learning nonlinear PDEs with Gaussian processes. Journal of Computational Physics , 447:110668, 2021

work page 2021
[6]

De Giorgi, A

E. De Giorgi, A. Marino, and M. Tosques. Problems of evolu tion in metric spaces and maximal decreasing curve. Atti Accad. Naz. Lincei Rend. Cl. Sci. Fis. Mat. Natur.(8) , 68(3):180–187, 1980. 22

work page 1980
[7]

De Ryck, S

T. De Ryck, S. Mishra, and R. Molinaro. Weak physics infor med neural networks for approxi- mating entropy solutions of hyperbolic conservation laws. In Seminar für Angewandte Mathe- matik, Eidgenössische Technische Hochschule, Zürich, Swi tzerland, Rep, volume 35, page 2022, 2022

work page 2022
[8]

Dietrich, A

F. Dietrich, A. Makeev, G. Kevrekidis, N. Evangelou, T. Be rtalan, S. Reich, and I. Kevrekidis. Learning eﬀective stochastic diﬀerential equations from m icroscopic simulations: Linking stochastic numerics to deep learning. Chaos: An Interdisciplinary Journal of Nonlinear Science , 33(2):023121, 2023

work page 2023
[9]

M. Doi. Onsager’s variational principle in soft matter. J. Phys.: Condens. Matter , 23(28):284118, 2011

work page 2011
[10]

W. E, C. Ma, and L. Wu. Machine learning from a continuous viewpoint, i. Science China Mathematics, pages 1–34, 2020

work page 2020
[11]

Eisenberg, C

B. Eisenberg, C. Liu, and Y. Wang. On variational princip les for polarization responses in electromechanical systems. Communications in Mathematical Sciences , 20(6), 2022

work page 2022
[12]

J. L. Ericksen. Introduction to the Thermodynamics of Solids . Applied Mathematical Sciences. Springer-Verlag, 1998

work page 1998
[13]

Flamary, N

R. Flamary, N. Courty, A. Gramfort, M. Z. Alaya, A. Boisbu non, S. Chambon, L. Chapel, A. Corenﬂos, K. Fatras, N. Fournier, L. Gautheron, N. T. Gayr aud, H. Janati, A. Rakotoma- monjy, I. Redko, A. Rolet, A. Schutz, V. Seguy, D. J. Sutherla nd, R. Tavenard, A. Tong, and T. Vayer. Pot: Python optimal transport. Journal of Machine Learning Research , 22(78)...

work page 2021
[14]

Flamary, C

R. Flamary, C. Vincent-Cuaz, N. Courty, A. Gramfort, O. Kachaiev, H. Quang Tran, L. David, C. Bonet, N. Cassereau, T. Gnassounou, E. Tanguy, J. Delon, A. Collas, S. Mazelet, L. Chapel, T. Kerdoncuﬀ, X. Yu, M. Feickert, P. Krzakala, T. Liu, and E. F ernandes Montesuma. Pot python optimal transport (version 0.9.5), 2024

work page 2024
[15]

H. Gao, M. J. Zahr, and J.-X. Wang. Physics-informed gra ph neural Galerkin networks: A uniﬁed framework for solving PDE-governed forward and inve rse problems. Computer Methods in Applied Mechanics and Engineering , 390:114502, 2022

work page 2022
[16]

Y. Gao, Q. Lang, and L. Fei. Self-test loss functions for learning weak-form operators and gradient ﬂows. arXiv preprint arXiv:2412.03506 , 2024

work page arXiv 2024
[17]

M.-H. Giga, A. Kirshtein, and C. Liu. Variational model ing and complex ﬂuids. Handbook of Mathematical Analysis in Mechanics of Viscous Fluids , pages 1–41, 2017

work page 2017
[18]

Grmela and H

M. Grmela and H. C. Öttinger. Dynamics and thermodynami cs of complex ﬂuids. i. develop- ment of a general formalism. Phys. Rev. E , 56(6):6620, 1997

work page 1997
[19]

Gruber, K

A. Gruber, K. Lee, H. Lim, N. Park, and N. Trask. Eﬃcientl y parameterized neural metriplectic systems. In The Thirteenth International Conference on Learning Repre sentations, 2025

work page 2025
[20]

Z. Hu, C. Liu, Y. Wang, and Z. Xu. Energetic variational n eural network discretizations of gradient ﬂows. SIAM Journal on Scientiﬁc Computing , 46(4):A2528–A2556, 2024. 23

work page 2024
[21]

Huang, Z

S. Huang, Z. He, and C. Reina. Variational Onsager neura l networks (VONNs): A thermodynamics-based variational learning strategy for n on-equilibrium PDEs. Journal of the Mechanics and Physics of Solids , 163:104856, 2022

work page 2022
[22]

Jiang, W

Y. Jiang, W. Yang, Y. Zhu, and L. Hong. Entropy structure informed learning for solving inverse problems of diﬀerential equations. Chaos, Solitons & Fractals , 175:114057, 2023

work page 2023
[23]

G. E. Karniadakis, I. G. Kevrekidis, L. Lu, P. Perdikari s, S. Wang, and L. Yang. Physics- informed machine learning. Nature Reviews Physics , 3(6):422–440, 2021

work page 2021
[24]

Kharazmi, Z

E. Kharazmi, Z. Zhang, and G. Karniadakis. hp-VPINNs: V ariational physics-informed neural networks with domain decomposition. Computer Methods in Applied Mechanics and Engineer- ing, 374:113547, 2021

work page 2021
[25]

R. Kubo. The ﬂuctuation-dissipation theorem. Reports on progress in physics , 29(1):255, 1966

work page 1966
[26]

K. Lee, N. Trask, and P. Stinis. Machine learning struct ure preserving brackets for forecasting irreversible processes. In Advances in Neural Information Processing Systems , 2021

work page 2021
[27]

W. Li, M. Z. Bazant, and J. Zhu. Phase-ﬁeld deeponet: Phys ics-informed deep operator neural network for fast simulations of pattern formation governed by gradient ﬂows of free-energy functionals. Computer Methods in Applied Mechanics and Engineering , 416:116299, 2023

work page 2023
[28]

Liero, A

M. Liero, A. Mielke, M. A. Peletier, and D. M. Renger. On m icroscopic origins of generalized gradient structures. Discrete and Continuous Dynamical Systems-S , 10(1):1–35, 2017

work page 2017
[29]

C. Liu. An introduction of elastic complex ﬂuids: an ene rgetic variational approach. In Multi- Scale Phenomena in Complex Fluids: Modeling, Analysis and N umerical Simulation , pages 286–337. World Scientiﬁc, 2009

work page 2009
[30]

Y. Lu, X. Li, C. Liu, Q. Tang, and Y. Wang. Learning genera lized diﬀusions using an energetic variational approach. arXiv preprint arXiv:2412.04480 , 2025

work page arXiv 2025
[31]

S. Ma, S. Liu, H. Zha, and H. Zhou. Learning stochastic be haviour from aggregate data. In M. Meila and T. Zhang, editors, Proceedings of the 38th International Conference on Machin e Learning, volume 139 of Proceedings of Machine Learning Research , pages 7258–7267. PMLR, 18–24 Jul 2021

work page 2021
[32]

Messenger and D

D. Messenger and D. Bortz. Weak SINDy for partial diﬀeren tial equations. Journal of Com- putational Physics , 443:110525, 2021

work page 2021
[33]

Messenger and D

D. Messenger and D. Bortz. Weak SINDy: Galerkin-based da ta-driven model selection. Mul- tiscale Modeling & Simulation , 19(3):1474–1497, 2021

work page 2021
[34]

Messenger and D

D. Messenger and D. Bortz. Learning mean-ﬁeld equations from particle data using WSINDy. Physica D: Nonlinear Phenomena , 439:133406, 2022

work page 2022
[35]

L. Onsager. Reciprocal relations in irreversible proc esses. I. Physical Review, 37:405–426, 1931

work page 1931
[36]

L. Onsager. Reciprocal relations in irreversible proc esses. II. Physical Review, 38:2265–2279, 1931

work page 1931
[37]

M. Opper. Variational inference for stochastic diﬀere ntial equations. Annalen der Physik , 531(3):1800233, 2019. 24

work page 2019
[38]

H. C. Öttinger and M. Grmela. Dynamics and thermodynami cs of complex ﬂuids. ii. illustra- tions of a general formalism. Phys. Rev. E , 56(6):6633, 1997

work page 1997
[39]

M. A. Peletier. Variational modelling: Energies, grad ient ﬂows, and large deviations. arXiv preprint arXiv:1402.1990 , 2014

work page arXiv 1990
[40]

Raissi, P

M. Raissi, P. Perdikaris, and G. Karniadakis. Physics- informed neural networks: A deep learn- ing framework for solving forward and inverse problems invo lving nonlinear partial diﬀerential equations. Journal of Computational Physics , 378:686–707, 2019

work page 2019
[41]

Schaeﬀer

H. Schaeﬀer. Learning partial diﬀerential equations v ia data discovery and sparse optimiza- tion. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences , 473(2197):20160446, 2017

work page 2017
[42]

Sirignano and K

J. Sirignano and K. Spiliopoulos. Dgm: A deep learning a lgorithm for solving partial diﬀerential equations. Journal of computational physics , 375:1339–1364, 2018

work page 2018
[43]

J. W. Strutt. Some general theorems relating to vibrati ons. Proceedings of the London Mathe- matical Society, s1-4(1):357–368, 1871

work page
[44]

Wang and C

Y. Wang and C. Liu. Some recent advances in energetic var iational approaches. Entropy, 24(5):721, 2022

work page 2022
[45]

Y. Wang, C. Liu, P. Liu, and B. Eisenberg. Field theory of r eaction-diﬀusion: Law of mass action with an energetic variational approach. Physical Review E , 102(6):062147, 2020

work page 2020
[46]

Wang, T.-F

Y. Wang, T.-F. Zhang, and C. Liu. A two species micro–mac ro model of wormlike micellar solutions and its maximum entropy closure approximations: An energetic variational approach. Journal of Non-Newtonian Fluid Mechanics , 293:104559, 2021

work page 2021
[47]

Wu and Z

J. Wu and Z. Li. Density-functional theory for complex ﬂ uids. Annu. Rev. Phys. Chem. , 58(1):85–112, 2007

work page 2007
[48]

Z. Xu, D. Long, Y. Xu, G. Yang, S. Zhe, and H. Owhadi. Towar d eﬃcient kernel-based solvers for nonlinear PDEs. In Forty-second International Conference on Machine Learnin g, 2025

work page 2025
[49]

Zhang, Y

Z. Zhang, Y. Shin, and G. Em Karniadakis. GFINNs: GENERI C formalism informed neural networks for deterministic and stochastic dynamical syste ms. Philosophical Transactions of the Royal Society A , 380(2229):20210207, 2022. 25

work page 2022

[1] [1]

Ambrosio, N

L. Ambrosio, N. Gigli, and G. Savaré. Gradient ﬂows: in metric spaces and in the space of probability measures. Springer, 2005

work page 2005

[2] [2]

V. I. Arnol’d. Mathematical methods of classical mechanics , volume 60. Springer Science & Business Media, 2013

work page 2013

[3] [3]

Batlle, Y

P. Batlle, Y. Chen, B. Hosseini, H. Owhadi, and A. M. Stuart. Error analysis of kernel/GP methods for nonlinear and parametric PDEs. Journal of Computational Physics , 520:113488, 2025

work page 2025

[4] [4]

X. Chen, L. Yang, J. Duan, and G. E. Karniadakis. Solving i nverse stochastic problems from discrete particle observations using the Fokker–Planck eq uation and physics-informed neural networks. SIAM Journal on Scientiﬁc Computing , 43(3):B811–B830, 2021

work page 2021

[5] [5]

Y. Chen, B. Hosseini, H. Owhadi, and A. M. Stuart. Solving a nd learning nonlinear PDEs with Gaussian processes. Journal of Computational Physics , 447:110668, 2021

work page 2021

[6] [6]

De Giorgi, A

E. De Giorgi, A. Marino, and M. Tosques. Problems of evolu tion in metric spaces and maximal decreasing curve. Atti Accad. Naz. Lincei Rend. Cl. Sci. Fis. Mat. Natur.(8) , 68(3):180–187, 1980. 22

work page 1980

[7] [7]

De Ryck, S

T. De Ryck, S. Mishra, and R. Molinaro. Weak physics infor med neural networks for approxi- mating entropy solutions of hyperbolic conservation laws. In Seminar für Angewandte Mathe- matik, Eidgenössische Technische Hochschule, Zürich, Swi tzerland, Rep, volume 35, page 2022, 2022

work page 2022

[8] [8]

Dietrich, A

F. Dietrich, A. Makeev, G. Kevrekidis, N. Evangelou, T. Be rtalan, S. Reich, and I. Kevrekidis. Learning eﬀective stochastic diﬀerential equations from m icroscopic simulations: Linking stochastic numerics to deep learning. Chaos: An Interdisciplinary Journal of Nonlinear Science , 33(2):023121, 2023

work page 2023

[9] [9]

M. Doi. Onsager’s variational principle in soft matter. J. Phys.: Condens. Matter , 23(28):284118, 2011

work page 2011

[10] [10]

W. E, C. Ma, and L. Wu. Machine learning from a continuous viewpoint, i. Science China Mathematics, pages 1–34, 2020

work page 2020

[11] [11]

Eisenberg, C

B. Eisenberg, C. Liu, and Y. Wang. On variational princip les for polarization responses in electromechanical systems. Communications in Mathematical Sciences , 20(6), 2022

work page 2022

[12] [12]

J. L. Ericksen. Introduction to the Thermodynamics of Solids . Applied Mathematical Sciences. Springer-Verlag, 1998

work page 1998

[13] [13]

Flamary, N

R. Flamary, N. Courty, A. Gramfort, M. Z. Alaya, A. Boisbu non, S. Chambon, L. Chapel, A. Corenﬂos, K. Fatras, N. Fournier, L. Gautheron, N. T. Gayr aud, H. Janati, A. Rakotoma- monjy, I. Redko, A. Rolet, A. Schutz, V. Seguy, D. J. Sutherla nd, R. Tavenard, A. Tong, and T. Vayer. Pot: Python optimal transport. Journal of Machine Learning Research , 22(78)...

work page 2021

[14] [14]

Flamary, C

R. Flamary, C. Vincent-Cuaz, N. Courty, A. Gramfort, O. Kachaiev, H. Quang Tran, L. David, C. Bonet, N. Cassereau, T. Gnassounou, E. Tanguy, J. Delon, A. Collas, S. Mazelet, L. Chapel, T. Kerdoncuﬀ, X. Yu, M. Feickert, P. Krzakala, T. Liu, and E. F ernandes Montesuma. Pot python optimal transport (version 0.9.5), 2024

work page 2024

[15] [15]

H. Gao, M. J. Zahr, and J.-X. Wang. Physics-informed gra ph neural Galerkin networks: A uniﬁed framework for solving PDE-governed forward and inve rse problems. Computer Methods in Applied Mechanics and Engineering , 390:114502, 2022

work page 2022

[16] [16]

Y. Gao, Q. Lang, and L. Fei. Self-test loss functions for learning weak-form operators and gradient ﬂows. arXiv preprint arXiv:2412.03506 , 2024

work page arXiv 2024

[17] [17]

M.-H. Giga, A. Kirshtein, and C. Liu. Variational model ing and complex ﬂuids. Handbook of Mathematical Analysis in Mechanics of Viscous Fluids , pages 1–41, 2017

work page 2017

[18] [18]

Grmela and H

M. Grmela and H. C. Öttinger. Dynamics and thermodynami cs of complex ﬂuids. i. develop- ment of a general formalism. Phys. Rev. E , 56(6):6620, 1997

work page 1997

[19] [19]

Gruber, K

A. Gruber, K. Lee, H. Lim, N. Park, and N. Trask. Eﬃcientl y parameterized neural metriplectic systems. In The Thirteenth International Conference on Learning Repre sentations, 2025

work page 2025

[20] [20]

Z. Hu, C. Liu, Y. Wang, and Z. Xu. Energetic variational n eural network discretizations of gradient ﬂows. SIAM Journal on Scientiﬁc Computing , 46(4):A2528–A2556, 2024. 23

work page 2024

[21] [21]

Huang, Z

S. Huang, Z. He, and C. Reina. Variational Onsager neura l networks (VONNs): A thermodynamics-based variational learning strategy for n on-equilibrium PDEs. Journal of the Mechanics and Physics of Solids , 163:104856, 2022

work page 2022

[22] [22]

Jiang, W

Y. Jiang, W. Yang, Y. Zhu, and L. Hong. Entropy structure informed learning for solving inverse problems of diﬀerential equations. Chaos, Solitons & Fractals , 175:114057, 2023

work page 2023

[23] [23]

G. E. Karniadakis, I. G. Kevrekidis, L. Lu, P. Perdikari s, S. Wang, and L. Yang. Physics- informed machine learning. Nature Reviews Physics , 3(6):422–440, 2021

work page 2021

[24] [24]

Kharazmi, Z

E. Kharazmi, Z. Zhang, and G. Karniadakis. hp-VPINNs: V ariational physics-informed neural networks with domain decomposition. Computer Methods in Applied Mechanics and Engineer- ing, 374:113547, 2021

work page 2021

[25] [25]

R. Kubo. The ﬂuctuation-dissipation theorem. Reports on progress in physics , 29(1):255, 1966

work page 1966

[26] [26]

K. Lee, N. Trask, and P. Stinis. Machine learning struct ure preserving brackets for forecasting irreversible processes. In Advances in Neural Information Processing Systems , 2021

work page 2021

[27] [27]

W. Li, M. Z. Bazant, and J. Zhu. Phase-ﬁeld deeponet: Phys ics-informed deep operator neural network for fast simulations of pattern formation governed by gradient ﬂows of free-energy functionals. Computer Methods in Applied Mechanics and Engineering , 416:116299, 2023

work page 2023

[28] [28]

Liero, A

M. Liero, A. Mielke, M. A. Peletier, and D. M. Renger. On m icroscopic origins of generalized gradient structures. Discrete and Continuous Dynamical Systems-S , 10(1):1–35, 2017

work page 2017

[29] [29]

C. Liu. An introduction of elastic complex ﬂuids: an ene rgetic variational approach. In Multi- Scale Phenomena in Complex Fluids: Modeling, Analysis and N umerical Simulation , pages 286–337. World Scientiﬁc, 2009

work page 2009

[30] [30]

Y. Lu, X. Li, C. Liu, Q. Tang, and Y. Wang. Learning genera lized diﬀusions using an energetic variational approach. arXiv preprint arXiv:2412.04480 , 2025

work page arXiv 2025

[31] [31]

S. Ma, S. Liu, H. Zha, and H. Zhou. Learning stochastic be haviour from aggregate data. In M. Meila and T. Zhang, editors, Proceedings of the 38th International Conference on Machin e Learning, volume 139 of Proceedings of Machine Learning Research , pages 7258–7267. PMLR, 18–24 Jul 2021

work page 2021

[32] [32]

Messenger and D

D. Messenger and D. Bortz. Weak SINDy for partial diﬀeren tial equations. Journal of Com- putational Physics , 443:110525, 2021

work page 2021

[33] [33]

Messenger and D

D. Messenger and D. Bortz. Weak SINDy: Galerkin-based da ta-driven model selection. Mul- tiscale Modeling & Simulation , 19(3):1474–1497, 2021

work page 2021

[34] [34]

Messenger and D

D. Messenger and D. Bortz. Learning mean-ﬁeld equations from particle data using WSINDy. Physica D: Nonlinear Phenomena , 439:133406, 2022

work page 2022

[35] [35]

L. Onsager. Reciprocal relations in irreversible proc esses. I. Physical Review, 37:405–426, 1931

work page 1931

[36] [36]

L. Onsager. Reciprocal relations in irreversible proc esses. II. Physical Review, 38:2265–2279, 1931

work page 1931

[37] [37]

M. Opper. Variational inference for stochastic diﬀere ntial equations. Annalen der Physik , 531(3):1800233, 2019. 24

work page 2019

[38] [38]

H. C. Öttinger and M. Grmela. Dynamics and thermodynami cs of complex ﬂuids. ii. illustra- tions of a general formalism. Phys. Rev. E , 56(6):6633, 1997

work page 1997

[39] [39]

M. A. Peletier. Variational modelling: Energies, grad ient ﬂows, and large deviations. arXiv preprint arXiv:1402.1990 , 2014

work page arXiv 1990

[40] [40]

Raissi, P

M. Raissi, P. Perdikaris, and G. Karniadakis. Physics- informed neural networks: A deep learn- ing framework for solving forward and inverse problems invo lving nonlinear partial diﬀerential equations. Journal of Computational Physics , 378:686–707, 2019

work page 2019

[41] [41]

Schaeﬀer

H. Schaeﬀer. Learning partial diﬀerential equations v ia data discovery and sparse optimiza- tion. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences , 473(2197):20160446, 2017

work page 2017

[42] [42]

Sirignano and K

J. Sirignano and K. Spiliopoulos. Dgm: A deep learning a lgorithm for solving partial diﬀerential equations. Journal of computational physics , 375:1339–1364, 2018

work page 2018

[43] [43]

J. W. Strutt. Some general theorems relating to vibrati ons. Proceedings of the London Mathe- matical Society, s1-4(1):357–368, 1871

work page

[44] [44]

Wang and C

Y. Wang and C. Liu. Some recent advances in energetic var iational approaches. Entropy, 24(5):721, 2022

work page 2022

[45] [45]

Y. Wang, C. Liu, P. Liu, and B. Eisenberg. Field theory of r eaction-diﬀusion: Law of mass action with an energetic variational approach. Physical Review E , 102(6):062147, 2020

work page 2020

[46] [46]

Wang, T.-F

Y. Wang, T.-F. Zhang, and C. Liu. A two species micro–mac ro model of wormlike micellar solutions and its maximum entropy closure approximations: An energetic variational approach. Journal of Non-Newtonian Fluid Mechanics , 293:104559, 2021

work page 2021

[47] [47]

Wu and Z

J. Wu and Z. Li. Density-functional theory for complex ﬂ uids. Annu. Rev. Phys. Chem. , 58(1):85–112, 2007

work page 2007

[48] [48]

Z. Xu, D. Long, Y. Xu, G. Yang, S. Zhe, and H. Owhadi. Towar d eﬃcient kernel-based solvers for nonlinear PDEs. In Forty-second International Conference on Machine Learnin g, 2025

work page 2025

[49] [49]

Zhang, Y

Z. Zhang, Y. Shin, and G. Em Karniadakis. GFINNs: GENERI C formalism informed neural networks for deterministic and stochastic dynamical syste ms. Philosophical Transactions of the Royal Society A , 380(2229):20210207, 2022. 25

work page 2022