pith. sign in

arxiv: 2501.06933 · v3 · pith:QABJ5TFLnew · submitted 2025-01-12 · 💻 cs.LG · physics.comp-ph· physics.flu-dyn

Neural equilibria for long-term prediction of nonlinear conservation laws

Pith reviewed 2026-05-23 05:01 UTC · model grok-4.3

classification 💻 cs.LG physics.comp-phphysics.flu-dyn
keywords neural networksconservation lawskinetic solversequilibrium closurescientific machine learningshock wavesBoltzmann formulationlong-term prediction
0
0 comments X

The pith

A neural network learns only the nonlinear equilibrium closure inside a kinetic solver to enable stable long-term prediction of nonlinear conservation laws.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that embedding machine learning inside an existing kinetic solver, where the network supplies only the local nonlinear equilibrium target of a Boltzmann formulation, produces accurate and conservative simulations for nonlinear conservation laws. The solver itself continues to handle transport, relaxation, moment recovery, and conservation, preserving physical structure while the network adds flexibility for the hard-to-model equilibrium. Tests on six systems, including subsonic, transonic, and supersonic shocks, show the resulting method outperforms both much larger neural operators and pretrained foundation models. It also improves accuracy and stability over the numerical scheme that supplies its structure. The central suggestion is that learning the equilibrium law the system relaxes toward is a more effective target for scientific machine learning than learning the full evolution law.

Core claim

NeurDE places machine learning inside a kinetic solver by learning the local equilibrium closure of a Boltzmann formulation. The kinetic solver still performs transport, relaxation, moment recovery, and conservation; the neural network provides only the nonlinear equilibrium target. NeurDE outperforms state-of-the-art SciML methods, including neural operators and pretrained SciML foundation models that are 10^4 and 10^6 times larger, respectively. Most notably, NeurDE improves upon the numerical method from which it is derived.

What carries the argument

Neural Discrete Equilibrium (NeurDE), the mechanism that learns the nonlinear local equilibrium closure of the Boltzmann formulation while the kinetic solver retains all other steps.

If this is right

  • The method remains stable and conservative over long times on subsonic to supersonic shock systems.
  • NeurDE outperforms neural operators 10,000 times larger and foundation models a million times larger.
  • NeurDE improves upon the numerical method that supplies its kinetic structure.
  • Scientific machine learning for conservation laws can target the equilibrium law rather than the evolution law itself.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same equilibrium-learning pattern could be tested on other systems where analytic closures are unavailable but kinetic formulations exist.
  • Restricting the network to the equilibrium step may lower data needs compared with learning full space-time operators.
  • Hybrid kinetic-ML solvers of this form might allow larger stable time steps in production codes without sacrificing conservation.

Load-bearing premise

The neural network can accurately learn the nonlinear local equilibrium closure across regimes from subsonic to supersonic shocks so the overall solver remains stable and conservative.

What would settle it

A long-time simulation on a supersonic shock problem that develops instability, violates discrete conservation, or loses accuracy relative to the base numerical scheme would show the claim is false.

Figures

Figures reproduced from arXiv: 2501.06933 by Ivan Dokmani\'c, J. Antonio Lara Benitez, Junyi Guo, Kareem Hegazy, Maarten V. de Hoop, Michael W. Mahoney.

Figure 1
Figure 1. Figure 1: Universal kinetic framework for conservation laws. a. [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Hybrid kinetic-macroscopic propagation. (a) [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Inference results for the subsonic (case 1, Eq. (10)) and transonic (case 2, Eq. (11)) Sod shock tube at t = 500, 700, 999 after training on t < 500. Panels a, b, c, and d, respectively, illustrate the temperature, density, velocity, and pressure evolution for the subsonic case. Panels e – h are the analogous plots for the transonic case. The small solid line represents the polynomial LB expansion of f eq … view at source ↗
Figure 4
Figure 4. Figure 4: Prediction for the supersonic flow around a cylinder with Re = 300 and Ma∞ = 1.8. Panel a illustrates our 2D supersonic flow boundary conditions: free streaming condition on the top and bottom (blue line, Γ3), Dirichlet boundary conditions at the inlet (red line, Γ1), first-order Neumann conditions at the outlet (green line, Γ2), and no slip conditions on the circle (black circle, Γ4). Panels b and c depic… view at source ↗
Figure 5
Figure 5. Figure 5: Long-term prediction for the (top) and transonic (bottom) Sod shock tube (Subsec. 2.3.1) at t = 2099, with initial condition t = 2000. The model is trained using the first 500 time steps of the dataset. The timeline of training, early inference, flow evolution, and long-time evolution presented here is shown in panel a. Panels b, c, d, and e show the temperature, density, velocity, and pressure profiles, r… view at source ↗
Figure 6
Figure 6. Figure 6: Comparison of LB+NeurDE’s performance on long-time predictions with a shortened training time and an initialization time of 900, depicted in panel a. Panels b, c, and d show the Mach number. Panel b shows the last training step seen by LB+NeurDE at t = 150. Panels c and d show the local Mach number at t = 999 for LB+NeurDE (initialized at t0 = 900) and numerical results, respectively. Notably, there is a s… view at source ↗
Figure 7
Figure 7. Figure 7: Detailed interplay between macroscopic and kinetic representations. [PITH_FULL_IMAGE:figures/full_fig_p030_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Neural lattice Boltzmann architecture. a. [PITH_FULL_IMAGE:figures/full_fig_p032_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: visually confirms the subsonic nature of the case, a characteristic we have emphasized throughout the primary portion of this work. For the local Mach number we use Ma = (u · u) 1/2 (γRT)−1/2 . 0 500 1000 1500 2000 2500 3000 Position x 0.00 0.10 0.20 0.30 Mach LB+NeurDE Reference [PITH_FULL_IMAGE:figures/full_fig_p038_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: LB+NeurDE’s failure on the subsonic Sod shock tube when predicting t = 3999 after being initialized at t0 = 3900. The solid blue line represents the LB+NeurDE prediction and the dotted lines represent simulated results at different times. We show the temperature, density, velocity, and pressure variables in panels a, b, c, and d, respectively. E.1.3 FNO Benchmark To establish a rigorous baseline, we train… view at source ↗
Figure 11
Figure 11. Figure 11: Impact of boundary extension strategies on spectral convergence. [PITH_FULL_IMAGE:figures/full_fig_p040_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: We train and evaluate the FNO model on the subsonic Sod shock tube with 128 Fourier modes, 128 embedding dimensions, and 4 layers. We evaluate (a) Temperature, (b) Density, (c) Velocity, and (d) Pressure for only 1 and 10 time steps [PITH_FULL_IMAGE:figures/full_fig_p042_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: We train and evaluate the FNO model on the subsonic Sod shock tube with 128 Fourier modes, 256 embedding dimensions, and 4 layers. We evaluate (a) Temperature, (b) Density, (c) Velocity, and (d) Pressure for only 1 and 10 time steps. 42 [PITH_FULL_IMAGE:figures/full_fig_p042_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: We train and evaluate the FNO model on the subsonic Sod shock tube with 512 Fourier modes, 256 embedding dimensions, and 8 layers. We evaluate (a) Temperature, (b) Density, (c) Velocity, and (d) Pressure for only 1 and 10 time steps. contrast to LB+NeurDE’s ability to accurately resolve the discontinuity for 500 time steps (Fig (a-d) 3), FNO shows significant deviations after a single time step. This is e… view at source ↗
Figure 15
Figure 15. Figure 15: Improvements in LB+NeurDE temperature profile predictions through the inclusion of TVD regularization during training. Panel a shows the LB+NeurDE performance without TVD regularization. Panel b shows the LB+NeurDE performance when TVD regularization is included and weighted by a constant α2, see Algorithm 2. Panel c shows the LB+NeurDE performance when TVD regularization is included with a linearly incre… view at source ↗
Figure 16
Figure 16. Figure 16: Comparison of the local Mach number for the near-sonic Sod shock tube (case 2 Eq. (11)) between LB+NeurDE and simulation results. The black line represents the numerical reference, while the blue line depicts the flow predicted by LB+NeurDE trained with TVD. This snapshot is taken at time-step 700. We observe that the local Mach number of the tube is close to Ma = 1 at around x = 1500. E.2.5 Errors of the… view at source ↗
Figure 17
Figure 17. Figure 17: LB+NeurDE’s failure on the near-sonic Sod shock tube when predicting the t = 2599 time-step after being initialized at t0 = 2500. The solid blue line represents the LB+NeurDE prediction and the dotted lines represent simulated results at different times. We show the temperature, density, velocity, and pressure variables in panels a, b, c, and d, respectively. 0.14 0.16 0.18 0.20 0.22 0.24 T e m p e r a t … view at source ↗
Figure 18
Figure 18. Figure 18: The onset of LB+NeurDE’s failure on the near-sonic Sod shock tube when predicting the t = 2575 time-step after being initialized at t0 = 2500. The solid blue line represents the LB+NeurDE prediction and the dotted lines represent simulated results at different times. We show the temperature, density, velocity, and pressure variables in panels a, b, c, and d, respectively. 47 [PITH_FULL_IMAGE:figures/full… view at source ↗
Figure 19
Figure 19. Figure 19: Comparison of the temperature predictions for Sod case 1 between LB+NeurDE, MLPsym+ cons , and the numerical simulation for time-step 550. Here, both LB+NeurDE and MLPsym+ cons are initialized at t0 = 500. Panel a shows the predicted temperature; and panel b shows its relative L2 -norm error with respect to the numerical simulation at different time-steps. The blue line represents LB+NeurDE, the black lin… view at source ↗
Figure 20
Figure 20. Figure 20: Comparison of the evolving temperature predictions for Sod case 1 between LB+NeurDE, MLPsym+ cons , without the algebraic correction, and the numerical simulation for time-step 550, 625, and 675, in panels a, b, and c, respectively. Here, both LB+NeurDE and MLPsym+ cons are initialized at t0 = 500. Panel d shows the relative L2 -norm error with respect to the numerical simulation as a function of predicti… view at source ↗
Figure 21
Figure 21. Figure 21: The temperature comparison between the ContinuousNet surrogate model and numerical simulation for the subsonic Sod shock tube (case 1 Eq. (10)). The results are presented at time-step 650 which is 150 time-steps beyond training data. The blue line represents the ContinuousNet surrogate model and the black line represents the simulation. 52 [PITH_FULL_IMAGE:figures/full_fig_p052_21.png] view at source ↗
Figure 22
Figure 22. Figure 22: The LB+NeurDE temperature prediction at t = 700 for the 2D supersonic flow experiment when trained on the first 500 time-steps and initialized at t0 = 500. LB+NeurDE a Simulation b 0.75 1.00 1.25 1.50 1.75 2.00 [PITH_FULL_IMAGE:figures/full_fig_p053_22.png] view at source ↗
Figure 23
Figure 23. Figure 23: The LB+NeurDE density prediction at t = 700 for the 2D supersonic flow experiment when trained on the first 500 time-steps and initialized at t0 = 500. 53 [PITH_FULL_IMAGE:figures/full_fig_p053_23.png] view at source ↗
Figure 24
Figure 24. Figure 24: The LB+NeurDE pressure prediction at t = 700 for the 2D supersonic flow experiment when trained on the first 500 time-steps and initialized at t0 = 500. LB+NeurDE a Simulation b 0.0 0.2 0.4 0.6 0.8 1.0 [PITH_FULL_IMAGE:figures/full_fig_p054_24.png] view at source ↗
Figure 25
Figure 25. Figure 25: The LB+NeurDE speed ( √ u · u) prediction at t = 700 for the 2D supersonic flow experiment when trained on the first 500 time-steps and initialized at t0 = 500. LB+NeurDE a Simulation b 0.0 0.2 0.4 0.6 0.8 1.0 [PITH_FULL_IMAGE:figures/full_fig_p054_25.png] view at source ↗
Figure 26
Figure 26. Figure 26: The LB+NeurDE x velocity (ux) prediction at t = 700 for the 2D supersonic flow experiment when trained on the first 500 time-steps and initialized at t0 = 500. 54 [PITH_FULL_IMAGE:figures/full_fig_p054_26.png] view at source ↗
Figure 27
Figure 27. Figure 27: The LB+NeurDE y velocity (uy) prediction at t = 700 for the 2D supersonic flow experiment when trained on the first 500 time-steps and initialized at t0 = 500. 55 [PITH_FULL_IMAGE:figures/full_fig_p055_27.png] view at source ↗
Figure 28
Figure 28. Figure 28: We show the LB+NeurDE temperature prediction at t = 999 for the 2D supersonic flow experiment when trained on the first 150 time-steps and initialized at t0 = 900. LB+NeurDE b Simulation c a Last Training Example 0.75 1.00 1.25 1.50 1.75 2.00 [PITH_FULL_IMAGE:figures/full_fig_p056_28.png] view at source ↗
Figure 29
Figure 29. Figure 29: We show the LB+NeurDE density prediction at t = 999 for the 2D supersonic flow experiment when trained on the first 150 time-steps and initialized at t0 = 900. 56 [PITH_FULL_IMAGE:figures/full_fig_p056_29.png] view at source ↗
Figure 30
Figure 30. Figure 30: We show the LB+NeurDE pressure prediction at t = 999 for the 2D supersonic flow experiment when trained on the first 150 time-steps and initialized at t0 = 900. LB+NeurDE b Simulation c a Last Training Example 0.0 0.2 0.4 0.6 0.8 1.0 [PITH_FULL_IMAGE:figures/full_fig_p057_30.png] view at source ↗
Figure 31
Figure 31. Figure 31: We show the LB+NeurDE speed prediction at t = 999 for the 2D supersonic flow experiment when trained on the first 150 time-steps and initialized at t0 = 900. 57 [PITH_FULL_IMAGE:figures/full_fig_p057_31.png] view at source ↗
Figure 32
Figure 32. Figure 32: We show the LB+NeurDE x velocity prediction at t = 999 for the 2D supersonic flow experiment when trained on the first 150 time-steps and initialized at t0 = 900. LB+NeurDE b Simulation c a Last Training Example 0.2 0.1 0.0 0.1 [PITH_FULL_IMAGE:figures/full_fig_p058_32.png] view at source ↗
Figure 33
Figure 33. Figure 33: We show the LB+NeurDE y velocity prediction at t = 999 for the 2D supersonic flow experiment when trained on the first 150 time-steps and initialized at t0 = 900. 58 [PITH_FULL_IMAGE:figures/full_fig_p058_33.png] view at source ↗
read the original abstract

Nonlinear conservation laws govern a broad class of important physical systems in science and industry and are central to scientific machine learning (SciML). Large general-purpose models offer speed, but replacing the numerical and physical structure of solvers often compromises stability, accuracy, and physical faithfulness. Here, we aim to balance the general inductive bias of conservation with the flexibility and speed of neural networks through a conservation-aware SciML backbone, which we call Neural Discrete Equilibrium (NeurDE). NeurDE places machine learning inside a kinetic solver by learning the local equilibrium closure of a Boltzmann formulation. The kinetic solver still performs transport, relaxation, moment recovery, and conservation; the neural network provides only the nonlinear equilibrium target. We test NeurDE on $6$ conserved systems, including three very challenging subsonic, transonic, and supersonic shock systems. NeurDE outperforms state-of-the-art SciML methods, including neural operators and pretrained SciML foundation models that are $10^4$ and $10^6$ times larger, respectively. Most notably, NeurDE improves upon the numerical method from which it is derived. NeurDE therefore provides a compact target for scientific machine learning in conservative simulation: learn the equilibrium law toward which the system relaxes, not the evolution law itself.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The paper introduces Neural Discrete Equilibrium (NeurDE), a SciML approach that embeds a neural network to learn only the local nonlinear equilibrium closure inside a standard kinetic (Boltzmann-type) solver for nonlinear conservation laws. The solver itself continues to perform transport, relaxation, moment recovery, and exact conservation; the network supplies the equilibrium target. The method is evaluated on six conserved systems, including three challenging subsonic/transonic/supersonic shock problems, with the claim that it outperforms state-of-the-art neural operators and much larger pretrained foundation models while also improving upon the underlying numerical scheme.

Significance. If the empirical results hold, NeurDE offers a compact, structure-preserving route for incorporating machine learning into conservative simulation: the learned component targets the equilibrium law rather than the full evolution operator. This could yield more stable long-time behavior than purely data-driven neural operators, especially for shock-dominated flows, while retaining the conservation properties and stability guarantees of the base kinetic scheme. The approach aligns well with existing numerical infrastructure and avoids replacing the entire solver.

major comments (1)
  1. [Abstract] Abstract: the headline claim that NeurDE 'improves upon the numerical method from which it is derived' on three shock problems is load-bearing for the central contribution, yet the abstract supplies no metrics, baselines, error tables, conservation-error histories, or implementation details. Without these, it is impossible to verify that the learned equilibrium map remains strictly superior to the original closure and does not accumulate drift or oscillations in the supersonic regime over long integration times.
minor comments (1)
  1. The abstract refers to 'six conserved systems' and 'three very challenging shock systems' without naming them or indicating the precise quantitative criteria (e.g., L2 error, conservation residual, wall-clock time) used for the comparisons; this information should appear in the introduction or first results section for clarity.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their thoughtful review and for highlighting the need for greater specificity in the abstract. We address the single major comment below and will revise the manuscript accordingly.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the headline claim that NeurDE 'improves upon the numerical method from which it is derived' on three shock problems is load-bearing for the central contribution, yet the abstract supplies no metrics, baselines, error tables, conservation-error histories, or implementation details. Without these, it is impossible to verify that the learned equilibrium map remains strictly superior to the original closure and does not accumulate drift or oscillations in the supersonic regime over long integration times.

    Authors: We agree that the abstract would be strengthened by including concrete quantitative support for the headline claim. In the revised manuscript we will add a concise sentence (or short clause) summarizing the key metrics: relative L2 error reductions versus the baseline kinetic solver on the three shock problems, together with a statement that conservation errors remain at machine precision with no observed drift or oscillations over the reported integration horizons. Full error tables, conservation-error histories, baseline comparisons, and implementation details already appear in Sections 4.3–5 and the supplementary material; the revised abstract will explicitly reference these sections. This change directly addresses the verifiability concern while preserving the abstract’s brevity. revision: yes

Circularity Check

0 steps flagged

No circularity: hybrid solver structure remains independent of learned component

full rationale

The derivation places a neural network only in the role of supplying the nonlinear local equilibrium target inside an otherwise standard kinetic solver that independently executes transport, relaxation, moment recovery, and exact conservation. No equation reduces the output to a fitted parameter by construction, no prediction is statistically forced from a subset of the same data, and no load-bearing premise rests on a self-citation or uniqueness theorem imported from the authors' prior work. The improvement claim over the base numerical method is presented as an empirical outcome on specific shock problems rather than a definitional identity. The method is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the standard kinetic-theory assumption that a local equilibrium closure exists and can be learned separately while the solver enforces conservation; no free parameters or invented physical entities are specified in the abstract.

axioms (1)
  • domain assumption Nonlinear conservation laws admit a Boltzmann-type kinetic formulation whose distribution relaxes to a local equilibrium that can be approximated independently.
    This premise allows the neural network to supply only the equilibrium target while the solver handles transport and conservation.

pith-pipeline@v0.9.0 · 5779 in / 1313 out tokens · 99971 ms · 2026-05-23T05:01:35.850284+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Is Flow Matching Just Trajectory Replay for Sequential Data?

    stat.ML 2026-02 unverdicted novelty 7.0

    Flow matching on time series targets a closed-form nonparametric velocity field that is a similarity-weighted mixture of observed transition velocities, making neural models approximations to an ideal memory-augmented...

  2. SciML Agents: Write the Solver, Not the Solution

    cs.LG 2025-09 unverdicted novelty 7.0

    LLMs prompted with domain knowledge can generate runnable, numerically valid code for stiff and non-stiff ODEs on new diagnostic and 1000-task benchmarks.

Reference graph

Works this paper leans on

94 extracted references · 94 canonical work pages · cited by 2 Pith papers · 4 internal anchors

  1. [1]

    Discrete kinetic schemes for multidimensional systems of conservation laws.SIAM Journal on Numerical Analysis, 37(6):1973–2004, 2000

    Denise Aregba-Driollet and Roberto Natalini. Discrete kinetic schemes for multidimensional systems of conservation laws.SIAM Journal on Numerical Analysis, 37(6):1973–2004, 2000

  2. [2]

    Splitting of the inhomogeneous kinetic operator of the boltzmann equation

    Vladimir Vladimirovich Aristov and Feliks Grigor’evich Cheremisin. Splitting of the inhomogeneous kinetic operator of the boltzmann equation. InAkademiia Nauk SSSR Doklady, volume 231, pages 49–52, 1976

  3. [3]

    Supersonic cylinder wake dynamics.Journal of Fluid Mechanics, 945, 2022

    M Awasthi, S McCreton, DJ Moreau, and CJ Doolan. Supersonic cylinder wake dynamics.Journal of Fluid Mechanics, 945, 2022

  4. [4]

    Thomas Bellotti. Truncation errors and modified equations for the lattice boltzmann method via the corresponding finite difference schemes.ESAIM: Mathematical Modelling and Numerical Analysis, 57(3): 1225–1255, 2023

  5. [5]

    Finite difference formulation of any lattice boltzmann scheme.Numerische Mathematik, 152(1):1–40, 2022

    Thomas Bellotti, Benjamin Graille, and Marc Massot. Finite difference formulation of any lattice boltzmann scheme.Numerische Mathematik, 152(1):1–40, 2022

  6. [6]

    Out-of-distributional risk bounds for neural operators with applications to the helmholtz equation.Journal of Computational Physics, page 113168, 2024

    Jose Antonio Lara Benitez, Takashi Furuya, Florian Faucher, Anastasis Kratsios, Xavier Tricoche, and Maarten V de Hoop. Out-of-distributional risk bounds for neural operators with applications to the helmholtz equation.Journal of Computational Physics, page 113168, 2024

  7. [7]

    A model for collision processes in gases

    Prabhu Lal Bhatnagar, Eugene P Gross, and Max Krook. A model for collision processes in gases. i. small amplitude processes in charged and neutral one-component systems.Physical review, 94(3):511, 1954

  8. [8]

    Construction of bgk models with a family of kinetic entropies for a given system of conservation laws.Journal of statistical physics, 95:113–170, 1999

    Fran¸ cois Bouchut. Construction of bgk models with a family of kinetic entropies for a given system of conservation laws.Journal of statistical physics, 95:113–170, 1999

  9. [9]

    Message passing neural pde solvers.arXiv preprint arXiv:2202.03376, 2022

    Johannes Brandstetter, Daniel Worrall, and Max Welling. Message passing neural pde solvers.arXiv preprint arXiv:2202.03376, 2022

  10. [10]

    Promising directions of machine learning for partial differential equations.Nature Computational Science, 4(7):483–494, 2024

    Steven L Brunton and J Nathan Kutz. Promising directions of machine learning for partial differential equations.Nature Computational Science, 4(7):483–494, 2024

  11. [11]

    Shock profile solutions of the boltzmann equation.Communica- tions in Mathematical Physics, 86(2):161–194, 1982

    Russel E Caflisch and Basil Nicolaenko. Shock profile solutions of the boltzmann equation.Communica- tions in Mathematical Physics, 86(2):161–194, 1982

  12. [12]

    Physics-informed neural networks (pinns) for fluid mechanics: A review.Acta Mechanica Sinica, 37(12):1727–1738, 2021

    Shengze Cai, Zhiping Mao, Zhicheng Wang, Minglang Yin, and George Em Karniadakis. Physics-informed neural networks (pinns) for fluid mechanics: A review.Acta Mechanica Sinica, 37(12):1727–1738, 2021

  13. [13]

    The boltzmann equation

    Carlo Cercignani. The boltzmann equation. InThe Boltzmann equation and its applications, pages 40–103. Springer, 1988

  14. [14]

    Multi-level Residual Networks from Dynamical Systems View

    Bo Chang, Lili Meng, Eldad Haber, Frederick Tung, and David Begert. Multi-level residual networks from dynamical systems view.arXiv preprint arXiv:1710.10348, 2017

  15. [15]

    Ricky T. Q. Chen, Yulia Rubanova, Jesse Bettencourt, and David K Duvenaud. Neural ordinary differential equations. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, editors,Advances in Neural Information Processing Systems, volume 31, 2018

  16. [16]

    Lattice boltzmann method for fluid flows.Annual review of fluid mechanics, 30(1):329–364, 1998

    Shiyi Chen and Gary D Doolen. Lattice boltzmann method for fluid flows.Annual review of fluid mechanics, 30(1):329–364, 1998

  17. [17]

    Constructing custom thermodynamics using deep learning.Nature Computational Science, 4(1):66–85, 2024

    Xiaoli Chen, Beatrice W Soh, Zi-En Ooi, Eleonore Vissol-Gaudin, Haijun Yu, Kostya S Novoselov, Kedar Hippalgaonkar, and Qianxiao Li. Constructing custom thermodynamics using deep learning.Nature Computational Science, 4(1):66–85, 2024. 15

  18. [18]

    Toward learning lattice boltzmann collision operators.The European Physical Journal E, 46(3): 10, 2023

    Alessandro Corbetta, Alessandro Gabbana, Vitaliy Gyrya, Daniel Livescu, Joost Prins, and Federico Toschi. Toward learning lattice boltzmann collision operators.The European Physical Journal E, 46(3): 10, 2023

  19. [19]

    Comprehensive comparison of collision models in the lattice boltzmann framework: Theoretical investigations.Physical Review E, 100(3):033305, 2019

    Christophe Coreixas, Bastien Chopard, and Jonas Latt. Comprehensive comparison of collision models in the lattice boltzmann framework: Theoretical investigations.Physical Review E, 100(3):033305, 2019

  20. [20]

    Lattice kinetic schemes for magnetohydrodynamics.Journal of Computational Physics, 179(1):95–126, 2002

    Paul J Dellar. Lattice kinetic schemes for magnetohydrodynamics.Journal of Computational Physics, 179(1):95–126, 2002

  21. [21]

    An interpretation and derivation of the lattice boltzmann method using strang splitting

    Paul J Dellar. An interpretation and derivation of the lattice boltzmann method using strang splitting. Computers & Mathematics with Applications, 65(2):129–141, 2013

  22. [22]

    Maximisation of the entropy in non-equilibrium.Journal of Physics A: Mathematical and General, 20(18):6505, 1987

    Wolfgang Dreyer. Maximisation of the entropy in non-equilibrium.Journal of Physics A: Mathematical and General, 20(18):6505, 1987

  23. [23]

    Differentiable hybrid neural modeling for fluid-structure interaction

    Xiantao Fan and Jian-Xun Wang. Differentiable hybrid neural modeling for fluid-structure interaction. Journal of Computational Physics, 496:112584, 2024

  24. [24]

    A compressible lattice boltzmann finite volume model for high subsonic and transonic flows on regular lattices.Computers & Fluids, 131:45–55, 2016

    Yongliang Feng, Pierre Sagaut, and Wen-Quan Tao. A compressible lattice boltzmann finite volume model for high subsonic and transonic flows on regular lattices.Computers & Fluids, 131:45–55, 2016

  25. [25]

    Entropic lattice boltzmann model for compressible flows.Physical Review E, 92(6):061301, 2015

    Nicol` o Frapolli, Shyam S Chikatamarla, and Iliya V Karlin. Entropic lattice boltzmann model for compressible flows.Physical Review E, 92(6):061301, 2015

  26. [26]

    Lattice kinetic theory in a comoving galilean reference frame.Physical review letters, 117(1):010604, 2016

    Nicol` o Frapolli, Shyam S Chikatamarla, and Iliya V Karlin. Lattice kinetic theory in a comoving galilean reference frame.Physical review letters, 117(1):010604, 2016

  27. [27]

    On the kinetic theory of rarefied gases.Communications on pure and applied mathematics, 2(4):331–407, 1949

    Harold Grad. On the kinetic theory of rarefied gases.Communications on pure and applied mathematics, 2(4):331–407, 1949

  28. [28]

    Principles of the kinetic theory of gases

    Harold Grad. Principles of the kinetic theory of gases. InThermodynamik der Gase/Thermodynamics of Gases, pages 205–294. Springer, 1958

  29. [29]

    Symmetrized operator split schemes for force and source modeling in cascaded lattice boltzmann methods for flow and scalar transport.Physical Review E, 97(6):063303, 2018

    Farzaneh Hajabdollahi and Kannan N Premnath. Symmetrized operator split schemes for force and source modeling in cascaded lattice boltzmann methods for flow and scalar transport.Physical Review E, 97(6):063303, 2018

  30. [30]

    Learning physical models that can respect conservation laws

    Derek Hansen, Danielle C Maddix, Shima Alizadeh, Gaurav Gupta, and Michael W Mahoney. Learning physical models that can respect conservation laws. InInternational Conference on Machine Learning, pages 12469–12510. PMLR, 2023

  31. [31]

    High resolution schemes for hyperbolic conservation laws.Journal of computational physics, 135(2):260–278, 1997

    Ami Harten. High resolution schemes for hyperbolic conservation laws.Journal of computational physics, 135(2):260–278, 1997

  32. [32]

    Discrete boltzmann equation model for nonideal gases

    Xiaoyi He, Xiaowen Shan, and Gary D Doolen. Discrete boltzmann equation model for nonideal gases. Physical Review E, 57(1):R13, 1998

  33. [33]

    Recency Biased Causal Attention for Time-series Forecasting

    Kareem Hegazy, Michael W. Mahoney, and N. Benjamin Erichson. Powerformer: A transformer with weighted causal attention for time-series forecasting, 2025. URL https://arxiv.org/abs/2502.06151

  34. [34]

    Poseidon: Efficient foundation models for PDEs

    Maximilian Herde, Bogdan Raonic, Tobias Rohner, Roger K¨ appeli, Roberto Molinaro, Emmanuel de Bezenac, and Siddhartha Mishra. Poseidon: Efficient foundation models for PDEs. InThe Thirty- eighth Annual Conference on Neural Information Processing Systems, 2024

  35. [35]

    European Mathematical Society, 2010

    Helge Holden.Splitting methods for partial differential equations with rough solutions: Analysis and MATLAB programs, volume 11. European Mathematical Society, 2010

  36. [36]

    The relaxation schemes for systems of conservation laws in arbitrary space dimensions.Communications on pure and applied mathematics, 48(3):235–276, 1995

    Shi Jin and Zhouping Xin. The relaxation schemes for systems of conservation laws in arbitrary space dimensions.Communications on pure and applied mathematics, 48(3):235–276, 1995. 16

  37. [37]

    Factorization symmetry in the lattice boltzmann method.Physica A: Statistical Mechanics and its Applications, 389(8):1530–1548, 2010

    Ilya Karlin and Pietro Asinari. Factorization symmetry in the lattice boltzmann method.Physica A: Statistical Mechanics and its Applications, 389(8):1530–1548, 2010

  38. [38]

    Consistent two-population lattice boltzmann model for thermal flows.Physical Review E, 88(6):063310, 2013

    IV Karlin, D Sichau, and SS Chikatamarla. Consistent two-population lattice boltzmann model for thermal flows.Physical Review E, 88(6):063310, 2013

  39. [39]

    Machine learning–accelerated computational fluid dynamics.Proceedings of the National Academy of Sciences, 118(21):e2101784118, 2021

    Dmitrii Kochkov, Jamie A Smith, Ayya Alieva, Qing Wang, Michael P Brenner, and Stephan Hoyer. Machine learning–accelerated computational fluid dynamics.Proceedings of the National Academy of Sciences, 118(21):e2101784118, 2021

  40. [40]

    AM Kogan. Derivation of grad’s type equations and study of their relaxation properties by the method of maximization of entropy.Journal of Applied Mathematics and Mechanics, 29(1):130–142, 1965

  41. [42]

    Neural operator: Learning maps between function spaces with applications to pdes.Journal of Machine Learning Research, 24(89):1–97, 2023

    Nikola Kovachki, Zongyi Li, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Neural operator: Learning maps between function spaces with applications to pdes.Journal of Machine Learning Research, 24(89):1–97, 2023

  42. [43]

    Mix- ture of experts soften the curse of dimensionality in operator learning.arXiv preprint arXiv:2404.09101, 2024

    Anastasis Kratsios, Takashi Furuya, Jose Antonio Lara Benitez, Matti Lassas, and Maarten de Hoop. Mix- ture of experts soften the curse of dimensionality in operator learning.arXiv preprint arXiv:2404.09101, 2024

  43. [44]

    Character- izing possible failure modes in physics-informed neural networks

    Aditi Krishnapriyan, Amir Gholami, Shandian Zhe, Robert Kirby, and Michael W Mahoney. Character- izing possible failure modes in physics-informed neural networks. InAdvances in Neural Information Processing Systems, volume 34, pages 26548–26560, 2021

  44. [45]

    Learning continuous models for continuous physics.Communications Physics, 6(1):319, 2023

    Aditi S Krishnapriyan, Alejandro F Queiruga, N Benjamin Erichson, and Michael W Mahoney. Learning continuous models for continuous physics.Communications Physics, 6(1):319, 2023

  45. [46]

    The lattice boltzmann method.Springer International Publishing, 10(978-3):4–15, 2017

    Timm Kr¨ uger, Halim Kusumaatmaja, Alexandr Kuzmin, Orest Shardt, Goncalo Silva, and Erlend Magnus Viggen. The lattice boltzmann method.Springer International Publishing, 10(978-3):4–15, 2017

  46. [47]

    Efficient supersonic flow simulations using lattice boltzmann methods based on numerical equilibria.Philosophical Transactions of the Royal Society A, 378(2175):20190559, 2020

    Jonas Latt, Christophe Coreixas, Jo¨ el Beny, and Andrea Parmigiani. Efficient supersonic flow simulations using lattice boltzmann methods based on numerical equilibria.Philosophical Transactions of the Royal Society A, 378(2175):20190559, 2020

  47. [48]

    PhD thesis, Inria, 1997

    Patrick Le Tallec and Jean-Philippe Perlat.Numerical analysis of Levermore’s moment system. PhD thesis, Inria, 1997

  48. [49]

    Lrnla lattice boltzmann method: a performance comparison of implementations on gpu and cpu

    Vadim Levchenko, Andrey Zakirov, and Anastasia Perepelkina. Lrnla lattice boltzmann method: a performance comparison of implementations on gpu and cpu. InInternational Conference on Parallel Computational Technologies, pages 139–151. Springer, 2019

  49. [50]

    Finite volume methods for hyperbolic problems, 2002

    RJ LeVeque. Finite volume methods for hyperbolic problems, 2002

  50. [51]

    Moment closure hierarchies for kinetic theories.Journal of statistical Physics, 83: 1021–1065, 1996

    C David Levermore. Moment closure hierarchies for kinetic theories.Journal of statistical Physics, 83: 1021–1065, 1996

  51. [52]

    Fourier neural operator for parametric partial differential equations

    Zongyi Li, Nikola Borislavov Kovachki, Kamyar Azizzadenesheli, Burigede liu, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Fourier neural operator for parametric partial differential equations. InInternational Conference on Learning Representations, 2021

  52. [53]

    Pde-refiner: Achieving accurate long rollouts with neural pde solvers.Advances in Neural Information Processing Systems, 36:67398–67433, 2023

    Phillip Lippe, Bas Veeling, Paris Perdikaris, Richard Turner, and Johannes Brandstetter. Pde-refiner: Achieving accurate long rollouts with neural pde solvers.Advances in Neural Information Processing Systems, 36:67398–67433, 2023. 17

  53. [54]

    iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

    Yong Liu, Tengge Hu, Haoran Zhang, Haixu Wu, Shiyu Wang, Lintao Ma, and Mingsheng Long. itransformer: Inverted transformers are effective for time series forecasting, 2024. URL https://arxiv. org/abs/2310.06625

  54. [55]

    Learning nonlinear operators via deeponet based on the universal approximation theorem of operators.Nature machine intelligence, 3(3):218–229, 2021

    Lu Lu, Pengzhan Jin, Guofei Pang, Zhongqiang Zhang, and George Em Karniadakis. Learning nonlinear operators via deeponet based on the universal approximation theorem of operators.Nature machine intelligence, 3(3):218–229, 2021

  55. [56]

    Wavecastnet: An AI- enabled wavefield forecasting framework for earthquake early warning,

    Dongwei Lyu, Rie Nakata, Pu Ren, Michael W Mahoney, Arben Pitarka, Nori Nakata, and N Benjamin Erichson. Wavecastnet: An ai-enabled wavefield forecasting framework for earthquake early warning. arXiv preprint arXiv:2405.20516, 2024

  56. [57]

    Springer Science & Business Media, 2012

    Andrew Majda.Compressible fluid flow and systems of conservation laws in several space variables, volume 53. Springer Science & Business Media, 2012

  57. [58]

    Weak baselines and reporting biases lead to overoptimism in machine learning for fluid-related partial differential equations.arXiv preprint arXiv:2407.07218, 2024

    Nick McGreivy and Ammar Hakim. Weak baselines and reporting biases lead to overoptimism in machine learning for fluid-related partial differential equations.arXiv preprint arXiv:2407.07218, 2024

  58. [59]

    Discrete velocity model and implicit scheme for the bgk equation of rarefied gas dynamics

    Luc Mieussens. Discrete velocity model and implicit scheme for the bgk equation of rarefied gas dynamics. Mathematical Models and Methods in Applied Sciences, 10(08):1121–1149, 2000

  59. [60]

    A survey of deterministic solvers for rarefied flows

    Luc Mieussens. A survey of deterministic solvers for rarefied flows. InAIP Conference Proceedings, volume 1628, pages 943–951. American Institute of Physics, 2014

  60. [61]

    Neural-network based collision operators for the boltzmann equation.Journal of Computational Physics, 470:111541, 2022

    Sean T Miller, Nathan V Roberts, Stephen D Bond, and Eric C Cyr. Neural-network based collision operators for the boltzmann equation.Journal of Computational Physics, 470:111541, 2022

  61. [62]

    A time series is worth 64 words: Long-term forecasting with transformers

    Yuqi Nie, Nam H Nguyen, Phanwadee Sinthong, and Jayant Kalagnanam. A time series is worth 64 words: Long-term forecasting with transformers. InThe Eleventh International Conference on Learning Representations, 2023

  62. [63]

    Higher order approximation methods for the boltzmann equation.Journal of Computa- tional Physics, 139(1):1–14, 1998

    Taku Ohwada. Higher order approximation methods for the boltzmann equation.Journal of Computa- tional Physics, 139(1):1–14, 1998

  63. [64]

    Thermodynamically consistent physics-informed neural networks for hyperbolic systems

    Ravi G Patel, Indu Manickam, Nathaniel A Trask, Mitchell A Wood, Myoungkyu Lee, Ignacio Tomas, and Eric C Cyr. Thermodynamically consistent physics-informed neural networks for hyperbolic systems. Journal of Computational Physics, 449:110754, 2022

  64. [65]

    General entropic approximations for canonical systems described by kinetic equations

    Vincent Pavan. General entropic approximations for canonical systems described by kinetic equations. Journal of Statistical Physics, 142:792–827, 2011

  65. [66]

    Global existence to the bgk model of boltzmann equation.Journal of Differential equations, 82(1):191–205, 1989

    Benoˆıt Perthame. Global existence to the bgk model of boltzmann equation.Journal of Differential equations, 82(1):191–205, 1989

  66. [67]

    Lattice boltzmann method for thermal flow simulation on standard lattices.Physical Review E—Statistical, Nonlinear, and Soft Matter Physics, 76(1):016702, 2007

    Nikolaos I Prasianakis and Iliya V Karlin. Lattice boltzmann method for thermal flow simulation on standard lattices.Physical Review E—Statistical, Nonlinear, and Soft Matter Physics, 76(1):016702, 2007

  67. [68]

    Benjamin Erichson, and Michael W

    Pu Ren, Rie Nakata, Maxime Lacour, Ilan Naiman, Nori Nakata, Jialin Song, Zhengfa Bi, Osman Asif Malik, Dmitriy Morozov, Omri Azencot, N. Benjamin Erichson, and Michael W. Mahoney. Learning physics for unveiling hidden earthquake ground motions via conditional generative modeling, 2024. URL https://arxiv.org/abs/2407.15089

  68. [69]

    Lattice boltzmann model for compressible flows on standard lattices: Variable prandtl number and adiabatic exponent.Physical Review E, 99(1): 013306, 2019

    Mohammad Hossein Saadat, Fabian B¨ osch, and Ilya V Karlin. Lattice boltzmann model for compressible flows on standard lattices: Variable prandtl number and adiabatic exponent.Physical Review E, 99(1): 013306, 2019

  69. [70]

    Sakarvadia, K

    M. Sakarvadia, K. Hegazy, A. Totounferoush, K. Chard, Y. Yang, I. Foster, and M. W. Mahoney. The false promise of zero-shot super-resolution in machine-learned operators. Technical Report Preprint: arXiv:2510.06646, 2025. 18

  70. [71]

    A unified operator splitting approach for multi-scale fluid–particle coupling in the lattice boltzmann method.Computer Physics Communications, 185(10):2586–2597, 2014

    Ulf D Schiller. A unified operator splitting approach for multi-scale fluid–particle coupling in the lattice boltzmann method.Computer Physics Communications, 185(10):2586–2597, 2014

  71. [72]

    Central-moment-based galilean-invariant multiple-relaxation-time collision model.Physical Review E, 100(4):043308, 2019

    Xiaowen Shan. Central-moment-based galilean-invariant multiple-relaxation-time collision model.Physical Review E, 100(4):043308, 2019

  72. [73]

    Discretization of the velocity space in the solution of the boltzmann equation.Physical Review Letters, 80(1):65, 1998

    Xiaowen Shan and Xiaoyi He. Discretization of the velocity space in the solution of the boltzmann equation.Physical Review Letters, 80(1):65, 1998

  73. [74]

    A survey of several finite difference methods for systems of nonlinear hyperbolic conservation laws.Journal of computational physics, 27(1):1–31, 1978

    Gary A Sod. A survey of several finite difference methods for systems of nonlinear hyperbolic conservation laws.Journal of computational physics, 27(1):1–31, 1978

  74. [75]

    On the construction and comparison of difference schemes.SIAM journal on numerical analysis, 5(3):506–517, 1968

    Gilbert Strang. On the construction and comparison of difference schemes.SIAM journal on numerical analysis, 5(3):506–517, 1968

  75. [76]

    Towards foundation models for scientific machine learning: Characterizing scaling and transfer behavior

    Shashank Subramanian, Peter Harrington, Kurt Keutzer, Wahid Bhimji, Dmitriy Morozov, Michael W Mahoney, and Amir Gholami. Towards foundation models for scientific machine learning: Characterizing scaling and transfer behavior. InAdvances in Neural Information Processing Systems, volume 36, pages 71242–71262, 2023

  76. [77]

    Oxford university press, 2018

    Sauro Succi and S Succi.The lattice Boltzmann equation: for complex states of flowing matter. Oxford university press, 2018

  77. [78]

    Exponential distribution functions for positivity-preserving lattice boltzmann schemes: Application to 2d compressible flow simulations.Physics of Fluids, 35(12), 2023

    Karthik Thyagarajan, Christophe Coreixas, and Jonas Latt. Exponential distribution functions for positivity-preserving lattice boltzmann schemes: Application to 2d compressible flow simulations.Physics of Fluids, 35(12), 2023

  78. [79]

    Springer Science & Business Media, 2013

    Eleuterio F Toro.Riemann solvers and numerical methods for fluid dynamics: a practical introduction. Springer Science & Business Media, 2013

  79. [80]

    Lattice boltzmann method for high reynolds number compressible flow.Computers & Fluids, 249:105701, 2022

    Si Bui Quang Tran, Fong Yew Leong, Quang Tuyen Le, and Duc Vinh Le. Lattice boltzmann method for high reynolds number compressible flow.Computers & Fluids, 249:105701, 2022

  80. [81]

    A review of mathematical topics in collisional kinetic theory.Handbook of mathematical fluid dynamics, 1:71–74, 2002

    C´ edric Villani. A review of mathematical topics in collisional kinetic theory.Handbook of mathematical fluid dynamics, 1:71–74, 2002

Showing first 80 references.