Generative diffusion learning for parametric partial differential equations

Jaroslaw Knap; Petr Plechac; Ting Wang

arxiv: 2305.14703 · v2 · submitted 2023-05-24 · 🧮 math.NA · cs.NA

Generative diffusion learning for parametric partial differential equations

Ting Wang , Petr Plechac , Jaroslaw Knap This is my paper

Pith reviewed 2026-05-24 09:03 UTC · model grok-4.3

classification 🧮 math.NA cs.NA

keywords generative diffusion modelsparametric PDEsoperator learninguncertainty quantificationnoisy datasolution operatorDDPM

0 comments

The pith

A conditional diffusion model learns the solution operator for parameter-dependent PDEs by approximating input-to-output mappings as conditional distributions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a data-driven generative approach that uses denoising diffusion probabilistic models to approximate how parameters determine solutions of PDEs. By casting the operator learning task in probabilistic terms, the method represents the mapping as a conditional distribution rather than a deterministic function. This setup produces confidence intervals for predicted solutions as a direct byproduct and remains usable even when the training outputs contain additive noise. Tests show accuracy on par with Fourier neural operators while also recovering the noise magnitude present in the data.

Core claim

Adapting DDPMs to a supervised conditional setting allows the solution operator for parameter-dependent PDEs to be expressed as a family of conditional distributions that map problem parameters to solution fields, thereby enabling automatic uncertainty quantification and direct applicability to noisy training data.

What carries the argument

Conditional denoising diffusion probabilistic models adapted to supervised learning of the PDE solution operator.

If this is right

The learned solutions achieve accuracy comparable to Fourier neural operators on the tested problems.
Confidence intervals for the solutions are obtained automatically from the probabilistic formulation.
The framework can be trained directly on datasets whose outputs are corrupted by additive noise and recovers the noise magnitude.
Multiple solution realizations can be sampled from the learned conditional distribution for any given parameter set.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same conditioning strategy might transfer to operator learning tasks outside classical PDEs, such as integral or integro-differential equations.
Because the model handles noisy observations, it could be applied to inverse problems where data come from imperfect measurements.
Scalability questions remain open for high-dimensional parameter spaces or problems with complex geometry.

Load-bearing premise

Changing DDPMs into a supervised conditional form will reproduce the true parameter-to-solution mapping of the PDE without adding systematic bias to the learned distributions.

What would settle it

On a parametric PDE with known exact solutions, the statistical distribution of samples drawn from the trained model fails to match the distribution of true solutions for held-out parameters, or the recovered noise level deviates from the added noise in the training set.

Figures

Figures reproduced from arXiv: 2305.14703 by Jaroslaw Knap, Petr Plechac, Ting Wang.

**Figure 2.** Figure 2: 2D joint histograms of the solution field evaluated at [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: Decay of the MRLE with respect to the number of training epochs. [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗

read the original abstract

We develop a class of data-driven generative models that approximate the solution operator for parameter-dependent partial differential equations (PDE). We propose a novel probabilistic formulation of the operator learning problem based on recently developed generative denoising diffusion probabilistic models (DDPM) in order to learn the input-to-output mapping between problem parameters and solutions of the PDE. To achieve this goal we modify DDPM to supervised learning in which the solution operator for the PDE is represented by a class of conditional distributions. The probabilistic formulation combined with DDPM allows for an automatic quantification of confidence intervals for the learned solutions. Furthermore, the framework is directly applicable for learning from a noisy data set. We compare computational performance of the developed method with the Fourier Network Operators (FNO). Our results show that our method achieves comparable accuracy and recovers the noise magnitude when applied to data sets with outputs corrupted by additive noise.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Adapting DDPM to conditional operator learning for parametric PDEs adds uncertainty estimates and noise recovery while matching FNO accuracy on the reported tests.

read the letter

The main thing to know is that this paper adapts denoising diffusion probabilistic models to a supervised conditional setting so they can approximate the solution operator for parameter-dependent PDEs. This gives a generative approach that supplies uncertainty estimates and works directly on noisy data, while matching the accuracy of Fourier neural operators in their tests. They do a solid job laying out the probabilistic formulation and showing how the conditioning on parameters is incorporated into the diffusion process. The noise recovery result is a nice concrete demonstration that the model is capturing the data distribution rather than just overfitting. The soft spots are in the validation. The results are presented without error bars or details on the experimental setup such as dataset sizes, PDE types, or discretization methods, so the claim of comparable accuracy is difficult to scrutinize. It is also not clear how well the uncertainty intervals are calibrated. Overall this is a straightforward application of an existing generative modeling technique to operator learning. It is the sort of work that belongs in a reading group for people following scientific machine learning, and it deserves peer review because the core idea is well-motivated and the implementation appears faithful to the DDPM framework.

Referee Report

2 major / 0 minor

Summary. The manuscript proposes a supervised conditional reformulation of denoising diffusion probabilistic models (DDPM) to approximate the solution operator mapping parameters to solutions for parametric PDEs. It claims this probabilistic approach enables automatic quantification of confidence intervals, handles additive noise in the data, recovers the noise magnitude, and achieves accuracy comparable to Fourier Neural Operators (FNO).

Significance. If the central claims hold with rigorous validation, the work would introduce a generative probabilistic framework for operator learning that naturally incorporates uncertainty quantification, a feature absent from many deterministic operator learners such as FNO. The direct applicability to noisy datasets is a practical strength. However, the abstract provides no quantitative metrics, implementation details, or dataset descriptions, so the significance cannot yet be fully assessed.

major comments (2)

[Abstract] Abstract: the claim that the method 'achieves comparable accuracy' to FNO is presented without any numerical error values, tables, figures, error bars, or dataset descriptions, rendering the performance comparison impossible to evaluate.
[Abstract] Abstract: the modification of DDPM to a supervised conditional distribution setting is described only at a high level; no explicit form of the conditioning mechanism, loss function, or network architecture is supplied, which is load-bearing for assessing whether the learned conditional faithfully represents the PDE solution operator without systematic bias.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful review and constructive comments on our manuscript. The two major comments both concern the abstract; we address each below and will revise the abstract to incorporate additional quantitative detail and a concise description of the conditioning mechanism.

read point-by-point responses

Referee: Abstract: the claim that the method 'achieves comparable accuracy' to FNO is presented without any numerical error values, tables, figures, error bars, or dataset descriptions, rendering the performance comparison impossible to evaluate.

Authors: We agree that the abstract would be strengthened by including specific quantitative metrics. The body of the manuscript reports relative L2 errors on the parametric PDE test cases (e.g., Darcy flow and Navier-Stokes) together with the corresponding FNO baselines and dataset sizes. In the revised version we will add a sentence to the abstract that reports the key error values and briefly identifies the datasets, thereby making the performance claim directly verifiable from the abstract itself. revision: yes
Referee: Abstract: the modification of DDPM to a supervised conditional distribution setting is described only at a high level; no explicit form of the conditioning mechanism, loss function, or network architecture is supplied, which is load-bearing for assessing whether the learned conditional faithfully represents the PDE solution operator without systematic bias.

Authors: The abstract is intentionally concise, yet we acknowledge that a short indication of the conditioning approach would improve transparency. Section 3 of the manuscript defines the conditional DDPM formulation, the loss (a supervised variant of the standard DDPM objective with parameters concatenated to the noisy solution), and the U-Net architecture with parameter embedding. We will revise the abstract to include one additional sentence that states the conditioning is performed by concatenating the PDE parameters to the input of the denoising network and that the training loss is the conditional variant of the variational bound, thereby directing readers to the explicit definitions while keeping the abstract brief. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper adapts the external DDPM framework to a supervised conditional setting for learning PDE solution operators. No equations, fitting procedures, or self-citations are shown that reduce the claimed operator approximation, noise recovery, or uncertainty quantification to a fitted parameter or input by construction. The central claims rest on the probabilistic reformulation and empirical comparisons to FNO, which are independent of the method's internal definitions.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that DDPM can be repurposed for supervised conditional operator learning; no free parameters, new entities, or additional axioms are stated in the abstract.

axioms (1)

domain assumption DDPM can be modified to supervised conditional distributions that represent PDE solution operators
Stated as the core proposal in the abstract.

pith-pipeline@v0.9.0 · 5674 in / 1200 out tokens · 30170 ms · 2026-05-24T09:03:37.397562+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We propose a novel probabilistic formulation of the operator learning problem based on recently developed generative denoising diffusion probabilistic models (DDPM) in order to learn the input-to-output mapping between problem parameters and solutions of the PDE. To achieve this goal we modify DDPM to supervised learning in which the solution operator for the PDE is represented by a class of conditional distributions.
IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

The probabilistic formulation combined with DDPM allows for an automatic quantification of confidence intervals for the learned solutions.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

34 extracted references · 34 canonical work pages · 5 internal anchors

[1]

Batlle, M

P. Batlle, M. Darcy, B. Hosseini, and H. Owhadi. Kernel methods are competitive for operator learning. arXiv preprint arXiv:2304.13202, 2023

work page arXiv 2023
[2]

Bhattacharya, B

K. Bhattacharya, B. Hosseini, N. B. Kovachki, and A. M. Stuart. Model reduction and neural networks for parametric pdes. The SMAI journal of computational mathematics, 7:121–157, 2021

work page 2021
[3]

Chen and H

T. Chen and H. Chen. Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems. IEEE Transactions on Neural Networks, 6(4):911–917, 1995

work page 1995
[4]

De Hoop, D

M. De Hoop, D. Z. Huang, E. Qian, and A. M. Stuart. The cost-accuracy trade-off in operator learning with neural networks. arXiv preprint arXiv:2203.13181, 2022

work page arXiv 2022
[5]

Dhariwal and A

P. Dhariwal and A. Nichol. Diffusion models beat gans on image synthesis.Advances in Neural Information Processing Systems, 34:8780–8794, 2021

work page 2021
[6]

Gupta, X

G. Gupta, X. Xiao, and P. Bogdan. Multiwavelet-based operator learning for differential equations. Advances in neural information processing systems, 34:24048–24062, 2021

work page 2021
[7]

J. S. Hesthaven and S. Ubbiali. Non-intrusive reduced order modeling of nonlinear problems using neural networks. Journal of Computational Physics, 363:55–78, 2018

work page 2018
[8]

J. Ho, A. Jain, and P. Abbeel. Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems, 33:6840–6851, 2020

work page 2020
[9]

J. Ho, C. Saharia, W. Chan, D. J. Fleet, M. Norouzi, and T. Salimans. Cascaded diffusion models for high fidelity image generation. J. Mach. Learn. Res., 23(47):1–33, 2022. 9

work page 2022
[10]

Khrulkov, G

V . Khrulkov, G. Ryzhakov, A. Chertkov, and I. Oseledets. Understanding ddpm latent codes through optimal transport. arXiv preprint arXiv:2202.07477, 2022

work page arXiv 2022
[11]

Kovachki, S

N. Kovachki, S. Lanthaler, and S. Mishra. On universal approximation and error bounds for Fourier neural operators. The Journal of Machine Learning Research, 22(1):13237–13312, 2021

work page 2021
[12]

Kovachki, Z

N. Kovachki, Z. Li, B. Liu, K. Azizzadenesheli, K. Bhattacharya, A. Stuart, and A. Anandkumar. Neural operator: Learning maps between function spaces. arXiv preprint arXiv:2108.08481, 2021

work page arXiv 2021
[13]

Z. Li, N. Kovachki, K. Azizzadenesheli, B. Liu, K. Bhattacharya, A. Stuart, and A. Anandkumar. Fourier neural operator for parametric partial differential equations. arXiv preprint arXiv:2010.08895, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010
[14]

Z. Li, N. Kovachki, K. Azizzadenesheli, B. Liu, K. Bhattacharya, A. Stuart, and A. Anandkumar. Neural operator: Graph kernel network for partial differential equations. arXiv preprint arXiv:2003.03485, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2003
[15]

Z. Li, H. Zheng, N. Kovachki, D. Jin, H. Chen, B. Liu, K. Azizzadenesheli, and A. Anandkumar. Physics- informed neural operator for learning partial differential equations. arXiv preprint arXiv:2111.03794, 2021

work page arXiv 2021
[16]

L. Lu, P. Jin, and G. E. Karniadakis. Deeponet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators. arXiv preprint arXiv:1910.03193, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1910
[17]

L. Lu, X. Meng, S. Cai, Z. Mao, S. Goswami, Z. Zhang, and G. E. Karniadakis. A comprehensive and fair comparison of two neural operators (with practical extensions) based on fair data. Computer Methods in Applied Mechanics and Engineering, 393:114778, 2022

work page 2022
[18]

R. G. Patel, N. A. Trask, M. A. Wood, and E. C. Cyr. A physics-informed operator regression framework for extracting data-driven continuum models. Computer Methods in Applied Mechanics and Engineering, 373:113500, 2021

work page 2021
[19]

Phillips, T

A. Phillips, T. Seror, M. Hutchinson, V . De Bortoli, A. Doucet, and E. Mathieu. Spectral diffusion processes. arXiv preprint arXiv:2209.14125, 2022

work page arXiv 2022
[20]

Rombach, A

R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10684–10695, 2022

work page 2022
[21]

Ronneberger, P

O. Ronneberger, P. Fischer, and T. Brox. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pages 234–241. Springer, 2015

work page 2015
[22]

Sohl-Dickstein, E

J. Sohl-Dickstein, E. Weiss, N. Maheswaranathan, and S. Ganguli. Deep unsupervised learning using nonequilibrium thermodynamics. In International Conference on Machine Learning, pages 2256–2265. PMLR, 2015

work page 2015
[23]

J. Song, C. Meng, and S. Ermon. Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010
[24]

Song and S

Y . Song and S. Ermon. Generative modeling by estimating gradients of the data distribution.Advances in Neural Information Processing Systems, 32, 2019

work page 2019
[25]

Y . Song, J. Sohl-Dickstein, D. P. Kingma, A. Kumar, S. Ermon, and B. Poole. Score-based generative modeling through stochastic differential equations. arXiv preprint arXiv:2011.13456, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2011
[26]

L. Sun, H. Gao, S. Pan, and J.-X. Wang. Surrogate modeling for fluid flows based on physics-constrained deep learning without simulation data. Computer Methods in Applied Mechanics and Engineering , 361:112732, 2020

work page 2020
[27]

R. Wang, K. Kashinath, M. Mustafa, A. Albert, and R. Yu. Towards physics-informed deep learning for turbulent flow prediction. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 1457–1466, 2020

work page 2020
[28]

S. Wang, Y . Teng, and P. Perdikaris. Understanding and mitigating gradient flow pathologies in physics- informed neural networks. SIAM Journal on Scientific Computing, 43(5):A3055–A3081, 2021

work page 2021
[29]

S. Wang, H. Wang, and P. Perdikaris. Learning the solution operator of parametric partial differential equations with physics-informed deeponets. Science advances, 7(40):eabi8605, 2021

work page 2021
[30]

S. Wang, X. Yu, and P. Perdikaris. When and why pinns fail to train: A neural tangent kernel perspective. Journal of Computational Physics, 449:110768, 2022

work page 2022
[31]

Wang and J

T. Wang and J. Knap. Stochastic deep-ritz for parametric uncertainty quantification. arXiv preprint arXiv:2206.00867, 2022

work page arXiv 2022
[32]

C. K. Williams and C. E. Rasmussen. Gaussian processes for machine learning, volume 2. MIT press Cambridge, MA, 2006. 10

work page 2006
[33]

Zhu and N

Y . Zhu and N. Zabaras. Bayesian deep convolutional encoder–decoder networks for surrogate modeling and uncertainty quantification. Journal of Computational Physics, 366:415–447, 2018

work page 2018
[34]

Y . Zhu, N. Zabaras, P.-S. Koutsourelakis, and P. Perdikaris. Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data. Journal of Computational Physics, 394:56–81, 2019. A Key notations For quick reference, important notations are summarized in the following table. Table A1: Table of no...

work page 2019

[1] [1]

Batlle, M

P. Batlle, M. Darcy, B. Hosseini, and H. Owhadi. Kernel methods are competitive for operator learning. arXiv preprint arXiv:2304.13202, 2023

work page arXiv 2023

[2] [2]

Bhattacharya, B

K. Bhattacharya, B. Hosseini, N. B. Kovachki, and A. M. Stuart. Model reduction and neural networks for parametric pdes. The SMAI journal of computational mathematics, 7:121–157, 2021

work page 2021

[3] [3]

Chen and H

T. Chen and H. Chen. Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems. IEEE Transactions on Neural Networks, 6(4):911–917, 1995

work page 1995

[4] [4]

De Hoop, D

M. De Hoop, D. Z. Huang, E. Qian, and A. M. Stuart. The cost-accuracy trade-off in operator learning with neural networks. arXiv preprint arXiv:2203.13181, 2022

work page arXiv 2022

[5] [5]

Dhariwal and A

P. Dhariwal and A. Nichol. Diffusion models beat gans on image synthesis.Advances in Neural Information Processing Systems, 34:8780–8794, 2021

work page 2021

[6] [6]

Gupta, X

G. Gupta, X. Xiao, and P. Bogdan. Multiwavelet-based operator learning for differential equations. Advances in neural information processing systems, 34:24048–24062, 2021

work page 2021

[7] [7]

J. S. Hesthaven and S. Ubbiali. Non-intrusive reduced order modeling of nonlinear problems using neural networks. Journal of Computational Physics, 363:55–78, 2018

work page 2018

[8] [8]

J. Ho, A. Jain, and P. Abbeel. Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems, 33:6840–6851, 2020

work page 2020

[9] [9]

J. Ho, C. Saharia, W. Chan, D. J. Fleet, M. Norouzi, and T. Salimans. Cascaded diffusion models for high fidelity image generation. J. Mach. Learn. Res., 23(47):1–33, 2022. 9

work page 2022

[10] [10]

Khrulkov, G

V . Khrulkov, G. Ryzhakov, A. Chertkov, and I. Oseledets. Understanding ddpm latent codes through optimal transport. arXiv preprint arXiv:2202.07477, 2022

work page arXiv 2022

[11] [11]

Kovachki, S

N. Kovachki, S. Lanthaler, and S. Mishra. On universal approximation and error bounds for Fourier neural operators. The Journal of Machine Learning Research, 22(1):13237–13312, 2021

work page 2021

[12] [12]

Kovachki, Z

N. Kovachki, Z. Li, B. Liu, K. Azizzadenesheli, K. Bhattacharya, A. Stuart, and A. Anandkumar. Neural operator: Learning maps between function spaces. arXiv preprint arXiv:2108.08481, 2021

work page arXiv 2021

[13] [13]

Z. Li, N. Kovachki, K. Azizzadenesheli, B. Liu, K. Bhattacharya, A. Stuart, and A. Anandkumar. Fourier neural operator for parametric partial differential equations. arXiv preprint arXiv:2010.08895, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010

[14] [14]

Z. Li, N. Kovachki, K. Azizzadenesheli, B. Liu, K. Bhattacharya, A. Stuart, and A. Anandkumar. Neural operator: Graph kernel network for partial differential equations. arXiv preprint arXiv:2003.03485, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2003

[15] [15]

Z. Li, H. Zheng, N. Kovachki, D. Jin, H. Chen, B. Liu, K. Azizzadenesheli, and A. Anandkumar. Physics- informed neural operator for learning partial differential equations. arXiv preprint arXiv:2111.03794, 2021

work page arXiv 2021

[16] [16]

L. Lu, P. Jin, and G. E. Karniadakis. Deeponet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators. arXiv preprint arXiv:1910.03193, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1910

[17] [17]

L. Lu, X. Meng, S. Cai, Z. Mao, S. Goswami, Z. Zhang, and G. E. Karniadakis. A comprehensive and fair comparison of two neural operators (with practical extensions) based on fair data. Computer Methods in Applied Mechanics and Engineering, 393:114778, 2022

work page 2022

[18] [18]

R. G. Patel, N. A. Trask, M. A. Wood, and E. C. Cyr. A physics-informed operator regression framework for extracting data-driven continuum models. Computer Methods in Applied Mechanics and Engineering, 373:113500, 2021

work page 2021

[19] [19]

Phillips, T

A. Phillips, T. Seror, M. Hutchinson, V . De Bortoli, A. Doucet, and E. Mathieu. Spectral diffusion processes. arXiv preprint arXiv:2209.14125, 2022

work page arXiv 2022

[20] [20]

Rombach, A

R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10684–10695, 2022

work page 2022

[21] [21]

Ronneberger, P

O. Ronneberger, P. Fischer, and T. Brox. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pages 234–241. Springer, 2015

work page 2015

[22] [22]

Sohl-Dickstein, E

J. Sohl-Dickstein, E. Weiss, N. Maheswaranathan, and S. Ganguli. Deep unsupervised learning using nonequilibrium thermodynamics. In International Conference on Machine Learning, pages 2256–2265. PMLR, 2015

work page 2015

[23] [23]

J. Song, C. Meng, and S. Ermon. Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010

[24] [24]

Song and S

Y . Song and S. Ermon. Generative modeling by estimating gradients of the data distribution.Advances in Neural Information Processing Systems, 32, 2019

work page 2019

[25] [25]

Y . Song, J. Sohl-Dickstein, D. P. Kingma, A. Kumar, S. Ermon, and B. Poole. Score-based generative modeling through stochastic differential equations. arXiv preprint arXiv:2011.13456, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2011

[26] [26]

L. Sun, H. Gao, S. Pan, and J.-X. Wang. Surrogate modeling for fluid flows based on physics-constrained deep learning without simulation data. Computer Methods in Applied Mechanics and Engineering , 361:112732, 2020

work page 2020

[27] [27]

R. Wang, K. Kashinath, M. Mustafa, A. Albert, and R. Yu. Towards physics-informed deep learning for turbulent flow prediction. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 1457–1466, 2020

work page 2020

[28] [28]

S. Wang, Y . Teng, and P. Perdikaris. Understanding and mitigating gradient flow pathologies in physics- informed neural networks. SIAM Journal on Scientific Computing, 43(5):A3055–A3081, 2021

work page 2021

[29] [29]

S. Wang, H. Wang, and P. Perdikaris. Learning the solution operator of parametric partial differential equations with physics-informed deeponets. Science advances, 7(40):eabi8605, 2021

work page 2021

[30] [30]

S. Wang, X. Yu, and P. Perdikaris. When and why pinns fail to train: A neural tangent kernel perspective. Journal of Computational Physics, 449:110768, 2022

work page 2022

[31] [31]

Wang and J

T. Wang and J. Knap. Stochastic deep-ritz for parametric uncertainty quantification. arXiv preprint arXiv:2206.00867, 2022

work page arXiv 2022

[32] [32]

C. K. Williams and C. E. Rasmussen. Gaussian processes for machine learning, volume 2. MIT press Cambridge, MA, 2006. 10

work page 2006

[33] [33]

Zhu and N

Y . Zhu and N. Zabaras. Bayesian deep convolutional encoder–decoder networks for surrogate modeling and uncertainty quantification. Journal of Computational Physics, 366:415–447, 2018

work page 2018

[34] [34]

Y . Zhu, N. Zabaras, P.-S. Koutsourelakis, and P. Perdikaris. Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data. Journal of Computational Physics, 394:56–81, 2019. A Key notations For quick reference, important notations are summarized in the following table. Table A1: Table of no...

work page 2019