pith. sign in

arxiv: 2605.17107 · v1 · pith:OGOGYP2Znew · submitted 2026-05-16 · 📊 stat.ML · cs.LG· math.OC· math.PR

Diffusion-Based Stochastic Operator Networks for Uncertainty Quantification in Stochastic Partial Differential Equations

Pith reviewed 2026-05-20 14:57 UTC · model grok-4.3

classification 📊 stat.ML cs.LGmath.OCmath.PR
keywords stochastic partial differential equationsoperator learninguncertainty quantificationdeep operator networksstochastic neural networks
0
0 comments X

The pith

Stochastic Operator Networks learn solution operators for SPDEs and quantify uncertainty directly from noisy data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces the Stochastic Operator Network to learn solution operators for stochastic partial differential equations without first specifying the noise structure. It builds this by merging the Deep Operator Network architecture with stochastic neural networks, then trains the model by minimizing a Hamiltonian-type loss through the Stochastic Maximum Principle. The result is a method that produces both a predicted mean field and a measure of predictive uncertainty. A reader would care if this holds because many real-world models involve unknown or hard-to-measure uncertainties, and learning them from data could reduce reliance on explicit assumptions about noise.

Core claim

The Stochastic Operator Network is formed by combining the Deep Operator Network structure with Stochastic Neural Networks to model stochasticity and enable probabilistic predictions of SPDE solution operators. Training proceeds by minimizing a Hamiltonian-type loss and optimizing the objective with the Stochastic Maximum Principle. Numerical experiments on benchmark SPDEs with multiple uncertainty sources show the approach captures solution structure and quantifies predictive uncertainty accurately and robustly.

What carries the argument

The Stochastic Operator Network (SON), which merges the DeepONet architecture with Stochastic Neural Networks and trains via minimization of a Hamiltonian-type loss using the Stochastic Maximum Principle.

If this is right

  • The method produces accurate mean solution fields on standard SPDE test problems.
  • It quantifies predictive uncertainty reliably when multiple sources of uncertainty are present.
  • It operates directly on noisy measurements without requiring a separate noise model to be supplied.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same architecture might be tested on inverse problems where the goal is to infer uncertain parameters from observations.
  • Extensions could examine performance on SPDEs with non-Gaussian or spatially correlated noise.
  • Integration with existing physics-informed neural network techniques could further constrain the learned operators.

Load-bearing premise

That combining DeepONet with stochastic neural networks and training through a Hamiltonian loss optimized by the Stochastic Maximum Principle yields well-calibrated uncertainty estimates that generalize across different SPDE problems without an explicit noise model.

What would settle it

Apply the trained SON to a new benchmark SPDE whose true noise statistics are known, then check whether the predicted uncertainty bands contain the actual solution errors at the claimed rate, for instance whether 95 percent intervals contain the true errors in 95 percent of cases.

Figures

Figures reproduced from arXiv: 2605.17107 by Feng Bao, Phuoc-Toan Huynh, Richard Archibald.

Figure 1
Figure 1. Figure 1: [Reaction-Diffusion] (Left) MSE loss from Phase 1. (Right) MSE loss and Hamiltonian loss (14) from Phase 2. We next investigate the predictive performance of the SON model. We evaluate the performance of SON in terms of both predictive accuracy of the solution profiles and the quality of uncertainty 12 [PITH_FULL_IMAGE:figures/full_fig_p012_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: [Reaction-Diffusion] Solution prediction: (Left) Reference solution mean . (Middle) Predicted solution mean. (Right) Prediction mean error. quantification. To this end, we randomly select one input function from the testing dataset and generate 400 predicted samples by using the trained SON. For the same input, we also generate 400 reference solution samples from the corresponding SPDE solver. The left and… view at source ↗
Figure 3
Figure 3. Figure 3 [PITH_FULL_IMAGE:figures/full_fig_p014_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: [Advection-Diffusion Case 1] Solution prediction: (Left) Reference solution mean, (Middle) SON predicted solution mean, (Right) SON prediction mean error [PITH_FULL_IMAGE:figures/full_fig_p015_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: [Advection-Diffusion Case 1] Std estimation: (Left) Reference std, (Middle) SON prediction std, (Right) SON std prediction error. To further support this observation, we plot multiple cross-sections of the reference and predicted means and display their corresponding confidence bands in [PITH_FULL_IMAGE:figures/full_fig_p015_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: [Advection-Diffusion Case 1] Cross-sections at columns 1, 11, 16, 26, 33, and 39 of the reference solution and approximation. each kth entry for k = 1, . . . , r takes the form (σ(A, θn,g))k ≡ (σ(A, θg))k = X Lg l=1 bl,k µ(cl,kA), n = 0, . . . , N − 1, (23) where Lg = 4, and for all k = 1, . . . , r, {bl,k} Lg l=1 and {cl,k} Lg l=1 are sampled uniformly from the intervals [0.035, 0.1] and [0.2, 0.5], respe… view at source ↗
Figure 7
Figure 7. Figure 7: [Advection-Diffusion Case 2] Heatmaps of mean errors: (Left) DeepONet; (Right) SON. We first compare the predictive accuracy between the deterministic DeepONet and the SON. In SON, Phase I and Phase II are trained for 500 and 200 epochs, respectively, for a total of 700 training epochs, while the DeepONet baseline is trained for 500 [PITH_FULL_IMAGE:figures/full_fig_p016_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: [Advection-Diffusion Case 2] Cross-sections of the predicted solutions with quantitative uncertainty with three different testing inputs: (First row) DeepONet; (Second row) SON. SON achieves higher predictive accuracy than the deterministic DeepONet, and it provides reliable uncertainty quantification [PITH_FULL_IMAGE:figures/full_fig_p017_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: [Advection-Diffusion Case 2] Std estimation: (Left) Reference std, (Middle) SON prediction std, (Right) Std prediction Error. To better illustrate SON’s capability in uncertainty quantification, for a representative testing input we plot cross-sections of the reference and predicted means at several selected spatial columns in the solution domain, along with their corresponding confidence bands, in [PITH_… view at source ↗
Figure 10
Figure 10. Figure 10: [Advection-Diffusion Case 2] Cross-sections at columns 1, 11, 16, 26, 33, and 40 of the reference solution and approximation. see that the mean of the reference samples is accurately predicted. We also want to point out that the confidence bands exhibit non-uniform behavior, being narrower in some regions than in others, which makes Case 2 more challenging than Case 1. Although the predicted bands do not … view at source ↗
Figure 11
Figure 11. Figure 11: [Advection-Diffusion Case 2] Errors in the sample mean and std of the B-DeepONet predictions are significantly larger than those of SON under the same input data and training epochs, as shown in [PITH_FULL_IMAGE:figures/full_fig_p018_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: [Advection-Diffusion Case 2] Cross-sections at columns 1, 11, 16, 26, 33, and 40 of the B-DeepONet prediction, using the same input as in [PITH_FULL_IMAGE:figures/full_fig_p019_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: [Heat Equation] DeepONet vs. SON. Cross-sections of the predicted solutions at the boundary (column 1) and time steps tm for m = 1, 10, 20, 30: (First row) DeepONet; (Second row) SON. SON not only provides accurate solution predictions and reliable uncertainty estimates, but also achieves higher predictive accuracy than the deterministic DeepONet. shown in [PITH_FULL_IMAGE:figures/full_fig_p021_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: [Heat Equation] SON performance: (Left) Reference solution at the final time; (Middle) SON predicted sample mean at the final time.(Right) SON prediction errors [PITH_FULL_IMAGE:figures/full_fig_p021_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: [Heat Equation] SON performance: (Left) Reference std at the final time. (Middle) SON output sample std at the final time.(Right) SON std prediction error. 21 [PITH_FULL_IMAGE:figures/full_fig_p021_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: [Heat Equation] Cross-sections of the SON predicted solutionand its associated uncertainty at time steps tm for m = 1, 10, 20, 30: (Top row) Column 1; (Middle row) Column 16; (Bottom row) Column 31. 3.4. Burgers’ equation In this numerical example, we consider a more challenging problem: the 2D Burgers’ equation, ut +  u 2 2  x +  u 2 2  y = 0, (x, y, t) ∈ (−1, 1)2 × (0, T), u(x, y, 0) = u0(x, y), x ∈… view at source ↗
Figure 17
Figure 17. Figure 17: [Burger Equation Case 1] Prediction mean at final time: (Left) Reference mean; (Middle) Prediction mean; (Right) Prediction error. We then examine SON’s predictive performance more closely by plotting cross-sections of predicted sample means and their corresponding confidence bands at x = 0, x = 25 and x = 40 for time steps m = 1, 14, 27 and 40 in [PITH_FULL_IMAGE:figures/full_fig_p023_17.png] view at source ↗
Figure 18
Figure 18. Figure 18: [Burger Equation Case 1] Prediction std at final time: (Left) Reference std. (Middle) Prediction std. (Third) Std estimation errors [PITH_FULL_IMAGE:figures/full_fig_p024_18.png] view at source ↗
Figure 19
Figure 19. Figure 19: [Burger Equation Case 1] Cross-sections of the predicted solution at time steps tm for m = 1, 14, 27, 40. (First row) Column 1. (Second row) Column 25. (Third row) Column 40. tm for m = 1, 14, 27, 40. We visualize these averages as heatmaps in Figures 20 and 21. These results demonstrate the consistent accuracy and reliability of the SON model in capturing the solution’s spatial structure and in quantifyi… view at source ↗
Figure 20
Figure 20. Figure 20: [Burger Equation Case 1] Average mean error over 8 inputs at time step tm for m = 1, 14, 27, 40 [PITH_FULL_IMAGE:figures/full_fig_p025_20.png] view at source ↗
Figure 21
Figure 21. Figure 21: [Burger Equation Case 1] Average std error over 8 inputs at time step tm for m = 1, 14, 27, 40. where σm = [σm,1, . . . , σm,H ] ∈ R N is a matrix and ⊙ denotes element-wise multiplication. For all time steps m, the entries of σm are sampled from U(0.15, 0.25) [PITH_FULL_IMAGE:figures/full_fig_p025_21.png] view at source ↗
Figure 22
Figure 22. Figure 22: [Burger Equation Case 2] Prediction performance. (Left) Prediction mean at final time. (Middle) Prediction mean error. (Right) Prediction std error. We first present the performance of SON for a randomly selected representative input. For this input, we generate 400 reference and predicted solution samples and compute the corresponding sample means and stds [PITH_FULL_IMAGE:figures/full_fig_p025_22.png] view at source ↗
Figure 23
Figure 23. Figure 23: [Burger Equation Case 2] Cross-sections of the predicted solution in Figure (22) at time steps tm for m = 1, 14, 27, 40. (First row) Column 1. (Second row) Column 25. (Third row) Column 40 [PITH_FULL_IMAGE:figures/full_fig_p026_23.png] view at source ↗
Figure 24
Figure 24. Figure 24: [Burger Equation Case 2] Average mean error over 8 inputs at time step tm for m = 1, 14, 27, 40 [PITH_FULL_IMAGE:figures/full_fig_p026_24.png] view at source ↗
Figure 25
Figure 25. Figure 25: [Burger Equation Case 2] Average std error over 8 inputs at time step tm for m = 1, 14, 27, 40. 4. Conclusion In this work, we introduced the Stochastic Operator Network (SON) framework for learning solution operators associated with various SPDEs and for quantifying intrinsic uncertainty in model predictions. SON is derived from the DeepONet architecture by replacing the branch network with a Stochastic … view at source ↗
read the original abstract

We introduce a novel framework for uncertainty quantification of solution operators associated with stochastic partial differential equations (SPDEs). Although SPDEs play a central role in modeling complex physical systems under uncertainty, their practical use typically requires specifying the magnitude and structure of model uncertainties that are often unknown and difficult to infer from noisy measurements. To address this challenge, we develop a stochastic operator-learning framework that learns directly from noisy data and outputs both a mean solution field and a quantification of uncertainty. The proposed method, namely the Stochastic Operator Network (SON), is constructed by combining the structure of the Deep Operator Network (DeepONet) with Stochastic Neural Networks (SNNs) to model stochasticity and enable probabilistic prediction. The training procedure is carried out by minimizing a Hamiltonian-type loss and optimizing the resulting objective using the Stochastic Maximum Principle. Numerical experiments on benchmark SPDEs under multiple uncertainty sources demonstrate the accuracy and robustness of the proposed method in capturing solution structure and quantifying predictive uncertainty.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces the Stochastic Operator Network (SON) for uncertainty quantification in SPDEs. It combines the DeepONet architecture with Stochastic Neural Networks to model stochasticity, and trains the model by minimizing a Hamiltonian-type loss optimized via the Stochastic Maximum Principle. This allows learning both mean solution operators and predictive uncertainty directly from noisy data without an explicit noise model. Numerical experiments on benchmark problems such as the stochastic heat equation and Burgers equation under multiple uncertainty sources are used to demonstrate accuracy in capturing solution structure and robustness in uncertainty quantification.

Significance. If the central claim holds, the framework would offer a practical advance in operator learning for SPDEs by enabling data-driven UQ when noise structure is unknown, which is common in physical modeling. The integration of DeepONet with SNNs and SMP-based optimization is a novel technical contribution to probabilistic operator networks. The work is grounded in learning from data rather than assuming a specific noise law, which aligns with real-world needs.

major comments (2)
  1. [Numerical Experiments] Numerical Experiments section: results are limited to pointwise errors and visual uncertainty bands on standard benchmarks; no proper scoring rules, coverage probabilities, calibration plots, or out-of-distribution tests (e.g., deliberately changing from additive to multiplicative noise) are reported. This leaves the claim of well-calibrated predictive uncertainty without formal verification.
  2. [Method] Method and Experiments: the implicit assumption that SNN stochasticity optimized under the SMP-derived Hamiltonian loss recovers accurate posterior predictive distributions for unknown noise structures is load-bearing for the central claim, yet no sensitivity analysis or formal checks against violated modeling assumptions are provided.
minor comments (2)
  1. [Numerical Experiments] Add explicit details on data splits, training/validation/test ratios, and any error bars or statistical significance measures for the reported errors.
  2. [Related Work] Clarify notation for the stochastic components in the SON architecture relative to prior stochastic DeepONet variants.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. The comments identify opportunities to strengthen the empirical validation of uncertainty calibration and to provide additional robustness checks. We address each major comment below and describe the revisions we will implement.

read point-by-point responses
  1. Referee: [Numerical Experiments] Numerical Experiments section: results are limited to pointwise errors and visual uncertainty bands on standard benchmarks; no proper scoring rules, coverage probabilities, calibration plots, or out-of-distribution tests (e.g., deliberately changing from additive to multiplicative noise) are reported. This leaves the claim of well-calibrated predictive uncertainty without formal verification.

    Authors: We agree that the current numerical results rely primarily on pointwise errors and visual inspection of uncertainty bands. To provide a more rigorous assessment of calibration, we will augment the Numerical Experiments section with the Continuous Ranked Probability Score (CRPS), empirical coverage probabilities at multiple nominal levels, and reliability (calibration) diagrams. We will also add an out-of-distribution experiment that deliberately changes the noise structure (additive to multiplicative) on the stochastic heat and Burgers equations. These additions will be included in the revised manuscript. revision: yes

  2. Referee: [Method] Method and Experiments: the implicit assumption that SNN stochasticity optimized under the SMP-derived Hamiltonian loss recovers accurate posterior predictive distributions for unknown noise structures is load-bearing for the central claim, yet no sensitivity analysis or formal checks against violated modeling assumptions are provided.

    Authors: The Stochastic Maximum Principle supplies a principled optimality condition for the Hamiltonian loss that enables learning without an explicit parametric noise model; this is the theoretical motivation for the framework. Nevertheless, we recognize that empirical sensitivity checks are important for practical credibility. In the revised manuscript we will add a dedicated sensitivity study that perturbs the noise structure and distribution during training and reports the resulting changes in predictive mean and uncertainty metrics. These results will be presented alongside the existing benchmark experiments. revision: yes

Circularity Check

0 steps flagged

No circularity: method is a data-driven combination of architectures with external benchmark validation

full rationale

The paper presents a Stochastic Operator Network formed by combining DeepONet with Stochastic Neural Networks, trained via a Hamiltonian-type loss derived from the Stochastic Maximum Principle. This is a constructive modeling choice applied to noisy data, followed by numerical experiments on standard SPDE benchmarks. No step reduces a claimed prediction to a fitted input by construction, invokes a self-citation as the sole justification for uniqueness, or renames an empirical pattern as a first-principles result. The derivation chain remains self-contained because the architecture and loss are explicitly assembled from known components and then evaluated against independent benchmark problems rather than being tautological with the training data.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

Because only the abstract is available, the ledger is necessarily incomplete and inferred from the high-level description; the central claim rests on the unstated assumption that the Stochastic Maximum Principle optimization converges to a useful posterior over operators and that the stochastic neural network parameterization is sufficiently expressive for the target SPDEs.

axioms (1)
  • domain assumption The Hamiltonian-type loss combined with the Stochastic Maximum Principle yields a well-defined optimization problem whose solution corresponds to accurate mean and uncertainty estimates for the SPDE solution operator.
    Invoked in the description of the training procedure.
invented entities (1)
  • Stochastic Operator Network (SON) no independent evidence
    purpose: To model stochasticity in operator learning for SPDEs and enable probabilistic predictions.
    New named architecture introduced in the abstract.

pith-pipeline@v0.9.0 · 5703 in / 1347 out tokens · 43418 ms · 2026-05-20T14:57:15.349086+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

61 extracted references · 61 canonical work pages

  1. [1]

    Abgrall and P

    R. Abgrall and P. M. Congedo. A semi-intrusive determini stic approach to uncertainty quantifica- tion in non-linear fluid flow problems. Journal of Computational Physics , 235:828–845, 2013

  2. [2]

    Abgrall and S

    R. Abgrall and S. Tokareva. The stochastic finite volume m ethod. In S. Jin and L. Pareschi, editors, Uncertainty Quantification for Hyperbolic and Kinetic Equa tions, pages 1–57. Springer International Publishing, Cham, 2017

  3. [3]

    Andersson

    D. Andersson. Contributions to the Stochastic Maximum Principle . PhD thesis, KTH Royal Institute of Technology, Sweden, 2009

  4. [4]

    Archibald and F

    R. Archibald and F. Bao. Kernel learning backward sde filt er for data assimilation. J. Comput. Phys., 455(3):111009, 2022

  5. [5]

    Archibald, F

    R. Archibald, F. Bao, Y. Cao, and H. Sun. Numerical analys is for convergence of a sample-wise backpropagation method for training stochastic neural net works. SIAM Journal on Numerical Analysis, 62(2):593–621, 2024

  6. [6]

    Archibald, F

    R. Archibald, F. Bao, Y. Cao, and H. Zhang. A backward sde m ethod for uncertainty quantification in deep learning. Discrete and Continuous Dynamical Systems - S , 15(10):2807–2835, 2022

  7. [7]

    Archibald, F

    R. Archibald, F. Bao, and J. Yong. A stochastic gradient d escent approach for stochastic optimal control. East Asian Journal on Applied Mathematics , 10(4):635–658, 2020

  8. [8]

    Archibald, F

    R. Archibald, F. Bao, J. Yong, and T. Zhou. An efficient nume rical algorithm for solving data driven feedback control problems. Journal of Scientific Computing , 85(51), 2020

  9. [9]

    F. Bao, Y. Cao, A. Meir, and W. Zhao. A first order scheme for backward doubly stochastic differential equations. SIAM/ASA Journal on Uncertainty Quantification , 4(1):413–445, 2016

  10. [10]

    F. Bao, Y. Cao, and J. Yong. Data informed solution estim ation for forward-backward stochastic differential equations. Analysis and Applications , 19(3):439–464, 2021. 27

  11. [11]

    L´ evy backward sde filter for jump diffusion processes and its applications in material sciences

    Feng Bao, Richard Archibald, and Peter Maksymovych. L´ evy backward sde filter for jump diffusion processes and its applications in material sciences. Communications in Computational Physics , 27(2):589–618, Dec. 2019

  12. [12]

    Adjoint forward backward stochastic differential equations driven by jump diffusion processes and its applicat ion to nonlinear filtering problems

    Feng Bao, Yanzhao Cao, and Hongmei Chi. Adjoint forward backward stochastic differential equations driven by jump diffusion processes and its applicat ion to nonlinear filtering problems. International Journal for Uncertainty Quantification , 9(2):143–159, 2019

  13. [13]

    A hybrid sparse-grid approach for nonlinear filtering problems based on adaptive-domain o f the zakai equation approximations

    Feng Bao, Yanzhao Cao, Clayton Webster, and Guannan Zha ng. A hybrid sparse-grid approach for nonlinear filtering problems based on adaptive-domain o f the zakai equation approximations. SIAM/ASA Journal on Uncertainty Quantification , 2(1):784–804, 2014

  14. [14]

    Numerical solu tions for forward backward doubly stochastic differential equations and zakai equations

    Feng Bao, Yanzhao Cao, and Weidong Zhao. Numerical solu tions for forward backward doubly stochastic differential equations and zakai equations. Visualization of Mechanical Processes: An International Online Journal , 1(4):351–367, 2011

  15. [15]

    A first order sem i-discrete algorithm for backward dou- bly stochastic differential equations

    Feng Bao, Yanzhao Cao, and Weidong Zhao. A first order sem i-discrete algorithm for backward dou- bly stochastic differential equations. Discrete and Continuous Dynamical Systems - B , 20(5):1297– 1313, 2015

  16. [16]

    Adaptive meshfree bac kward sde filter

    Feng Bao and Vasileios Maroulas. Adaptive meshfree bac kward sde filter. SIAM Journal on Scientific Computing , 39(6):A2664–A2683, 2017

  17. [17]

    D. A. Barajas-Solano and D. M. Tartakovsky. Stochastic collocation methods for nonlinear parabolic equations with random coefficients. SIAM/ASA Journal on Uncertainty Quantification , 4(1):475–494, 2016

  18. [18]

    T. Barth. On the propagation of statistical model param eter uncertainty in cfd calculations. Theoretical and Computational Fluid Dynamics , 26(5):435–457, 2012

  19. [19]

    Bausback, J

    R. Bausback, J. Tang, Lu Lu, F. Bao, and P-.T. Huynh. Stoc hastic operator network: A stochastic maximum principle based approach to operator learning. Journal of Machine Learning , 2026

  20. [20]

    Bhattacharya, B

    K. Bhattacharya, B. Hosseini, N. B. Kovachki, and A. M. S tuart. Model reduction and neural networks for parametric pdes. The SMAI Journal of computational mathematics , 7:121–157, 2021

  21. [21]

    Bottou, F

    L. Bottou, F. E. Curtis, and J. Nocedal. Optimization me thods for large-scale machine learning. SIAM Review , 60(2):223–311, 2018

  22. [22]

    Brunner, A

    F. Brunner, A. F. Radu, and P. Knabner. Analysis of an upw ind-mixed hybrid finite element method for transport problems. SIAM Journal on Numerical Analysis , 52(1):83–102, 2014

  23. [23]

    B. Chen, C. Wang, W. Li, and H. Fu. A hybrid decoder-deepo net operator regression framework for unaligned observation data. Physics of Fluids , 36(2):027132, 2 2024

  24. [24]

    R.T. Q. Chen, Y. Rubanova, J. Bettencourt, and D. Duvena ud. Neural ordinary differential equa- tions. In Proceedings of the 32nd International Conference on Neural Inf ormation Processing Sys- tems, NIPS’18, page 6572–6583, Red Hook, NY, USA, 2018. Curran As sociates Inc

  25. [25]

    Chen and H

    T. Chen and H. Chen. Universal approximation to nonline ar operators by neural networks with arbitrary activation functions and its application to dyna mical systems. IEEE transactions on neural netwrosk, 6(4):911–917, 1995. 28

  26. [26]

    Cockburn and C.-W

    B. Cockburn and C.-W. Shu. The runge-kutta local projec tion-discontinuous-galerkin finite element method for scalar conservation laws. ESAIM: Math. Model. Numer. Anal. , 25(3):337–361, 1991

  27. [27]

    Dupont, A

    E. Dupont, A. Doucet, and Y. W. Teh. Augmented neural ode s. In Advances in Neural Information Processing Systems, volume 32, pages 3140–3150, Red Hook, NY, 2019. Curran Asso ciates

  28. [28]

    W. H. Fleming and R. W. Rishel. Deterministic and Stochastic Optimal Control , volume 1 of Applications of Mathematics . Springer, New York, 1975

  29. [29]

    W. H. Fleming and H. M. Soner. Controlled Markov Processes and Viscosity Solutions , volume 25 of Stochastic Modelling and Applied Probability . Springer, New York, 2 edition, 2006

  30. [30]

    Geraci, P

    G. Geraci, P. M. Congedo, R. Abgrall, and G. Iaccarino. I ntrusive non-linear multiresolution frame- work for uncertainty quantification in hyperbolic partial d ifferential equations. J. Sci. Comput. , 66:358–405, 2016

  31. [31]

    Gerstberger and P

    R. Gerstberger and P. Rentrop. Feedforward neural nets as discretization schemes for odes and daes. J. Comput. Appl. Math. , 82:117–128, 1997

  32. [32]

    Gottlieb and C.-W

    S. Gottlieb and C.-W. Shu. Total variation diminishing runge-kutta schemes. Math. Comp. , 67(221):73–85, 1998

  33. [33]

    L. Guo, H. Wu, Y. Wang, W. Zhou, and T. Zhou. Ib-uq: Inform ation bottleneck based uncertainty quantification for neural function regression and neural op erator learning. Journal of Computational Physics, 510:113089, 2024

  34. [34]

    Haber and L

    E. Haber and L. Ruthotto. Stable architectures for deep neural networks. Inverse Problems , 34:014004, 2018

  35. [35]

    J. J. Harmon, S. Tokareva, A. Zlotnik, and P. J. Swart. Ad aptive uncertainty quantification for stochastic hyperbolic conservation laws. SIAM/ASA Journal on Uncertainty Quantification , 13(2):339–374, 2025

  36. [36]

    K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learnin g for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recogniti on, pages 770–778, 2016

  37. [37]

    K. He, X. Zhang, S. Ren, and J. Sun. Identity mappings in d eep residual networks. In European Conference on Computer Vision , pages 630–645. Springer, 2016

  38. [38]

    Jia and A

    J. Jia and A. R. Benson. Neural jump stochastic differenti al equations. In Advances in Neural Information Processing Systems , volume 32, pages 9847–9858, 2019

  39. [39]

    J. Jung, H. Shin, and M. Choi. Bayesian deep learning fra mework for uncertainty quantification in stochastic partial differential equations. SIAM Journal on Scientific Computing , 46(1):C57–C76, 2024

  40. [40]

    Knio and O.P

    O.M. Knio and O.P. Le Ma ˆ ıtre. Uncertainty propagation in cfd using polynomial chaos decom- position. Fluid Dynamics Research , 38(9):616–640, 2006. Recent Topics in Computational Flui d Dynamics

  41. [41]

    L. Kong, J. Sun, and C. Zhang. SDE-Net: Equipping deep ne ural networks with uncertainty estimates. In Proceedings of the 37th International Conference on Machine L earning, volume 119 of Proceedings of Machine Learning Research , pages 5405–5415, 2020. 29

  42. [42]

    Assimilating partial observation to enhance feedback control of stochastic dyna mical systems

    Siming Liang, Ruoyu Hu, Feng Bao, Richard Archibald, an d Guannan Zhang. Assimilating partial observation to enhance feedback control of stochastic dyna mical systems. Foundations of Data Science, 9:1–33, 2026

  43. [43]

    Convergence analysis for an online data-driven feedback control algorithm

    Siming Liang, Hui Sun, Richard Archibald, and Feng Bao. Convergence analysis for an online data-driven feedback control algorithm. Mathematics (2227-7390) , 12(16), 2024

  44. [44]

    G. Lin, C. Moya, and Z. Zhang. B-deeponet: An enhanced ba yesian deeponet for solving noisy para- metric pdes using accelerated replica exchange sgld. Journal of Computational Physics , 473:111713, 2023

  45. [45]

    X. Liu, T. Xiao, S. Si, Q. Cao andS. K. Kumar, and C.-J. Hsi eh. Neural sde: Stabilizing neural ode networks with stochastic noise. arXiv preprint, 2019

  46. [46]

    L. Lu, , G. Pang, P. Jin, Z. Zhang, and G. E. Karniadakis. L earning nonlinear operators via deeponet based on the universal approximation theorem of op erators. Nat. Mach. Intell. , 3:218– 229, 2021

  47. [47]

    L. Lu, X. Meng, S. Cai, Z. Mao, S. Goswami, Z. Zhang, and G. E. Karniadakis. A comprehensive and fair comparison of two neural operators (with practical ext ensions) based on fair data. Computer Methods in Applied Mechanics and Engineering , 393:114778, 2022

  48. [48]

    Ma and J

    J. Ma and J. Yong. Forward-Backward Stochastic Differential Equations and The ir Applications , volume 1702 of Lecture Notes in Mathematics . Springer, 1999

  49. [49]

    H. C. ¨Ozen and G. Bal. A dynamical polynomial chaos approach for lo ng-time evolution of SPDEs. Journal of Computational Physics , 343:300–323, 2017

  50. [50]

    Petrella, S

    M. Petrella, S. Tokareva, and E.F. Toro. Uncertainty qu antification methodology for hyperbolic systems with application to blood flow in arteries. Journal of Computational Physics , 386:405–427, 2019

  51. [51]

    Pranesh and D

    S. Pranesh and D. Ghosh. Cost reduction of stochastic ga lerkin method by adaptive identification of significant polynomial chaos bases for elliptic equations. Computer Methods in Applied Mechanics and Engineering , 340:54–69, 2018

  52. [52]

    Rahman, M

    Md A. Rahman, M. A. Florez, A. Anandkumar, Z. E. Ross, and K. Azizzadenesheli. Generative adversarial neural operators. arXiv preprint, arXiv:2205 .03017, 2022

  53. [53]

    Cho S and M. Choi. Mgdgan: Multiple generator and discri minator generative adversarial networks for solving stochastic partial differential equations. IEEE Access, 10:130908–130920, 2022

  54. [54]

    Stochastic Optimal Control through Gradient Projection Met hod and Back- ward Action Learning

    Hui Sun. Stochastic Optimal Control through Gradient Projection Met hod and Back- ward Action Learning . Phd thesis, Florida State University, 2023. Retrieved fro m https://purl.lib.fsu.edu/diginole/Sun_fsu_0071E_18074

  55. [55]

    Tokareva, A

    S. Tokareva, A. Zlotnik, and V. Gyrya. Stochastic finite volume method for uncertainty quan- tification of transient flow in gas pipeline networks. Applied Mathematical Modelling , 125:66–84, 2024

  56. [56]

    Walton, S

    S. Walton, S. Tokareva, and G. Manzini. The tensor-trai n stochastic finite volume method for uncertainty quantification. Journal of Computational Physics , 538:114192, 2025. 30

  57. [57]

    Winovich, M

    N. Winovich, M. Daneker, Lu Lu, and G. Lin. Active operat or learning with predictive uncertainty quantification for partial differential equations. Journal of Computational Physics , 555:114791, 2026

  58. [58]

    ConvPDE- UQ: Convolutional neural networks with quantified uncertainty for heterogeneous elliptic par tial differential equations on varied do- mains

    Nick Winovich, Karthik Ramani, and Guang Lin. ConvPDE- UQ: Convolutional neural networks with quantified uncertainty for heterogeneous elliptic par tial differential equations on varied do- mains. Journal of Computational Physics , 394:263–279, 2019

  59. [59]

    Y. Yang, G. Kissas, and P. Perdikaris. Scalable uncerta inty quantification for deep operator networks using randomized priors. Computer Methods in Applied Mechanics and Engineering , 399:115399, 2022

  60. [60]

    Yong and X

    J. Yong and X. Y. Zhou. Stochastic controls: Hamiltonian systems and HJB equation s. Number 43 in Applications of mathematics. Springer Science & Busines s Media, New York, 1999

  61. [61]

    W. Zhao, L. Chen, and S. Peng. A new kind of accurate numer ical method for backward stochastic differential equations. SIAM Journal on Scientific Computing , 28(4):1563–1581, 2006. 31