Adaptive-Distribution Randomized Neural Networks for PDEs: A Low-Dimensional Distribution-Learning Framework

Fei Wang; You Yang

arxiv: 2604.23999 · v1 · submitted 2026-04-27 · 🧮 math.NA · cs.LG· cs.NA

Adaptive-Distribution Randomized Neural Networks for PDEs: A Low-Dimensional Distribution-Learning Framework

You Yang , Fei Wang This is my paper

Pith reviewed 2026-05-08 02:24 UTC · model grok-4.3

classification 🧮 math.NA cs.LGcs.NA

keywords distributionrandomizedadaptiveneuralproblemad-rannleast-squareslow-dimensional

0 comments

The pith

AD-RaNN learns an effective low-dimensional sampling distribution for hidden parameters in randomized neural networks by optimizing a vector p via PDE-driven or data-driven adaptation and a two-stage least-squares procedure, improving accuracy on benchmark PDE problems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Randomized neural networks solve partial differential equations by drawing random values for the hidden layer weights and biases, then solving a simple linear least-squares problem for the output weights. This avoids the heavy training of ordinary neural networks but works well only when the random values are sampled from a good distribution, which is usually chosen by hand for each new equation. The paper replaces that manual choice with an automatic step: it represents the sampling distribution by a short vector of numbers called p and tunes only those numbers. It does this in two stages. First it solves a regularized version of the problem to find a stable p. Then it solves the original unregularized problem once more with the chosen p to recover the final solution. Two concrete ways to choose the objective for tuning p are given: one that looks directly at the PDE residual and one that uses available data. The same idea is also extended to time-stepping schemes and to learning operators. Experiments on standard test problems show that the automatically chosen distributions give smaller errors than the usual hand-picked ones.

Core claim

AD-RaNN provides an effective distribution-level adaptation mechanism, reduces reliance on hand-crafted hidden-feature distributions, and achieves strong empirical accuracy.

Load-bearing premise

That a low-dimensional parameterization of the sampling distribution is expressive enough to capture near-optimal distributions for a broad class of PDEs, and that the two-stage ridge-regularized optimization produces a p that genuinely improves the final unregularized solution without introducing new instabilities.

Figures

Figures reproduced from arXiv: 2604.23999 by Fei Wang, You Yang.

**Figure 1.** Figure 1: In the architecture of Randomized Neural Networks, black solid lines denote parameters that have been view at source ↗

**Figure 2.** Figure 2: Two-layer RaNN with local basis functions. view at source ↗

**Figure 3.** Figure 3: Architecture of RaNN-DeepONet. 4.4.2 AD-RaNN-DeepONet The performance of RaNN-DeepONet depends strongly on the random distributions used to generate the branch and trunk hidden features. This is precisely the same structural issue encountered earlier for randomized neural PDE solvers: once the hidden representation is randomized and frozen, the quality of the approximation is largely determined by how the … view at source ↗

**Figure 4.** Figure 4: Reference solution and numerical solutions for the sharp-layer problem view at source ↗

**Figure 5.** Figure 5: Evolution of p = (rx, ry) for PDAD-DT (top) and DDAD-DT (bottom) with Nt = 100, 200, 400, 800 view at source ↗

**Figure 6.** Figure 6: Numerical solutions of (6.13) obtained by the PDAD method with (a1, a2) = (6, 6) (left) and (a1, a2) = (1, 20) (right). Here m1 = 3000 and mλ = 1000. 6.3 Burgers’ equation Burgers’ equation is an important partial differential equation that captures the interaction between nonlinear advection and viscous diffusion, and serves as a simplified model in many fluid-mechanical applications. To assess the perfor… view at source ↗

**Figure 7.** Figure 7: Results for the 1D Burgers’ equation (6.15) computed using PDAD-DT and DDAD-DT with Nt = 1000. From left to right, the columns show: the PDAD-DT solution, the optimized parameters p of PDAD-DT, the DDAD-DT solution, and the optimized parameters p of DDAD-DT. Case 2: Two-dimensional Burgers’ equation. We now extend Burgers’ equation to two dimensions and consider the problem    ut + u (ux + uy) − ε∆u… view at source ↗

**Figure 8.** Figure 8: Results for the 2D Burgers’ equation (6.16) with ε = 0.1 and Nt = 400 at t = 1. From left to right: the PDAD-DT solution, the optimized parameters p = (rx, ry) for PDAD-DT, the DDAD-DT solution, and the optimized parameters p = (rx, ry) for DDAD-DT. To further assess the performance of the discrete-time AD-RaNN, we reduce ε to 0.01, which produces a much sharper shock layer and significantly increases the … view at source ↗

**Figure 9.** Figure 9: Results for 2D Burgers’ equation (6.16) obtained by PDAD-DT (top) and DDAD-DT (bottom) with ε = 0.01 and Nt = 400. From left to right, the columns show: the numerical solution at t = 1, the residual points selected for the second layer at t = 1, the optimized first-layer parameters p, and the second-layer parameter r2. 6.4 Allen-Cahn equation Case 1: One-dimensional Allen-Cahn equation The Allen-Cahn equat… view at source ↗

**Figure 10.** Figure 10: Results for the Allen-Cahn equation (6.18) computed using PDAD-DT and DDAD-DT with Nt = 1000. From left to right, the columns show: the PDAD-DT solution, the optimized parameters p of PDAD-DT, the DDAD-DT solution, and the optimized parameters p of DDAD-DT. Case 2: Two-dimensional Allen-Cahn equation We now extend the Allen-Cahn equation to two dimensions and consider the problem with Dirichlet boundary c… view at source ↗

**Figure 11.** Figure 11: Results for the Allen-Cahn equation (6.19) obtained by PDAD-DT (top) and DDAD-DT (bottom) with Nt = 400 at t = 1. From left to right, the columns show: the numerical solution, the residual points selected for the second layer, the optimized first-layer parameters p, and the second-layer parameter r2. formulation of mean curvature flow: ∂tx = ∂ρρx |∂ρx| 2 , (6.20) where ρ ∈ (0, 1). A first-order discrete-t… view at source ↗

**Figure 12.** Figure 12: Schematic of the neural network architecture with a hard-constrained periodic boundary layer. view at source ↗

**Figure 13.** Figure 13: Comparison between the reference solution (blue dashed) and the DDAD-DT solution (red solid) for the view at source ↗

**Figure 14.** Figure 14: DDAD-DT solutions for the initial condition view at source ↗

**Figure 15.** Figure 15: Comparison of exact solutions (left), predictions (middle), and absolute errors (right) for equation view at source ↗

read the original abstract

Randomized neural networks (RaNNs) are attractive for partial differential equations (PDEs) because they replace expensive end-to-end training with a linear least-squares solve over randomized hidden features. Their practical performance, however, depends strongly on the sampling distribution of the hidden-layer parameters, which is usually chosen heuristically and problem by problem. This distribution sensitivity is a central bottleneck in randomized neural PDE solvers. In this work, we propose Adaptive-Distribution Randomized Neural Networks (AD-RaNN), a framework that promotes randomized feature generation from a fixed heuristic choice to a low-dimensional adaptive optimization problem. Instead of training all hidden weights and biases, AD-RaNN parameterizes the hidden-feature sampling distribution by a low-dimensional vector p and optimizes only p, thereby preserving the least-squares structure of RaNNs while reducing manual distribution tuning. The method uses a two-stage strategy: ridge-regularized reduced training for stable distribution-parameter optimization, followed by an unregularized least-squares refit for final solution recovery. We develop two adaptive mechanisms, PDE-Driven Adaptive Distribution (PDAD) and Data-Driven Adaptive Distribution (DDAD), and deploy them in space-time solvers, discrete-time solvers, and operator-learning models. We also incorporate an adaptive layer-growth enhancement for localized structures. For the reduced optimization problem, we establish well-posedness of the reduced objectives, consistency of ridge-regularized minimizers, an efficient gradient formula, and a practical lower-bound estimate for the ridge parameter. Numerical experiments on benchmark problems show that AD-RaNN provides an effective distribution-level adaptation mechanism, reduces reliance on hand-crafted hidden-feature distributions, and achieves strong empirical accuracy.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

AD-RaNN turns the heuristic distribution choice in randomized neural PDE solvers into a low-dimensional optimization over p, with a two-stage ridge-then-unregularized procedure, but the expressivity of that parameterization is the main open question.

read the letter

The main new element is the low-dimensional parameterization of the hidden-feature sampling distribution together with the PDAD and DDAD mechanisms and the explicit two-stage ridge-regularized optimization followed by unregularized refit. This keeps the linear least-squares core of RaNNs while moving distribution selection from manual tuning to an optimizable vector p, and they add an adaptive layer-growth step for localized features. The paper also supplies some formal statements on well-posedness of the reduced problem, consistency of the ridge minimizers, and an efficient gradient formula, which is more than most engineering tweaks in this area provide. Those pieces are useful for anyone already working with randomized networks on PDEs or operator learning, because they preserve the cheap solve while giving a concrete way to adapt the distribution without full nonlinear training. The experiments are reported to show clear accuracy gains on benchmarks, which suggests the approach is at least practically workable in the cases they tested. The soft spot is the assumption that a low-dimensional p can get close enough to a near-optimal distribution for a wide range of PDEs. The reduced objective optimizes a proxy, so nothing in the construction rules out the learned p staying far from the distribution that would minimize the final unregularized residual, especially when solutions have sharp gradients or high-frequency content. The two-stage split and ridge step are meant to limit direct fitting, but the performance still hinges on how expressive that low-dimensional family actually is. This is worth a serious referee because it targets a genuine bottleneck in the RaNN literature and supplies both mechanisms and some theory, even if the theory and the empirical robustness need checking. I would send it to review rather than desk-reject.

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper presents AD-RaNN as an algorithmic framework that optimizes a low-dimensional distribution parameter p via a two-stage ridge-regularized procedure before an unregularized refit. This optimization is explicitly part of the proposed method rather than a claimed first-principles derivation whose output reduces to its inputs by construction. Well-posedness, consistency, and gradient formulas are derived for the reduced problem itself; numerical results are presented as separate empirical evidence. No load-bearing self-citations, uniqueness theorems, or ansatzes imported from prior author work appear in the provided text, and the central performance claims rest on the method's design and experiments rather than tautological renaming or fitting.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 1 invented entities

The framework rests on the assumption that the sampling distribution can be usefully parameterized by a low-dimensional vector p whose optimization yields better features than fixed heuristics, plus standard well-posedness results for the reduced ridge-regularized least-squares problems.

free parameters (1)

distribution parameter vector p
Low-dimensional vector that defines the adaptive sampling distribution and is optimized in the first stage.

axioms (1)

domain assumption Well-posedness of the reduced ridge-regularized objectives
Invoked to guarantee existence and consistency of the minimizer for p.

invented entities (1)

AD-RaNN framework with PDAD and DDAD mechanisms no independent evidence
purpose: Adaptive low-dimensional distribution learning for randomized neural PDE solvers
New method introduced to replace heuristic distribution choice.

pith-pipeline@v0.9.0 · 5601 in / 1555 out tokens · 38076 ms · 2026-05-08T02:24:26.897521+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

38 extracted references · 38 canonical work pages

[1]

Galerkin neural networks: A framework for approximating variational equations with error control.SIAM Journal on Scientific Computing, 43(4):A2474–A2501, 2021

Mark Ainsworth and Justin Dong. Galerkin neural networks: A framework for approximating variational equations with error control.SIAM Journal on Scientific Computing, 43(4):A2474–A2501, 2021

work page 2021
[2]

Bridging Traditional and Machine Learning-Based Algorithms for Solving PDEs: The Random Feature Method.Journal of Machine Learning, 1(3):268–298, 2022

Jingrun Chen, Xurong Chi, Weinan E, and Zhouwang Yang. Bridging Traditional and Machine Learning-Based Algorithms for Solving PDEs: The Random Feature Method.Journal of Machine Learning, 1(3):268–298, 2022

work page 2022
[3]

Tianping Chen and Hong Chen. Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems.IEEE Transactions on Neural Networks, 6(4):911–917, 1995

work page 1995
[4]

Approximation by superpositions of a sigmoidal function.Mathematics of Control, Signals, and Systems, 2(4):303–314, 1989

George Cybenko. Approximation by superpositions of a sigmoidal function.Mathematics of Control, Signals, and Systems, 2(4):303–314, 1989

work page 1989
[5]

Adaptive growing randomized neural networks for solving partial differential equations.arXiv preprint arXiv:2408.17225, 2024

Haoning Dang, Fei Wang, and Song Jiang. Adaptive growing randomized neural networks for solving partial differential equations.arXiv preprint arXiv:2408.17225, 2024

work page arXiv 2024
[6]

Approximation theory and applications of randomized neural networks for solving high-dimensional PDEs.arXiv preprint arXiv:2501.12145, 2025

Tim De Ryck, Siddhartha Mishra, Y Shang, and F Wang. Approximation theory and applications of randomized neural networks for solving high-dimensional PDEs.arXiv preprint arXiv:2501.12145, 2025

work page arXiv 2025
[7]

Suchuan Dong and Zongwei Li. Local extreme learning machines and domain decomposition for solving linear and nonlinear partial differential equations.Computer Methods in Applied Mechanics and Engineering, 387:114129, 2021

work page 2021
[8]

A method for representing periodic functions and enforcing exactly periodic boundary conditions with deep neural networks.Journal of Computational Physics, 435:110242, 2021

Suchuan Dong and Naxian Ni. A method for representing periodic functions and enforcing exactly periodic boundary conditions with deep neural networks.Journal of Computational Physics, 435:110242, 2021

work page 2021
[9]

Suchuan Dong and Jielin Yang. On computing the hyperparameter of extreme learning machines: Algorithm and application to computational PDEs, and comparison with classical and high-order finite elements.Journal of Computational Physics, 463:111290, 2022

work page 2022
[10]

On the stability of implicit-explicit linear multistep methods.Applied Numerical Mathematics, 25(2-3):193–205, 1997

Jason Frank, Willem Hundsdorfer, and Jan G Verwer. On the stability of implicit-explicit linear multistep methods.Applied Numerical Mathematics, 25(2-3):193–205, 1997

work page 1997
[11]

The heat equation shrinking convex plane curves.Journal of Differential Geometry, 23(1):69–96, 1986

Michael Gage and Richard S Hamilton. The heat equation shrinking convex plane curves.Journal of Differential Geometry, 23(1):69–96, 1986

work page 1986
[12]

The heat equation shrinks embedded plane curves to round points.Journal of Differential Geometry, 26(2):285–314, 1987

Matthew A Grayson. The heat equation shrinks embedded plane curves to round points.Journal of Differential Geometry, 26(2):285–314, 1987

work page 1987
[13]

Approximation capabilities of multilayer feedforward networks.Neural Networks, 4(2):251–257, 1991

Kurt Hornik. Approximation capabilities of multilayer feedforward networks.Neural Networks, 4(2):251–257, 1991

work page 1991
[14]

DeepONet augmented by randomized neural networks for efficient operator learning in PDEs.Communications in Nonlinear Science and Numerical Simulation, 155:109605, 2026

Zhaoxi Jiang, Haoning Dang, and Fei Wang. DeepONet augmented by randomized neural networks for efficient operator learning in PDEs.Communications in Nonlinear Science and Numerical Simulation, 155:109605, 2026

work page 2026
[15]

Kingma and Jimmy Ba

Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization. InInternational Conference on Learning Representations, 2015

work page 2015
[16]

A structure-preserving PINN with embedded periodic boundary layer and adaptively enforced initial conditions for geometric flows.Computer Physics Communications, 316:109762, 2025

Meng Li and You Yang. A structure-preserving PINN with embedded periodic boundary layer and adaptively enforced initial conditions for geometric flows.Computer Physics Communications, 316:109762, 2025

work page 2025
[17]

Local randomized neural networks with finite difference methods for interface problems.Journal of Computational Physics, 529:113847, 2025

Yunlong Li and Fei Wang. Local randomized neural networks with finite difference methods for interface problems.Journal of Computational Physics, 529:113847, 2025. 36

work page 2025
[18]

On the limited memory BFGS method for large scale optimization.Mathematical Programming, 45(1-3):503–528, 1989

Dong C Liu and Jorge Nocedal. On the limited memory BFGS method for large scale optimization.Mathematical Programming, 45(1-3):503–528, 1989

work page 1989
[19]

Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators.Nature Machine Intelligence, 3(3):218–229, 2021

Lu Lu, Pengzhan Jin, Guofei Pang, Zhongqiang Zhang, and George Em Karniadakis. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators.Nature Machine Intelligence, 3(3):218–229, 2021

work page 2021
[20]

Revanth Mattey and Susanta Ghosh. A novel sequential method to train physics informed neural networks for Allen Cahn and Cahn Hilliard equations.Computer Methods in Applied Mechanics and Engineering, 390:114474, 2022

work page 2022
[21]

Self-adaptive physics-informed neural networks.Journal of Computational Physics, 474:111722, 2023

Levi D McClenny and Ulisses M Braga-Neto. Self-adaptive physics-informed neural networks.Journal of Computational Physics, 474:111722, 2023

work page 2023
[22]

Approximation theory of the MLP model in neural networks.Acta Numerica, 8:143–195, 1999

Allan Pinkus. Approximation theory of the MLP model in neural networks.Acta Numerica, 8:143–195, 1999

work page 1999
[23]

Maziar Raissi, Paris Perdikaris, and George E Karniadakis. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations.Journal of Computational Physics, 378:686–707, 2019

work page 2019
[24]

A stochastic Approximation Method.The Annals of Mathematical Statistics, 22(3):400–407, 1951

Herbert Robbins and Sutton Monro. A stochastic Approximation Method.The Annals of Mathematical Statistics, 22(3):400–407, 1951

work page 1951
[25]

Randomized neural networks with Petrov–Galerkin methods for solving linear elasticity and Navier–Stokes equations.Journal of Engineering Mechanics, 150(4):04024010, 2024

Yong Shang and Fei Wang. Randomized neural networks with Petrov–Galerkin methods for solving linear elasticity and Navier–Stokes equations.Journal of Engineering Mechanics, 150(4):04024010, 2024

work page 2024
[26]

Yong Shang, Fei Wang, and Jingbo Sun. Randomized neural network with Petrov–Galerkin methods for solving linear and nonlinear partial differential equations.Communications in Nonlinear Science and Numerical Simulation, 127:107518, 2023

work page 2023
[27]

DGM: A deep learning algorithm for solving partial differential equations.Journal of Computational Physics, 375:1339–1364, 2018

Justin Sirignano and Konstantinos Spiliopoulos. DGM: A deep learning algorithm for solving partial differential equations.Journal of Computational Physics, 375:1339–1364, 2018

work page 2018
[28]

Local randomized neural networks with discontinuous Galerkin methods for partial differential equations.Journal of Computational and Applied Mathematics, 445:115830, 2024

Jingbo Sun, Suchuan Dong, and Fei Wang. Local randomized neural networks with discontinuous Galerkin methods for partial differential equations.Journal of Computational and Applied Mathematics, 445:115830, 2024

work page 2024
[29]

Local randomized neural networks with discontinuous Galerkin methods for diffusive-viscous wave equation.Computers & Mathematics with Applications, 154:128–137, 2024

Jingbo Sun and Fei Wang. Local randomized neural networks with discontinuous Galerkin methods for diffusive-viscous wave equation.Computers & Mathematics with Applications, 154:128–137, 2024

work page 2024
[30]

Unveiling the optimization process of physics informed neural networks: How accurate and competitive can PINNs be?Journal of Computational Physics, 523:113656, 2025

Jorge F Urbán, Petros Stefanou, and José A Pons. Unveiling the optimization process of physics informed neural networks: How accurate and competitive can PINNs be?Journal of Computational Physics, 523:113656, 2025

work page 2025
[31]

Learning the solution operator of parametric partial differential equations with physics-informed DeepONets.Science Advances, 7(40):eabi8605, 2021

Sifan Wang, Hanwen Wang, and Paris Perdikaris. Learning the solution operator of parametric partial differential equations with physics-informed DeepONets.Science Advances, 7(40):eabi8605, 2021

work page 2021
[32]

Solving high-dimensional partial differential equations using tensor neural network and a posteriori error estimators.Journal of Scientific Computing, 101(3):67, 2024

Yifan Wang, Zhongshuo Lin, Yangfei Liao, Haochen Liu, and Hehu Xie. Solving high-dimensional partial differential equations using tensor neural network and a posteriori error estimators.Journal of Scientific Computing, 101(3):67, 2024

work page 2024
[33]

Computing multi-eigenpairs of high-dimensional eigenvalue problems using tensor neural networks.Journal of Computational Physics, 506:112928, 2024

Yifan Wang and Hehu Xie. Computing multi-eigenpairs of high-dimensional eigenvalue problems using tensor neural networks.Journal of Computational Physics, 506:112928, 2024

work page 2024
[34]

A practical PINN framework for multi-scale problems with multi-magnitude loss terms.Journal of Computational Physics, 510:113112, 2024

Yong Wang, Yanzhong Yao, Jiawei Guo, and Zhiming Gao. A practical PINN framework for multi-scale problems with multi-magnitude loss terms.Journal of Computational Physics, 510:113112, 2024. 37

work page 2024
[35]

Chenxi Wu, Min Zhu, Qinyang Tan, Yadhu Kartha, and Lu Lu. A comprehensive study of non-adaptive and residual-based adaptive sampling for physics-informed neural networks.Computer Methods in Applied Mechanics and Engineering, 403:115671, 2023

work page 2023
[36]

Subspace method based on neural networks for solving the partial differential equation.Computers & Mathematics with Applications, 195:109–138, 2025

Zhaodong Xu and Zhiqiang Sheng. Subspace method based on neural networks for solving the partial differential equation.Computers & Mathematics with Applications, 195:109–138, 2025

work page 2025
[37]

Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems.Computer Methods in Applied Mechanics and Engineering, 393:114823, 2022

Jeremy Yu, Lu Lu, Xuhui Meng, and George Em Karniadakis. Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems.Computer Methods in Applied Mechanics and Engineering, 393:114823, 2022

work page 2022
[38]

Transferable neural networks for partial differential equations.Journal of Scientific Computing, 99(1):2, 2024

Zezhong Zhang, Feng Bao, Lili Ju, and Guannan Zhang. Transferable neural networks for partial differential equations.Journal of Scientific Computing, 99(1):2, 2024. 38

work page 2024

[1] [1]

Galerkin neural networks: A framework for approximating variational equations with error control.SIAM Journal on Scientific Computing, 43(4):A2474–A2501, 2021

Mark Ainsworth and Justin Dong. Galerkin neural networks: A framework for approximating variational equations with error control.SIAM Journal on Scientific Computing, 43(4):A2474–A2501, 2021

work page 2021

[2] [2]

Bridging Traditional and Machine Learning-Based Algorithms for Solving PDEs: The Random Feature Method.Journal of Machine Learning, 1(3):268–298, 2022

Jingrun Chen, Xurong Chi, Weinan E, and Zhouwang Yang. Bridging Traditional and Machine Learning-Based Algorithms for Solving PDEs: The Random Feature Method.Journal of Machine Learning, 1(3):268–298, 2022

work page 2022

[3] [3]

Tianping Chen and Hong Chen. Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems.IEEE Transactions on Neural Networks, 6(4):911–917, 1995

work page 1995

[4] [4]

Approximation by superpositions of a sigmoidal function.Mathematics of Control, Signals, and Systems, 2(4):303–314, 1989

George Cybenko. Approximation by superpositions of a sigmoidal function.Mathematics of Control, Signals, and Systems, 2(4):303–314, 1989

work page 1989

[5] [5]

Adaptive growing randomized neural networks for solving partial differential equations.arXiv preprint arXiv:2408.17225, 2024

Haoning Dang, Fei Wang, and Song Jiang. Adaptive growing randomized neural networks for solving partial differential equations.arXiv preprint arXiv:2408.17225, 2024

work page arXiv 2024

[6] [6]

Approximation theory and applications of randomized neural networks for solving high-dimensional PDEs.arXiv preprint arXiv:2501.12145, 2025

Tim De Ryck, Siddhartha Mishra, Y Shang, and F Wang. Approximation theory and applications of randomized neural networks for solving high-dimensional PDEs.arXiv preprint arXiv:2501.12145, 2025

work page arXiv 2025

[7] [7]

Suchuan Dong and Zongwei Li. Local extreme learning machines and domain decomposition for solving linear and nonlinear partial differential equations.Computer Methods in Applied Mechanics and Engineering, 387:114129, 2021

work page 2021

[8] [8]

A method for representing periodic functions and enforcing exactly periodic boundary conditions with deep neural networks.Journal of Computational Physics, 435:110242, 2021

Suchuan Dong and Naxian Ni. A method for representing periodic functions and enforcing exactly periodic boundary conditions with deep neural networks.Journal of Computational Physics, 435:110242, 2021

work page 2021

[9] [9]

Suchuan Dong and Jielin Yang. On computing the hyperparameter of extreme learning machines: Algorithm and application to computational PDEs, and comparison with classical and high-order finite elements.Journal of Computational Physics, 463:111290, 2022

work page 2022

[10] [10]

On the stability of implicit-explicit linear multistep methods.Applied Numerical Mathematics, 25(2-3):193–205, 1997

Jason Frank, Willem Hundsdorfer, and Jan G Verwer. On the stability of implicit-explicit linear multistep methods.Applied Numerical Mathematics, 25(2-3):193–205, 1997

work page 1997

[11] [11]

The heat equation shrinking convex plane curves.Journal of Differential Geometry, 23(1):69–96, 1986

Michael Gage and Richard S Hamilton. The heat equation shrinking convex plane curves.Journal of Differential Geometry, 23(1):69–96, 1986

work page 1986

[12] [12]

The heat equation shrinks embedded plane curves to round points.Journal of Differential Geometry, 26(2):285–314, 1987

Matthew A Grayson. The heat equation shrinks embedded plane curves to round points.Journal of Differential Geometry, 26(2):285–314, 1987

work page 1987

[13] [13]

Approximation capabilities of multilayer feedforward networks.Neural Networks, 4(2):251–257, 1991

Kurt Hornik. Approximation capabilities of multilayer feedforward networks.Neural Networks, 4(2):251–257, 1991

work page 1991

[14] [14]

DeepONet augmented by randomized neural networks for efficient operator learning in PDEs.Communications in Nonlinear Science and Numerical Simulation, 155:109605, 2026

Zhaoxi Jiang, Haoning Dang, and Fei Wang. DeepONet augmented by randomized neural networks for efficient operator learning in PDEs.Communications in Nonlinear Science and Numerical Simulation, 155:109605, 2026

work page 2026

[15] [15]

Kingma and Jimmy Ba

Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization. InInternational Conference on Learning Representations, 2015

work page 2015

[16] [16]

A structure-preserving PINN with embedded periodic boundary layer and adaptively enforced initial conditions for geometric flows.Computer Physics Communications, 316:109762, 2025

Meng Li and You Yang. A structure-preserving PINN with embedded periodic boundary layer and adaptively enforced initial conditions for geometric flows.Computer Physics Communications, 316:109762, 2025

work page 2025

[17] [17]

Local randomized neural networks with finite difference methods for interface problems.Journal of Computational Physics, 529:113847, 2025

Yunlong Li and Fei Wang. Local randomized neural networks with finite difference methods for interface problems.Journal of Computational Physics, 529:113847, 2025. 36

work page 2025

[18] [18]

On the limited memory BFGS method for large scale optimization.Mathematical Programming, 45(1-3):503–528, 1989

Dong C Liu and Jorge Nocedal. On the limited memory BFGS method for large scale optimization.Mathematical Programming, 45(1-3):503–528, 1989

work page 1989

[19] [19]

Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators.Nature Machine Intelligence, 3(3):218–229, 2021

Lu Lu, Pengzhan Jin, Guofei Pang, Zhongqiang Zhang, and George Em Karniadakis. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators.Nature Machine Intelligence, 3(3):218–229, 2021

work page 2021

[20] [20]

Revanth Mattey and Susanta Ghosh. A novel sequential method to train physics informed neural networks for Allen Cahn and Cahn Hilliard equations.Computer Methods in Applied Mechanics and Engineering, 390:114474, 2022

work page 2022

[21] [21]

Self-adaptive physics-informed neural networks.Journal of Computational Physics, 474:111722, 2023

Levi D McClenny and Ulisses M Braga-Neto. Self-adaptive physics-informed neural networks.Journal of Computational Physics, 474:111722, 2023

work page 2023

[22] [22]

Approximation theory of the MLP model in neural networks.Acta Numerica, 8:143–195, 1999

Allan Pinkus. Approximation theory of the MLP model in neural networks.Acta Numerica, 8:143–195, 1999

work page 1999

[23] [23]

Maziar Raissi, Paris Perdikaris, and George E Karniadakis. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations.Journal of Computational Physics, 378:686–707, 2019

work page 2019

[24] [24]

A stochastic Approximation Method.The Annals of Mathematical Statistics, 22(3):400–407, 1951

Herbert Robbins and Sutton Monro. A stochastic Approximation Method.The Annals of Mathematical Statistics, 22(3):400–407, 1951

work page 1951

[25] [25]

Randomized neural networks with Petrov–Galerkin methods for solving linear elasticity and Navier–Stokes equations.Journal of Engineering Mechanics, 150(4):04024010, 2024

Yong Shang and Fei Wang. Randomized neural networks with Petrov–Galerkin methods for solving linear elasticity and Navier–Stokes equations.Journal of Engineering Mechanics, 150(4):04024010, 2024

work page 2024

[26] [26]

Yong Shang, Fei Wang, and Jingbo Sun. Randomized neural network with Petrov–Galerkin methods for solving linear and nonlinear partial differential equations.Communications in Nonlinear Science and Numerical Simulation, 127:107518, 2023

work page 2023

[27] [27]

DGM: A deep learning algorithm for solving partial differential equations.Journal of Computational Physics, 375:1339–1364, 2018

Justin Sirignano and Konstantinos Spiliopoulos. DGM: A deep learning algorithm for solving partial differential equations.Journal of Computational Physics, 375:1339–1364, 2018

work page 2018

[28] [28]

Local randomized neural networks with discontinuous Galerkin methods for partial differential equations.Journal of Computational and Applied Mathematics, 445:115830, 2024

Jingbo Sun, Suchuan Dong, and Fei Wang. Local randomized neural networks with discontinuous Galerkin methods for partial differential equations.Journal of Computational and Applied Mathematics, 445:115830, 2024

work page 2024

[29] [29]

Local randomized neural networks with discontinuous Galerkin methods for diffusive-viscous wave equation.Computers & Mathematics with Applications, 154:128–137, 2024

Jingbo Sun and Fei Wang. Local randomized neural networks with discontinuous Galerkin methods for diffusive-viscous wave equation.Computers & Mathematics with Applications, 154:128–137, 2024

work page 2024

[30] [30]

Unveiling the optimization process of physics informed neural networks: How accurate and competitive can PINNs be?Journal of Computational Physics, 523:113656, 2025

Jorge F Urbán, Petros Stefanou, and José A Pons. Unveiling the optimization process of physics informed neural networks: How accurate and competitive can PINNs be?Journal of Computational Physics, 523:113656, 2025

work page 2025

[31] [31]

Learning the solution operator of parametric partial differential equations with physics-informed DeepONets.Science Advances, 7(40):eabi8605, 2021

Sifan Wang, Hanwen Wang, and Paris Perdikaris. Learning the solution operator of parametric partial differential equations with physics-informed DeepONets.Science Advances, 7(40):eabi8605, 2021

work page 2021

[32] [32]

Solving high-dimensional partial differential equations using tensor neural network and a posteriori error estimators.Journal of Scientific Computing, 101(3):67, 2024

Yifan Wang, Zhongshuo Lin, Yangfei Liao, Haochen Liu, and Hehu Xie. Solving high-dimensional partial differential equations using tensor neural network and a posteriori error estimators.Journal of Scientific Computing, 101(3):67, 2024

work page 2024

[33] [33]

Computing multi-eigenpairs of high-dimensional eigenvalue problems using tensor neural networks.Journal of Computational Physics, 506:112928, 2024

Yifan Wang and Hehu Xie. Computing multi-eigenpairs of high-dimensional eigenvalue problems using tensor neural networks.Journal of Computational Physics, 506:112928, 2024

work page 2024

[34] [34]

A practical PINN framework for multi-scale problems with multi-magnitude loss terms.Journal of Computational Physics, 510:113112, 2024

Yong Wang, Yanzhong Yao, Jiawei Guo, and Zhiming Gao. A practical PINN framework for multi-scale problems with multi-magnitude loss terms.Journal of Computational Physics, 510:113112, 2024. 37

work page 2024

[35] [35]

Chenxi Wu, Min Zhu, Qinyang Tan, Yadhu Kartha, and Lu Lu. A comprehensive study of non-adaptive and residual-based adaptive sampling for physics-informed neural networks.Computer Methods in Applied Mechanics and Engineering, 403:115671, 2023

work page 2023

[36] [36]

Subspace method based on neural networks for solving the partial differential equation.Computers & Mathematics with Applications, 195:109–138, 2025

Zhaodong Xu and Zhiqiang Sheng. Subspace method based on neural networks for solving the partial differential equation.Computers & Mathematics with Applications, 195:109–138, 2025

work page 2025

[37] [37]

Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems.Computer Methods in Applied Mechanics and Engineering, 393:114823, 2022

Jeremy Yu, Lu Lu, Xuhui Meng, and George Em Karniadakis. Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems.Computer Methods in Applied Mechanics and Engineering, 393:114823, 2022

work page 2022

[38] [38]

Transferable neural networks for partial differential equations.Journal of Scientific Computing, 99(1):2, 2024

Zezhong Zhang, Feng Bao, Lili Ju, and Guannan Zhang. Transferable neural networks for partial differential equations.Journal of Scientific Computing, 99(1):2, 2024. 38

work page 2024