pith. sign in

arxiv: 2605.15285 · v1 · pith:NI5DRLRSnew · submitted 2026-05-14 · 💻 cs.LG · cs.AI· cs.NA· math.FA· math.NA· math.OC

Universal Approximation of Nonlinear Operators and Their Derivatives

Pith reviewed 2026-05-19 16:43 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.NAmath.FAmath.NAmath.OC
keywords universal approximation theoremoperator learningBanach spacesdifferentiable operatorsDeepONetweighted Sobolev normsBastiani differentiability
2
0 comments X

The pith

Nonlinear k-times differentiable operators between Banach spaces and their derivatives can be universally approximated by operator learning architectures.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proves universal approximation theorems showing that nonlinear operators which are k times differentiable in the Bastiani sense between Banach spaces can be approximated together with their derivatives. The approximations hold uniformly on compact sets and in weighted Sobolev norms for general finite input measures, using encoder-decoder operator learning architectures. This extends the classical finite-dimensional results of Hornik 1991 to infinite-dimensional settings. A sympathetic reader would care because such approximations enable high-order accurate models for systems governed by operators on function spaces, such as PDEs and optimal control problems.

Core claim

We prove the first UATs of non-linear k-times differentiable operators between Banach spaces and their derivatives, uniformly on compact sets and in weighted Sobolev norms for general finite input measures, via OL architectures. Our results are the first complete generalizations of the corresponding influential classical results in [Hornik, 1991] to infinite-dimensional settings and OL.

What carries the argument

Encoder-decoder architectures combined with novel weighted Sobolev spaces and Bastiani k-times continuous differentiability, which carry the uniform approximation in natural compact-open topologies.

If this is right

  • High-order accuracy becomes achievable in operator learning for tasks requiring derivatives.
  • Fast constrained optimization in Banach spaces is enabled for problems like optimal control of PDEs and inverse problems.
  • Numerical methods for infinite-dimensional PDEs such as HJB equations on Banach spaces and SPDEs gain derivative-informed approximations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The results suggest that derivative-informed training could improve generalization in scientific machine learning models of physical systems.
  • Similar approximation strategies might extend to other classes of operators arising in mean-field control or partially observed systems.
  • The emphasis on specific topologies indicates that standard operator-norm approximations may be insufficient for derivative-aware learning.

Load-bearing premise

The operator must be k-times continuously differentiable in the Bastiani sense and the Banach spaces must satisfy the required approximation properties under the chosen weighted Sobolev norms.

What would settle it

An explicit nonlinear operator that is k-times Bastiani differentiable yet cannot be approximated to arbitrary accuracy in the weighted Sobolev norm on some compact set by any encoder-decoder OL network would falsify the theorems.

read the original abstract

Derivative-Informed Operator Learning (DIOL), i.e. learning a (nonlinear) operator and its derivatives, is an open research frontier at the foundations of the influential field of Operator Learning (OL). In particular, Universal Approximation Theorems (UATs) of nonlinear operators and their derivatives are foundational open questions and delicate problems in nonlinear functional analysis. In this manuscript, we prove the first UATs of non-linear $k$-times differentiable operators between Banach spaces and their derivatives, uniformly on compact sets and in weighted Sobolev norms for general finite input measures, via OL architectures. Our results are the first complete generalizations of the corresponding influential classical results in [Hornik, 1991] to infinite-dimensional settings and OL. We discuss several open areas where DIOL and our UATs find applications: high-order accuracy in OL, fast constrained optimization in Banach spaces (e.g. optimal control of PDEs, inverse problems) and numerical methods for infinite-dimensional PDEs (e.g. HJB PDEs on Banach spaces from optimal control of PDEs, SPDEs, path-dependent systems, partially observed systems, mean-field control). We parameterize nonlinear operators via Encoder-Decoder Architectures, renowned classes in OL due to their generality, including classical architectures, such as DeepONets, Deep-H-ONets, PCA-Nets. Our results are based on four key features that allow us to prove UATs in full generality: (i) Approximation Properties of Banach spaces. (ii) $k$-times continuous differentiability in the sense of Bastiani (weaker than $k$-times continuous Fr\'echet differentiability). (iii) Natural compact-open topologies for UA; indeed, we show that UA in standard compact-open topologies induced by operator norms is violated even for Fr\'echet derivatives. (iv) Construction of novel weighted Sobolev spaces for the UA.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The manuscript establishes the first universal approximation theorems (UATs) for nonlinear k-times Bastiani-differentiable operators between Banach spaces and their derivatives. Encoder-decoder architectures are shown to be dense, uniformly on compact sets, in topologies induced by novel weighted Sobolev norms that control derivatives up to order k, for general finite input measures. The argument relies on the approximation property of Banach spaces to reduce to finite dimensions, a counter-example showing failure of standard compact-open topologies for Fréchet derivatives, and explicit constructions of the weighted spaces. Results are positioned as complete generalizations of Hornik (1991) to infinite-dimensional operator learning.

Significance. If the central derivations hold, the work supplies foundational UATs that directly support derivative-informed operator learning in infinite dimensions. Strengths include the explicit counter-example justifying the topology choice, the construction of weighted Sobolev spaces tailored to control higher derivatives, and the reduction via Banach-space approximation properties. These elements enable applications in high-order accurate operator learning, constrained optimization over Banach spaces, and numerical treatment of infinite-dimensional PDEs such as HJB equations.

major comments (2)
  1. [Main theorem / reduction step] §3 (or equivalent section containing the main UAT): the reduction from infinite- to finite-dimensional cases via the approximation property must be shown to preserve uniformity of the approximation simultaneously for the operator and all derivatives up to order k in the weighted Sobolev norm; the current sketch leaves open whether the constants depend on k or on the particular compact set.
  2. [Weighted Sobolev spaces] Definition of the weighted Sobolev spaces (likely §2.3 or §4): the norm is stated to control derivatives up to order k, but it is unclear whether the weight functions are chosen so that the resulting space is Banach and whether density of encoder-decoder maps holds without further restrictions on the finite measure; a concrete verification that the norm is equivalent to the standard Sobolev norm on the support of the measure would strengthen the claim.
minor comments (3)
  1. [Preliminaries] Notation for Bastiani differentiability is introduced without an explicit comparison table to Fréchet differentiability; adding one would clarify the weakening used throughout the proofs.
  2. [Topologies] The counter-example showing failure of the operator-norm compact-open topology is referenced but its explicit construction (e.g., the specific operator and sequence of approximants) should be placed in the main text rather than an appendix for readability.
  3. [Introduction] Several citations to classical results (Hornik 1991 and related works) appear only in the abstract and introduction; a dedicated related-work subsection would help situate the novelty.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading, positive assessment of the significance, and recommendation for minor revision. The comments help clarify key technical points in the proofs. We address each major comment below and will make the indicated revisions to strengthen the manuscript.

read point-by-point responses
  1. Referee: [Main theorem / reduction step] §3 (or equivalent section containing the main UAT): the reduction from infinite- to finite-dimensional cases via the approximation property must be shown to preserve uniformity of the approximation simultaneously for the operator and all derivatives up to order k in the weighted Sobolev norm; the current sketch leaves open whether the constants depend on k or on the particular compact set.

    Authors: We agree that the reduction step benefits from a more explicit verification of uniformity. The approximation property of Banach spaces permits uniform approximation of the infinite-dimensional operator and its Bastiani derivatives on compact sets by finite-dimensional counterparts. Because the operators are k-times Bastiani differentiable, the derivatives up to order k are uniformly continuous on the relevant compacts, and the weighted Sobolev norm is designed to control them simultaneously. In the revised manuscript we will expand the argument in the section containing the main UAT to include a detailed estimate showing that, for any fixed k and fixed compact set, there exist constants (depending on k and the compact but independent of the approximating sequence) such that the encoder-decoder maps approximate both the operator and all derivatives uniformly in the weighted norm. This mirrors the structure used in the finite-dimensional Hornik (1991) result and preserves the claimed uniformity. revision: yes

  2. Referee: [Weighted Sobolev spaces] Definition of the weighted Sobolev spaces (likely §2.3 or §4): the norm is stated to control derivatives up to order k, but it is unclear whether the weight functions are chosen so that the resulting space is Banach and whether density of encoder-decoder maps holds without further restrictions on the finite measure; a concrete verification that the norm is equivalent to the standard Sobolev norm on the support of the measure would strengthen the claim.

    Authors: We thank the referee for this suggestion. The weight functions are chosen to be positive, continuous, and bounded away from zero and infinity on the supports of the finite measures under consideration; this guarantees that the weighted Sobolev norm defines a Banach space (as a finite sum of weighted L^p norms of derivatives up to order k, each of which is complete). The density of encoder-decoder architectures holds for arbitrary finite input measures because the proof first reduces to the finite-dimensional case via the approximation property and then lifts the density using the Bastiani differentiability; no additional restrictions on the measure are required. In the revision we will insert a short proposition (in the section defining the weighted spaces) that explicitly verifies equivalence of the weighted norm to the standard Sobolev norm on the support of any finite measure, by bounding the weights between two positive constants on the relevant compact sets. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The manuscript establishes universal approximation theorems for k-times Bastiani-differentiable nonlinear operators between Banach spaces and their derivatives via explicit functional-analytic arguments: reduction via the approximation property to finite dimensions, construction of weighted Sobolev norms that control derivatives up to order k, and density of encoder-decoder architectures in the resulting compact-open topology. These steps rely on standard properties of Banach spaces and a new norm definition rather than any fitted parameters, self-referential definitions, or load-bearing self-citations. The provided counter-example for the failure of operator-norm topologies is external to the target result. The derivation is therefore self-contained and does not reduce to its inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 1 invented entities

The paper rests on standard functional-analytic background plus one novel construction; no numerical free parameters appear because the contribution is a proof rather than a data-fitted model.

axioms (2)
  • domain assumption Approximation properties of Banach spaces hold in the required uniform sense on compact sets
    Listed as key feature (i) enabling UATs in full generality.
  • domain assumption k-times continuous differentiability in the sense of Bastiani is sufficient for the operator class under study
    Invoked as key feature (ii) and described as weaker than Fréchet differentiability.
invented entities (1)
  • weighted Sobolev spaces tailored for universal approximation no independent evidence
    purpose: Provide the norm in which uniform approximation of operators and derivatives is measured
    Introduced as key feature (iv) to achieve the stated UATs.

pith-pipeline@v0.9.0 · 5888 in / 1608 out tokens · 57814 ms · 2026-05-19T16:43:50.060686+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

132 extracted references · 132 canonical work pages · 4 internal anchors

  1. [1]

    Elsevier, 2003

    Robert A Adams and John JF Fournier.Sobolev spaces, volume 140. Elsevier, 2003

  2. [2]

    On the closability of differential operators.Journal of Functional Analysis, 289(7):111029, 2025

    Giovanni Alberti, David Bate, and Andrea Marchese. On the closability of differential operators.Journal of Functional Analysis, 289(7):111029, 2025

  3. [3]

    Springer, 2006

    Fernando Albiac and Nigel J Kalton.Topics in Banach space theory. Springer, 2006

  4. [4]

    Tucker Tensor Train Taylor Series.arXiv preprint arXiv:2603.21141, 2026

    Nick Alger, Blake Christierson, Peng Chen, and Omar Ghattas. Tucker Tensor Train Taylor Series.arXiv preprint arXiv:2603.21141, 2026

  5. [5]

    A PINN approach for the online identification and control of unknown PDEs.Journal of Optimization Theory and Applications, 206(1):8, 2025

    Alessandro Alla, Giulia Bertaglia, and Elisa Calzola. A PINN approach for the online identification and control of unknown PDEs.Journal of Optimization Theory and Applications, 206(1):8, 2025

  6. [6]

    Applications différentiables et variétés différentiables de dimension infinie.Journal d’Analyse mathé- matique, 13(1):1–114, 1964

    Andrée Bastiani. Applications différentiables et variétés différentiables de dimension infinie.Journal d’Analyse mathé- matique, 13(1):1–114, 1964

  7. [7]

    Springer Science & Business Media, 2013

    Yu M Berezansky and Yuri G Kondratiev.Spectral methods in infinite-dimensional analysis, volume 12. Springer Science & Business Media, 2013

  8. [8]

    Kovachki, and Andrew M

    Kaushik Bhattacharya, Bamdad Hosseini, Nikola B. Kovachki, and Andrew M. Stuart. Model reduction and neural networks for parametric PDEs.The SMAI Journal of Computational Mathematics, 7:121–157, 2021

  9. [9]

    European Mathematical Society, 2011

    Anders Björn and Jana Björn.Nonlinear potential theory on metric spaces, volume 17. European Mathematical Society, 2011

  10. [10]

    Absolutely continuous flows generated by Sobolev class vector fields in finite and infinite dimensions.Journal of functional analysis, 167(1):1–68, 1999

    Vladimir Bogachev and Eduardo Mayer-Wolf. Absolutely continuous flows generated by Sobolev class vector fields in finite and infinite dimensions.Journal of functional analysis, 167(1):1–68, 1999

  11. [11]

    Number 62

    Vladimir Igorevich Bogachev.Gaussian measures. Number 62. American Mathematical Soc., 1998

  12. [12]

    Number 164

    Vladimir Igorevich Bogachev.Differentiable measures and the Malliavin calculus. Number 164. American Mathematical Soc., 2010

  13. [13]

    Gabriele Bolli and Filippo de Feo. Optimal control of stochastic Volterra integral equations with completely monotone kernels and stochastic differential equations on Hilbert spaces with unbounded control and diffusion operators.arXiv preprint arXiv:2602.17578, 2026

  14. [14]

    Smooth functions on banach manifolds.Journal of Mathematics and Mechanics, 15(5):877–898, 1966

    Robert Bonic and John Frampton. Smooth functions on banach manifolds.Journal of Mathematics and Mechanics, 15(5):877–898, 1966

  15. [15]

    Lianghao Cao, Joshua Chen, Michael Brennan, Thomas O’Leary-Roseberry, Youssef Marzouk, and Omar Ghattas. Lazydino: Fast, scalable, and efficiently amortized bayesian inversion via structure-exploiting and surrogate-driven measure transport.Journal of Machine Learning Research, 27(24):1–71, 2026

  16. [16]

    Derivative-informed neural operator acceleration of geometric MCMC for infinite-dimensional Bayesian inverse problems.Journal of Machine Learning Research, 26(78):1– 68, 2025

    Lianghao Cao, Thomas O’Leary-Roseberry, and Omar Ghattas. Derivative-informed neural operator acceleration of geometric MCMC for infinite-dimensional Bayesian inverse problems.Journal of Machine Learning Research, 26(78):1– 68, 2025

  17. [17]

    Laplace neural operator for solving differential equations.Nature Machine Intelligence, 6:631–640, 2023

    Qianying Cao, Somdatta Goswami, and George Em Karniadakis. Laplace neural operator for solving differential equations.Nature Machine Intelligence, 6:631–640, 2023

  18. [18]

    Approximation properties.Handbook of the geometry of Banach spaces, pages 271–316, 2001

    Peter G Casazza. Approximation properties.Handbook of the geometry of Banach spaces, pages 271–316, 2001

  19. [19]

    Coefficient quantization for frames in banach spaces.Journal of mathematical analysis and applications, 348(1):66–86, 2008

    Peter G Casazza, Stephen J Dilworth, Edward Odell, Th Schlumprecht, and Andras Zsak. Coefficient quantization for frames in banach spaces.Journal of mathematical analysis and applications, 348(1):66–86, 2008

  20. [20]

    The kolmogorov infinite dimensional equation in a hilbert space via deep learning methods.Journal of Mathematical Analysis and Applications, 527(2):127413, 2023

    Javier Castro. The kolmogorov infinite dimensional equation in a hilbert space via deep learning methods.Journal of Mathematical Analysis and Applications, 527(2):127413, 2023

  21. [21]

    The Calderón’s problem via DeepONets.Vietnam Journal of Mathematics, 52(3):775–806, 2024

    Javier Castro, Claudio Muñoz, and Nicolás Valenzuela. The Calderón’s problem via DeepONets.Vietnam Journal of Mathematics, 52(3):775–806, 2024

  22. [22]

    Kwossek, and David J

    Mihriban Ceylan, Anna P. Kwossek, and David J. Prömel. Universal approximation with signatures of non-geometric rough paths.arXiv preprint arXiv:2602.05898, 2026

  23. [23]

    Tianping Chen and Hong Chen. Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems.IEEE transactions on neural networks, 6(4):911–917, 1995. 43

  24. [24]

    Weakandquasiapproximationpropertiesinbanachspaces.Journal of mathematical analysis and applications, 316(2):722–735, 2006

    ChangsunChoiandJuMyungKim. Weakandquasiapproximationpropertiesinbanachspaces.Journal of mathematical analysis and applications, 316(2):722–735, 2006

  25. [25]

    Cohen, Filippo de Feo, Jackson Hebner, and Justin Sirignano

    Samuel N. Cohen, Filippo de Feo, Jackson Hebner, and Justin Sirignano. Deep hilbert–galerkin methods for infinite- dimensional pdes and optimal control.arXiv preprint arXiv:2603.19463, 2026

  26. [26]

    Hamilton-Jacobi equations in infinite dimensions I

    Michael G Crandall and Pierre-Louis Lions. Hamilton-Jacobi equations in infinite dimensions I. Uniqueness of viscosity solutions.Journal of Functional Analysis, 62(3):379–396, 1985

  27. [27]

    Hamilton-Jacobi equations in infinite dimensions

    Michael G Crandall and Pierre-Louis Lions. Hamilton-Jacobi equations in infinite dimensions. II. Existence of viscosity solutions.Journal of Functional Analysis, 65(3):368–405, 1986

  28. [28]

    Hamilton-Jacobi equations in infinite dimensions, III.Journal of Functional Analysis, 68(2):214–247, 1986

    Michael G Crandall and Pierre-Louis Lions. Hamilton-Jacobi equations in infinite dimensions, III.Journal of Functional Analysis, 68(2):214–247, 1986

  29. [29]

    Viscosity solutions of Hamilton-Jacobi equations in infinite dimensions

    Michael G Crandall and Pierre-Louis Lions. Viscosity solutions of Hamilton-Jacobi equations in infinite dimensions. V. Unbounded linear terms and B-continuous solutions.Journal of Functional Analysis, 97(2):417–465, 1991

  30. [30]

    Viscosity solutions of Hamilton-Jacobi equations in infinite dimensions

    Michael G Crandall and Pierre-Louis Lions. Viscosity solutions of Hamilton-Jacobi equations in infinite dimensions. VII. The HJB equation is not always satisfied.Journal of Functional Analysis, 125(1):111–148, 1994

  31. [31]

    Universal approximation theorems for continuous functions of càdlàg paths and Lévy-type signature models.Finance and Stochastics, 29(2):289–342, 2025

    Christa Cuchiero, Francesca Primavera, and Sara Svaluto-Ferro. Universal approximation theorems for continuous functions of càdlàg paths and Lévy-type signature models.Finance and Stochastics, 29(2):289–342, 2025

  32. [32]

    Cambridge University Press, 2002

    Giuseppe Da Prato and Jerzy Zabczyk.Second order partial differential equations in Hilbert spaces, volume 293. Cambridge University Press, 2002

  33. [33]

    Cambridge university press, 2014

    Giuseppe Da Prato and Jerzy Zabczyk.Stochastic equations in infinite dimensions. Cambridge university press, 2014

  34. [34]

    PhD Thesis, Politecnico di Milano, 2023

    Filippo de Feo.SDEs on Hilbert Spaces: Slow-Fast Systems of SPDEs, Stochastic Optimal Control with Delays and Applications to Economics and Finance. PhD Thesis, Politecnico di Milano, 2023

  35. [35]

    Filippo de Feo, Salvatore Federico, and Andrzej Święch. Optimal control of stochastic delay differential equations and applications to path-dependent financial and economic models.SIAM Journal on Control and Optimization, 62(3):1490– 1520, 2024

  36. [36]

    Stochastic optimal control of interacting particle systems in Hilbert spaces and applications.arXiv preprint arXiv:2511.21646, 2025

    Filippo de Feo, Fausto Gozzi, Andrzej Święch, and Lukas Wessels. Stochastic optimal control of interacting particle systems in Hilbert spaces and applications.arXiv preprint arXiv:2511.21646, 2025

  37. [37]

    A counterexample to the approximation problem in banach spaces

    Per Enflo. A counterexample to the approximation problem in banach spaces. 1973

  38. [38]

    Springer, 2000

    Klaus-Jochen Engel and Rainer Nagel.One-parameter semigroups for linear evolution equations. Springer, 2000

  39. [39]

    Springer, 2017

    Giorgio Fabbri, Fausto Gozzi, and Andrzej Święch.Stochastic Optimal Control in Infinite Dimension: Dynamic Pro- gramming and HJB Equations. Springer, 2017

  40. [40]

    Springer, 2011

    Marián Fabian, Petr Habala, Petr Hájek, Vicente Montesinos, and Václav Zizler.Banach space theory: The basis for linear and nonlinear analysis, volume 1. Springer, 2011

  41. [41]

    V. S. Fanaskov and I. V. Oseledets. Spectral neural operators.Doklady Mathematics, 108:S226–S232, 2023

  42. [42]

    An infinite-dimensional approach to path-dependent Kolmogorov equations.The Annals of Probability, 44(4):2643 – 2693, 2016

    Franco Flandoli and Giovanni Zanco. An infinite-dimensional approach to path-dependent Kolmogorov equations.The Annals of Probability, 44(4):2643 – 2693, 2016

  43. [43]

    Fleming and M

    W.H. Fleming and M. Nisio. Differential games for stochastic partial differential equations.Nagoya Mathematical Journal, 131:75–107, 1993

  44. [44]

    T. M. Flett.Differential Analysis: Differentiation, Differential Equations and Differential Inequalities. Cambridge University Press, 1980

  45. [45]

    Folland.Real Analysis: Modern Techniques and Their Applications

    Gerald B. Folland.Real Analysis: Modern Techniques and Their Applications. Wiley, second edition, 1999

  46. [46]

    Density of subalgebras of lipschitz functions in metric sobolev spaces and applications to wasserstein sobolev spaces.Journal of Functional Analysis, 285(11):110153, 2023

    Massimo Fornasier, Giuseppe Savaré, and Giacomo Enrico Sodini. Density of subalgebras of lipschitz functions in metric sobolev spaces and applications to wasserstein sobolev spaces.Journal of Functional Analysis, 285(11):110153, 2023

  47. [47]

    Smooth bump functions and the geometry of banach spaces: a brief survey.Expositiones Mathematicae, 20(2):143–183, 2002

    Robb Fry and S McManus. Smooth bump functions and the geometry of banach spaces: a brief survey.Expositiones Mathematicae, 20(2):143–183, 2002

  48. [48]

    Marco Fuhrman, Federica Masiero, and Gianmario Tessitore. Stochastic equations with delay: Optimal control via BSDEs and regular solutions of Hamilton–Jacobi–Bellman equations.SIAM Journal on Control and Optimization, 48(7):4624–4651, 2010

  49. [49]

    Finite dimensional approximations of Hamilton–Jacobi–Bellman equations in spaces of probability measures.SIAM Journal on Mathematical Analysis, 53(2):1320–1356, 2021

    Wilfrid Gangbo, Sergio Mayorga, and Andrzej Święch. Finite dimensional approximations of Hamilton–Jacobi–Bellman equations in spaces of probability measures.SIAM Journal on Mathematical Analysis, 53(2):1320–1356, 2021

  50. [50]

    Springer, 2020

    Nicola Gigli, Enrico Pasqualetto, et al.Lectures on nonsmooth differential geometry, volume 2. Springer, 2020

  51. [51]

    Infinite-dimensional Lie groups without completeness restrictions.Banach Center Publications, 55:43– 59, 2002

    Helge Glöckner. Infinite-dimensional Lie groups without completeness restrictions.Banach Center Publications, 55:43– 59, 2002. 44

  52. [52]

    Jinwoo Go and Peng Chen. Accurate, scalable, and efficient Bayesian optimal experimental design with derivative- informed neural operators.Computer Methods in Applied Mechanics and Engineering, 438:117845, 2025

  53. [53]

    Sequential infinite-dimensional bayesian optimal experimental design with derivative- informed latent attention neural operator.Journal of Computational Physics, 532:113976, 2025

    Jinwoo Go and Peng Chen. Sequential infinite-dimensional bayesian optimal experimental design with derivative- informed latent attention neural operator.Journal of Computational Physics, 532:113976, 2025

  54. [54]

    New universal operator approximation theorem for encoder-decoder architectures (preprint).arXiv preprint arXiv:2503.24092, 2025

    Janek Gödeke and Pascal Fernsel. New universal operator approximation theorem for encoder-decoder architectures (preprint).arXiv preprint arXiv:2503.24092, 2025

  55. [55]

    Shape derivative- informed neural operators with application to risk-averse shape optimization.arXiv preprint arXiv:2603.03211, 2026

    Xindi Gong, Dingcheng Luo, Thomas O’Leary-Roseberry, Ruanui Nicholson, and Omar Ghattas. Shape derivative- informed neural operators with application to risk-averse shape optimization.arXiv preprint arXiv:2603.03211, 2026

  56. [56]

    The Volterra signature.arXiv preprint arXiv:2603.04525, 2026

    Paul P. Hager, Fabian N. Harang, Luca Pelizzari, and Samy Tindel. The Volterra signature.arXiv preprint arXiv:2603.04525, 2026

  57. [57]

    Number 27

    Juha Heinonen, Pekka Koskela, Nageswari Shanmugalingam, and Jeremy T Tyson.Sobolev spaces on metric measure spaces. Number 27. cambridge university press, 2015

  58. [58]

    Springer Science & Business Media, 2008

    Michael Hinze, René Pinnau, Michael Ulbrich, and Stefan Ulbrich.Optimization with PDE constraints. Springer Science & Business Media, 2008

  59. [59]

    Minimal width for universal property of deep RNN.Journal of Machine Learning Research, 24(121):1–41, 2023

    Chang hoon Song, Geonho Hwang, Jun ho Lee, and Myungjoo Kang. Minimal width for universal property of deep RNN.Journal of Machine Learning Research, 24(121):1–41, 2023

  60. [60]

    Statistical hydromechanics and functional calculus.Journal of Rational Mechanics and Analysis, 1:87–123, 1952

    Eberhard Hopf. Statistical hydromechanics and functional calculus.Journal of Rational Mechanics and Analysis, 1:87–123, 1952

  61. [61]

    Approximation capabilities of multilayer feedforward networks.Neural networks, 4(2):251–257, 1991

    Kurt Hornik. Approximation capabilities of multilayer feedforward networks.Neural networks, 4(2):251–257, 1991

  62. [62]

    Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks.Neural networks, 3(5):551–560, 1990

    Kurt Hornik, Maxwell Stinchcombe, and Halbert White. Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks.Neural networks, 3(5):551–560, 1990

  63. [63]

    Tackling the curse of dimensionality with physics-informed neural networks.Neural Networks, 176:106369, 2024

    Zheyuan Hu, Khemraj Shukla, George Em Karniadakis, and Kenji Kawaguchi. Tackling the curse of dimensionality with physics-informed neural networks.Neural Networks, 176:106369, 2024

  64. [64]

    Basis operator network: A neural network-based model for learning nonlinear operators via neural basis.Neural Networks, 164:21–37, 2023

    Ning Hua and Wenlian Lu. Basis operator network: A neural network-based model for learning nonlinear operators via neural basis.Neural Networks, 164:21–37, 2023

  65. [65]

    An operator learning perspective on parameter- to-observable maps.arXiv preprint arXiv:2402.06031, 2024

    Daniel Zhengyu Huang, Nicholas H Nelsen, and Margaret Trautner. An operator learning perspective on parameter- to-observable maps.arXiv preprint arXiv:2402.06031, 2024

  66. [66]

    Universal approximation property of fully convolutional neural networks with zero padding.Applied and Computational Harmonic Analysis, 82:101833, 2026

    Geonho Hwang and Myungjoo Kang. Universal approximation property of fully convolutional neural networks with zero padding.Applied and Computational Harmonic Analysis, 82:101833, 2026

  67. [67]

    Mionet: Learning multiple-input operators via tensor product.SIAM Journal on Scientific Computing, 44(6):A3490–A3514, 2022

    Pengzhan Jin, Shuai Meng, and Lu Lu. Mionet: Learning multiple-input operators via tensor product.SIAM Journal on Scientific Computing, 44(6):A3490–A3514, 2022

  68. [68]

    Spaces of compact operators.Mathematische Annalen, 208(4):267–278, 1974

    Nigel J Kalton. Spaces of compact operators.Mathematische Annalen, 208(4):267–278, 1974

  69. [69]

    Springer Berlin, 1995

    Tosio Kato.Perturbation Theory for Linear Operators. Springer Berlin, 1995

  70. [70]

    Springer, 2006

    Hans Heinrich Keller.Differential calculus in locally convex spaces. Springer, 2006

  71. [71]

    Optimal control of partial differential equations in PyTorch using automatic differentiation and neural network surrogates.arXiv.OC, 2024

    Denis Khimin, Julian Roth, Alexander Henkes, and Thomas Wick. Optimal control of partial differential equations in PyTorch using automatic differentiation and neural network surrogates.arXiv.OC, 2024

  72. [72]

    Universal approximation with deep narrow networks

    Patrick Kidger and Terry Lyons. Universal approximation with deep narrow networks. InConference on Learning Theory, pages 2306–2327. PMLR, 2020

  73. [73]

    On universal approximation and error bounds for Fourier neural operators.Journal of Machine Learning Research, 22(290):1–76, 2021

    Nikola Kovachki, Samuel Lanthaler, and Siddhartha Mishra. On universal approximation and error bounds for Fourier neural operators.Journal of Machine Learning Research, 22(290):1–76, 2021

  74. [74]

    Neural operator: Learning maps between function spaces with applications to PDEs.Journal of Machine Learning Research, 2023

    Nikola Kovachki, Zongyi Li, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Neural operator: Learning maps between function spaces with applications to PDEs.Journal of Machine Learning Research, 2023

  75. [75]

    Operator learning: Algorithms and analysis.Handbook of Numerical Analysis, 25:419–467, 2024

    Nikola B Kovachki, Samuel Lanthaler, and Andrew M Stuart. Operator learning: Algorithms and analysis.Handbook of Numerical Analysis, 25:419–467, 2024

  76. [76]

    How to define reasonably weighted sobolev spaces.Commentationes Mathematicae Universitatis Carolinae, 25(3):537–554, 1984

    Alois Kufner and Bohumír Opic. How to define reasonably weighted sobolev spaces.Commentationes Mathematicae Universitatis Carolinae, 25(3):537–554, 1984

  77. [77]

    Error estimates for DeepONets: a deep learning framework in infinite dimensions.Transactions of Mathematics and Its Application, 6(1):1–141, 2022

    Samuel Lanthaler, Siddhartha Mishra, and George E Karniadakis. Error estimates for DeepONets: a deep learning framework in infinite dimensions.Transactions of Mathematics and Its Application, 6(1):1–141, 2022

  78. [78]

    Learning in mean field games: A survey.arXiv preprint arXiv:2205.12944, 2022

    Mathieu Laurière, Sarah Perrin, Julien Pérolat, Sertan Girgin, Paul Muller, Romuald Élie, Matthieu Geist, and Olivier Pietquin. Learning in mean field games: A survey.arXiv preprint arXiv:2205.12944, 2022

  79. [79]

    Springer Science & Business Media, 2012

    Xungjing Li and Jiongmin Yong.Optimal Control Theory for Infinite Dimensional Systems. Springer Science & Business Media, 2012. 45

  80. [80]

    Classics in mathematics

    Joram Lindenstrauss.Classical Banach spaces I and II. Classics in mathematics. Springer-Verlag, Berlin ; London, 1996

Showing first 80 references.