pith. sign in

arxiv: 2606.20972 · v1 · pith:EM3OGYHUnew · submitted 2026-06-18 · 🧮 math.NA · cs.NA

Neural network approximation in discrete dual norms with adaptive test spaces

Pith reviewed 2026-06-26 15:54 UTC · model grok-4.3

classification 🧮 math.NA cs.NA
keywords RVPINNsadaptive test spacesRiesz representativerefinement indicatorsaturation assumptionvariational residualerror estimationelliptic problems
0
0 comments X

The pith

A computable refinement indicator under the saturation assumption reliably estimates the discrepancy between discrete and continuous Riesz representatives in RVPINNs.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops an adaptive strategy for enriching the discrete test space in RVPINNs to better capture the continuous Riesz representative of the variational residual. This is motivated by the fact that fixed coarse test spaces can fail to resolve localized singularities, steep gradients, or interface layers during training. Theoretical adaptive strategies are established along with their error bounds. A computable refinement indicator is proposed and shown to be a reliable and efficient estimator for the non-computable discrepancy under the saturation assumption. A practical algorithm is presented and tested on elliptic Dirichlet problems.

Core claim

The central claim is that within the RVPINN framework, an adaptive algorithm enriches the test space only where the error between the discrete and continuous Riesz representatives is pronounced, supported by theoretical error bounds and a computable refinement indicator that under the saturation assumption reliably and efficiently estimates the discrepancy between those representatives.

What carries the argument

The refinement indicator for the discrepancy between discrete and continuous Riesz representatives of the variational residual, which serves as an error estimator under the saturation assumption.

If this is right

  • The remainder term in the loss function robustness guarantee can be controlled through targeted test space enrichment.
  • Fixed coarse test spaces can be avoided in favor of adaptive ones for efficiency in problems with localized features.
  • Theoretical error bounds are derived for the adaptive strategies in the RVPINN setting.
  • Numerical experiments confirm the practical adaptive algorithm's effectiveness on elliptic Dirichlet problems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the saturation assumption holds across a broader class of problems, the method could reduce computational costs in training neural networks for variational problems with sharp features.
  • The approach might be combined with other residual-based adaptation techniques in finite element methods.
  • Verification of the saturation assumption in specific applications would strengthen the practical utility of the estimator.

Load-bearing premise

The saturation assumption is required to prove that the refinement indicator is reliable and efficient, but its validity is not established for general cases.

What would settle it

A numerical test case where the saturation assumption does not hold and the refinement indicator fails to bound or efficiently estimate the actual discrepancy between the discrete and continuous Riesz representatives.

Figures

Figures reproduced from arXiv: 2606.20972 by Ignacio Brevis, Kristoffer G. van der Zee, Sergio Rojas, Tanakorn Udomworarat.

Figure 1
Figure 1. Figure 1: Test-space meshes for the smooth solution. [PITH_FULL_IMAGE:figures/full_fig_p018_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Results for the smooth solution. approximation for this example is approximately 3, exhibiting a faster empirical de￾cay than the reference piecewise-linear FEM solution rate of 0.5 (cf. [10]) when both errors are plotted against the test-space dimension. This reflects the high expressivity of the fixed neural network architecture, although this comparison does not account for the overall computational cos… view at source ↗
Figure 3
Figure 3. Figure 3: Test-space mesh and pointwise absolute error for the kink solution. [PITH_FULL_IMAGE:figures/full_fig_p020_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Results for the kink solution. coordinates as: (6.9) u(x, y) = p3 x 2 + y 2 sin  2 3 (π − atan2(y, x)) , where atan2 denotes the four-quadrant inverse tangent. Note that the solution ex￾hibits a corner singularity at the origin (0, 0), where the gradient becomes unbounded as (x, y) → (0, 0), and belongs to the fractional Sobolev space H5/3−ϵ (Ω) for every ϵ > 0 [25]. The composite residual formulation of… view at source ↗
Figure 5
Figure 5. Figure 5: Test-space meshes and pointwise absolute error for the singular solution. [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Results for the singular solution [PITH_FULL_IMAGE:figures/full_fig_p023_6.png] view at source ↗
read the original abstract

In robust variational physics-informed neural networks (RVPINNs), the loss function is formulated in terms of the Riesz representative of the variational residual within a discrete test space. This approach guarantees that the loss function is robust with respect to the true error in the energy norm up to a remainder term that depends on both the neural network approximation and the discrete space configuration. However, in problems with localized singularities, steep gradients, or interface layers, a fixed coarse test space may fail to resolve the continuous Riesz representative of the residual during training. Although this can be avoided by using a sufficiently fine test space from the start, doing so may be computationally inefficient. We therefore propose an adaptive algorithm that enriches the test space only where the error between the discrete and continuous Riesz representatives is pronounced. We establish theoretical adaptive strategies within the RVPINN framework and derive their error bounds. Furthermore, we propose a computable refinement indicator and prove that, under the saturation assumption, it serves as a reliable and efficient error estimator for the non-computable discrepancy between the discrete and continuous Riesz representatives. Finally, we propose a practical adaptive algorithm and demonstrate its effectiveness through numerical experiments on elliptic Dirichlet problems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The paper proposes an adaptive enrichment strategy for the discrete test space in robust variational physics-informed neural networks (RVPINNs) to better approximate the continuous Riesz representative of the variational residual. It derives theoretical adaptive strategies and associated error bounds, introduces a computable refinement indicator, and proves that this indicator is reliable and efficient for estimating the non-computable discrepancy between discrete and continuous Riesz representatives under a saturation assumption. The approach is demonstrated via numerical experiments on elliptic Dirichlet problems with localized features.

Significance. If the saturation assumption is satisfied throughout the adaptive process, the work provides a theoretically grounded and practically useful mechanism for dynamically refining test spaces in RVPINNs, improving efficiency for problems with singularities or layers without requiring an a priori fine discretization. The combination of error bounds, a computable indicator, and numerical validation on model problems strengthens the contribution to adaptive PINN methodologies.

major comments (1)
  1. [Abstract / proof of refinement indicator] Abstract and the section presenting the proof of the refinement indicator: the reliability and efficiency of the proposed computable indicator are established only conditionally on the saturation assumption, yet no derivation, sufficient conditions, or numerical verification is supplied showing that the assumption holds (or can be ensured) for the specific adaptive enrichment algorithm when the enriched discrete space is still approximating the continuous Riesz representative.
minor comments (2)
  1. Notation for the discrete and continuous Riesz representatives should be introduced with explicit definitions before the refinement indicator is defined, to improve readability of the error bounds.
  2. The numerical experiments section would benefit from a table comparing iteration counts or degrees of freedom against a fixed fine test-space baseline to quantify the efficiency gain.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the detailed review and constructive feedback. We address the single major comment below and are prepared to revise the manuscript accordingly.

read point-by-point responses
  1. Referee: [Abstract / proof of refinement indicator] Abstract and the section presenting the proof of the refinement indicator: the reliability and efficiency of the proposed computable indicator are established only conditionally on the saturation assumption, yet no derivation, sufficient conditions, or numerical verification is supplied showing that the assumption holds (or can be ensured) for the specific adaptive enrichment algorithm when the enriched discrete space is still approximating the continuous Riesz representative.

    Authors: The manuscript states explicitly that the reliability and efficiency of the computable refinement indicator hold under the saturation assumption (see abstract and the relevant theorem). This is a standard hypothesis in the a posteriori error estimation and adaptive finite-element literature, used to guarantee that the discrete test space remains sufficiently rich relative to the continuous Riesz representative. Deriving general sufficient conditions that guarantee the assumption throughout the specific adaptive enrichment procedure lies outside the scope of the present contribution, which focuses on establishing the indicator properties once the assumption is granted. Nevertheless, the numerical experiments on elliptic problems with localized singularities (Section 5) show that the adaptive algorithm converges reliably and that the discrete-continuous discrepancy decreases, providing indirect evidence that the assumption is satisfied in the tested regimes. We agree that a short additional discussion of the saturation assumption, together with a simple numerical check of the discrepancy evolution during adaptation, would strengthen the presentation and will incorporate such material in the revised version. revision: yes

Circularity Check

0 steps flagged

No circularity; claims explicitly conditional on saturation assumption with independent derivation

full rationale

The paper proposes an adaptive enrichment algorithm for test spaces in RVPINNs and derives error bounds for a computable refinement indicator. The key statement is that the indicator 'serves as a reliable and efficient error estimator ... under the saturation assumption' for the discrepancy between discrete and continuous Riesz representatives. This is an explicit premise rather than a derived or fitted quantity; no equation reduces to itself by definition, no parameter is fitted to data and renamed a prediction, and no load-bearing step relies on a self-citation chain or imported uniqueness theorem. The derivation chain remains self-contained once the assumption is granted, with no renaming of known results or ansatz smuggling visible in the text.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claims rest on standard variational theory for elliptic problems plus the saturation assumption; no free parameters or invented entities are mentioned in the abstract.

axioms (1)
  • domain assumption Saturation assumption
    Invoked to prove that the refinement indicator reliably estimates the discrete-continuous Riesz discrepancy.

pith-pipeline@v0.9.1-grok · 5752 in / 1307 out tokens · 25037 ms · 2026-06-26T15:54:35.569641+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

52 extracted references · 2 canonical work pages

  1. [1]

    doi:10.1137/140951758 , journal =

    An Adaptive Shifted Power Method for Computing Generalized Tensor Eigenpairs , author =. doi:10.1137/140951758 , journal =

  2. [2]

    Nick Higham , title =

  3. [3]

    Kolda and Ali Pinar , eprint =

    Chengbin Peng and Tamara G. Kolda and Ali Pinar , eprint =. Accelerating Community Detection by Using

  4. [4]

    and Zhang, Shanrong and Merritt, Matthew E

    Woessner, Donald E. and Zhang, Shanrong and Merritt, Matthew E. and Sherry, A. Dean , title =. Magnetic Resonance in Medicine , doi =

  5. [5]

    2003 , eid =

    Properties of Highly Clustered Networks , author =. 2003 , eid =. doi:10.1103/PhysRevE.68.026121 , journal =

  6. [6]

    Clawpack Software , author =

  7. [7]

    : A Document Preparation System

    Leslie Lamport. : A Document Preparation System. 1986

  8. [8]

    Frank Mittlebach and Michel Goossens , title =

  9. [9]

    and Van Loan, Charles F

    Golub, Gene H. and Van Loan, Charles F. , title =

  10. [10]

    Paul Dawkins , title =

  11. [11]

    User's Guide for the

  12. [12]

    Michael Downes , title =

  13. [13]

    Christian Feuers\"anger , title =

  14. [14]

    Computer Methods in Applied Mechanics and Engineering , volume=

    Multi-level neural networks for accurate solutions of boundary-value problems , author=. Computer Methods in Applied Mechanics and Engineering , volume=. 2024 , publisher=

  15. [15]

    Computer Methods in Applied Mechanics and Engineering , volume=

    Adaptive finite element interpolated neural networks , author=. Computer Methods in Applied Mechanics and Engineering , volume=. 2025 , publisher=

  16. [16]

    Journal of Scientific Computing , volume=

    Variational physics informed neural networks: the role of quadratures and test functions , author=. Journal of Scientific Computing , volume=. 2022 , publisher=

  17. [17]

    2024 , publisher=

    Berrone, Stefano and Pintore, Moreno , journal=. 2024 , publisher=

  18. [18]

    2022 , publisher=

    Brevis, Ignacio and Muga, Ignacio and van der Zee, Kristoffer G , journal=. 2022 , publisher=

  19. [19]

    Annual review of fluid mechanics , volume=

    Machine learning for fluid mechanics , author=. Annual review of fluid mechanics , volume=. 2020 , publisher=

  20. [20]

    2021 , publisher=

    Cai, Shengze and Mao, Zhiping and Wang, Zhicheng and Yin, Minglang and Karniadakis, George Em , journal=. 2021 , publisher=

  21. [21]

    SIAM Journal on Numerical Analysis , volume=

    Quasi-optimal convergence rate for an adaptive finite element method , author=. SIAM Journal on Numerical Analysis , volume=. 2008 , publisher=

  22. [22]

    Optics express , volume=

    Physics-informed neural networks for inverse problems in nano-optics and metamaterials , author=. Optics express , volume=. 2020 , publisher=

  23. [23]

    2012 , publisher=

    Cohen, Albert and DeVore, Ronald and Nochetto, Ricardo H , journal=. 2012 , publisher=

  24. [24]

    Demkowicz, Leszek , journal=

  25. [25]

    The double adaptivity paradigm:(How to circumvent the discrete inf--sup conditions of Babu

    Demkowicz, Leszek and F. The double adaptivity paradigm:(How to circumvent the discrete inf--sup conditions of Babu. Computers & Mathematics with Applications , volume=. 2021 , publisher=

  26. [26]

    SIAM Journal on Numerical Analysis , volume=

    D. SIAM Journal on Numerical Analysis , volume=. 1996 , publisher=

  27. [27]

    arXiv preprint arXiv:2507.06017 , year=

    A posteriori analysis of neural network approximations , author=. arXiv preprint arXiv:2507.06017 , year=

  28. [28]

    2023 , publisher=

    Gao, Zhiwei and Yan, Liang and Zhou, Tao , journal=. 2023 , publisher=

  29. [29]

    2024 , publisher=

    Gao, Zhiwei and Tang, Tao and Yan, Liang and Zhou, Tao , journal=. 2024 , publisher=

  30. [30]

    2026 , issn =

    Computer Methods in Applied Mechanics and Engineering , volume =. 2026 , issn =

  31. [31]

    2021 , publisher=

    Kharazmi, Ehsan and Zhang, Zhongqiang and Karniadakis, George Em , journal=. 2021 , publisher=

  32. [32]

    arXiv preprint arXiv:1912.00873 , year=

    Variational physics-informed neural networks for solving partial differential equations , author=. arXiv preprint arXiv:1912.00873 , year=

  33. [33]

    Kingma, Diederik P and Ba, Jimmy , journal=

  34. [34]

    Nature Reviews Genetics , volume=

    Machine learning applications in genetics and genomics , author=. Nature Reviews Genetics , volume=. 2015 , publisher=

  35. [35]

    2020 , publisher=

    Mao, Zhiping and Jagtap, Ameya D and Karniadakis, George Em , journal=. 2020 , publisher=

  36. [36]

    SIAM Journal on Numerical Analysis , volume=

    An adaptive superconvergent mixed finite element method based on local residual minimization , author=. SIAM Journal on Numerical Analysis , volume=. 2023 , publisher=

  37. [37]

    Foundations of computational mathematics , volume=

    Topological properties of the set of functions generated by neural networks of fixed size , author=. Foundations of computational mathematics , volume=. 2021 , publisher=

  38. [38]

    Journal of Computational physics , volume=

    Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , author=. Journal of Computational physics , volume=. 2019 , publisher=

  39. [39]

    2022 , publisher=

    Rasht-Behesht, Majid and Huber, Christian and Shukla, Khemraj and Karniadakis, George Em , journal=. 2022 , publisher=

  40. [40]

    Computer Methods in Applied Mechanics and Engineering , volume=

    Robust variational physics-informed neural networks , author=. Computer Methods in Applied Mechanics and Engineering , volume=. 2024 , publisher=

  41. [41]

    2018 , publisher=

    Sirignano, Justin and Spiliopoulos, Konstantinos , journal=. 2018 , publisher=

  42. [42]

    2023 , publisher=

    Tang, Kejun and Wan, Xiaoliang and Yang, Chao , journal=. 2023 , publisher=

  43. [43]

    Neural network methods for Neumann series problems of

    Udomworarat, Tanakorn and Brevis, Ignacio and Richter, Martin and Rojas, Sergio and Van Der Zee, Kristoffer G , journal=. Neural network methods for Neumann series problems of. 2026 , publisher=

  44. [44]

    Journal of Computational Physics , volume=

    A mesh-free method for interface problems using the deep learning approach , author=. Journal of Computational Physics , volume=. 2020 , publisher=

  45. [45]

    2018 , publisher=

    Yu, Bing and E, Weinan , journal=. 2018 , publisher=

  46. [46]

    2025 , publisher=

    Zhang, Zhengqi and Li, Jing and Liu, Bin , journal=. 2025 , publisher=

  47. [47]

    Ainsworth, Mark and Oden, J Tinsley , year=

  48. [48]

    2013 , publisher=

    Mixed finite element methods and applications , author=. 2013 , publisher=

  49. [49]

    2011 , publisher=

    Brezis, Haim and Br. 2011 , publisher=

  50. [50]

    2012 , publisher=

    Di Pietro, Daniele Antonio and Ern, Alexandre , volume=. 2012 , publisher=

  51. [51]

    2010 , publisher=

    Partial differential equations , author=. 2010 , publisher=

  52. [52]

    Mathematical Aspects of Deep Learning , publisher=

    Gühring, Ingo and Raslan, Mones and Kutyniok, Gitta , editor=. Mathematical Aspects of Deep Learning , publisher=. 2022 , pages=