Can We Formally Verify Neural PDE Surrogates? SMT Compilation of Small Fourier Neural Operators
Pith reviewed 2026-05-12 01:46 UTC · model grok-4.3
The pith
Once trained with fixed weights and grid, small Fourier Neural Operators become piecewise-linear functions that SMT solvers can verify exactly for physical properties.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
With trained weights and grid fixed, the spectral convolution in an FNO is a linear map. As a result, the full forward pass is piecewise-linear and can be represented exactly in Z3's linear real arithmetic. The exact encoding compiles the spectral path into dense matrix multiplication, which is sound for both proving and disproving queries such as positivity and mass non-increase.
What carries the argument
The exact SMT encoding that converts the FNO spectral convolution into a dense matrix multiplication inside linear real arithmetic, enabling sound verification of the full piecewise-linear forward pass.
If this is right
- Sound positivity proofs are obtained for all ReLU-free models.
- Sound counterexamples to positivity are produced on five of the ten models.
- Sound counterexamples to mass non-increase are produced on all ten models.
- The frozen encoding scales to grid size 64 with sub-second checks but loses soundness for the original network.
- ReLU activations cause three positivity queries to time out under the exact encoding.
Where Pith is reading between the lines
- Restricting neural operators to piecewise-linear activations from the design stage could make larger instances directly amenable to the same SMT compilation.
- A hybrid workflow that first uses SMT on the linear core and then gradient or Monte Carlo search on the remaining nonlinear pieces could cover models that currently time out.
- The same linear-map reduction might apply to other spectral neural operators beyond FNOs if their Fourier layers are likewise frozen after training.
Load-bearing premise
The studied FNO models stay very small, with 85-117 parameters and grids of size 8-32, so that the SMT solver finishes the queries without timing out.
What would settle it
An FNO instance on which the exact encoding returns unsat for a mass-violation query, yet direct evaluation of the original network on the same input point shows mass increase.
Figures
read the original abstract
Fourier Neural Operators (FNOs) can greatly accelerate PDE simulation, but they are often used without formal guarantees that they preserve basic physical structure. We show that, once the trained weights and grid are fixed, the spectral convolution in an FNO is a linear map. As a result, the full forward pass is piecewise-linear and can be represented exactly in Z3's linear real arithmetic. We study two encodings. The exact encoding compiles the spectral convolution into a dense matrix multiplication, which is sound for both proofs and counterexamples. The lighter frozen encoding replaces the spectral path with a constant, making it faster but approximate. On 10 small FNO surrogates for 1D advection-diffusion-reaction (85 to 117 parameters, grids 8 to 32), the exact encoding gives 2 sound positivity proofs on linear (ReLU-free) models, 5 sound positivity counterexamples, and 10 sound mass-violation counterexamples; the remaining 3 positivity queries on ReLU models time out. For mass non-increase, Z3 finds worse counterexamples than both gradient-based falsification and Monte Carlo on 7 of 10 models. The frozen encoding scales to grid size 64 with sub-second positivity checks, but it no longer provides certificates for the original FNO. Overall, the results make the soundness--scalability tradeoff explicit and point to what is needed for formal verification of production-scale neural operators.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript claims that Fourier Neural Operators (FNOs), once trained weights and discretization grid are fixed, have a linear spectral convolution step, making the full model piecewise linear and exactly encodable in Z3's linear real arithmetic for SMT-based formal verification. Two encodings are studied: an exact one compiling to matrix multiplications (sound for proofs and counterexamples) and a frozen approximate one for speed. Experiments on 10 small 1D FNO surrogates (85-117 params, grids 8-32) for advection-diffusion-reaction PDEs show sound positivity proofs for 2 linear models, counterexamples for 5, timeouts for 3 ReLU models, and mass-violation counterexamples for all 10; the frozen version scales to larger grids but is approximate.
Significance. If the central encoding holds, this provides a novel approach to formally verifying properties like positivity and conservation in neural PDE surrogates using off-the-shelf SMT solvers. The work is significant for its explicit demonstration of soundness on the exact encoding and for highlighting practical limitations such as timeouts. It credits the use of standard linear real arithmetic without additional assumptions or fitted parameters, and the production of machine-checked proofs and counterexamples.
major comments (2)
- [Experimental results] Experimental results section: the 3 timeouts on positivity queries for ReLU models (out of 10 total) limit the exact encoding's coverage to linear cases only, which is load-bearing for the claim that FNOs can be formally verified in general.
- [Frozen encoding description] Frozen encoding description: replacing the spectral path with a constant makes the encoding approximate and unable to certify the original FNO; without quantified error bounds on this approximation, the scalability results to grid size 64 cannot be directly compared to the exact method.
minor comments (2)
- The results across the 10 models are reported only as aggregate counts without statistical analysis, variance, or per-model breakdowns, which would strengthen the reproducibility of the findings.
- Notation for the compilation of spectral convolution to dense matrix multiplication could be clarified with an explicit equation or pseudocode in the methods section.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive feedback. We address the two major comments below, agreeing with the identified limitations while clarifying the manuscript's scope and proposing targeted revisions for improved precision.
read point-by-point responses
-
Referee: [Experimental results] Experimental results section: the 3 timeouts on positivity queries for ReLU models (out of 10 total) limit the exact encoding's coverage to linear cases only, which is load-bearing for the claim that FNOs can be formally verified in general.
Authors: We agree that the three timeouts for ReLU models demonstrate that the exact encoding, as implemented, is currently practical only for linear FNOs. However, the manuscript does not advance a claim that FNOs can be formally verified in general. The title poses an open question, the abstract and conclusion explicitly report the timeouts, and the final paragraph frames the contribution around the observed soundness-scalability tradeoff. The sound proofs and counterexamples obtained for the linear models, together with the mass-violation results for all ten models, still establish feasibility for small fixed instances. We will revise the experimental results section to state more explicitly that the exact encoding is presently limited to linear activations and to flag efficient handling of ReLU (or other nonlinearities) as future work. revision: partial
-
Referee: [Frozen encoding description] Frozen encoding description: replacing the spectral path with a constant makes the encoding approximate and unable to certify the original FNO; without quantified error bounds on this approximation, the scalability results to grid size 64 cannot be directly compared to the exact method.
Authors: We agree that the frozen encoding is approximate and does not certify the original FNO, a point already noted in the manuscript. The grid-size-64 results are presented to illustrate the computational benefit of the lighter encoding rather than to claim equivalence with the exact method. We acknowledge that the absence of quantified error bounds prevents a rigorous comparison. Deriving such bounds is non-trivial and outside the current scope; we will therefore add a clarifying sentence in the frozen-encoding subsection stating that the reported timings are for an approximation and that error analysis remains future work. revision: yes
Circularity Check
No significant circularity identified
full rationale
The paper's derivation rests on the direct mathematical observation that fixing the trained weights and discretization grid turns each FNO spectral convolution into an ordinary matrix multiplication (FFT → mode-wise multiply → IFFT), rendering the full forward pass piecewise-linear and therefore exactly representable in Z3's linear real arithmetic. This reduction follows from the standard definition of the FNO architecture and linear algebra; it is not obtained by fitting parameters to data, by renaming an empirical pattern, or by any self-citation chain. The authors explicitly document the encoding's limitations (timeouts on ReLU models, approximation in the frozen variant) and evaluate soundness only on the small instances they can solve, leaving the central claim self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Spectral convolution with fixed trained weights and grid is a linear map
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
once the trained weights and grid are fixed, the spectral convolution in an FNO is a linear map... the full forward pass is piecewise-linear and can be represented exactly in Z3's linear real arithmetic
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We build W_spec_ℓ by applying the spectral convolution to each standard basis vector
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
International Conference on Learning Representations , year=
Fourier Neural Operator for Parametric Partial Differential Equations , author=. International Conference on Learning Representations , year=
-
[2]
Learning nonlinear operators via
Lu, Lu and Jin, Pengzhan and Pang, Guofei and Zhang, Zhongqiang and Karniadakis, George Em , journal=. Learning nonlinear operators via
-
[3]
Kochenderfer and Ali Baheri , title =
Zahra Shahrooei and Mykel J. Kochenderfer and Ali Baheri , title =. European Control Conference (. 2023 , doi =
work page 2023
- [4]
-
[5]
Ali Baheri and Hao Ren and Benjamin Johnson and Pouria Razzaghi and Peng Wei , title =. 2022 , eprint =. doi:10.48550/arXiv.2205.04590 , url =
-
[6]
Journal of Computational Physics , volume=
Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , author=. Journal of Computational Physics , volume=
-
[7]
Katz, Guy and Barrett, Clark and Dill, David L and Julian, Kyle and Kochenderfer, Mykel J , booktitle=. Reluplex: An efficient
-
[8]
Wang, Shiqi and Zhang, Huan and Xu, Kaidi and Lin, Xue and Jana, Suman and Hsieh, Cho-Jui and Kolter, J Zico , booktitle=. Beta-
-
[9]
de Moura, Leonardo and Bj. International Conference on Tools and Algorithms for the Construction and Analysis of Systems , pages=
-
[10]
Liu, Jun and Meng, Yiming and Fitzsimmons, Maxwell and Zhou, Ruikun , booktitle=
-
[11]
Gorard, Jonathan and Hakim, Ammar and Juno, James , journal=
-
[12]
Journal of Computational Physics , pages=
Enforcing exact physics in scientific machine learning: a data-driven exterior calculus on graphs , author=. Journal of Computational Physics , pages=
-
[13]
On the effectiveness of interval bound propagation for training verifiably robust models
On the effectiveness of interval bound propagation for training verifiably robust models , author=. arXiv preprint arXiv:1810.12715 , year=
-
[14]
Neural operator: Learning maps between function spaces with applications to
Kovachki, Nikola and Li, Zongyi and Liu, Burigede and Azizzadenesheli, Kamyar and Bhattacharya, Kaushik and Stuart, Andrew and Anandkumar, Anima , journal=. Neural operator: Learning maps between function spaces with applications to
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.