One Operator to Rule Them All? On Boundary-Indexed Operator Families in Neural PDE Solvers

Lennon J. Shikhman

arxiv: 2603.01406 · v2 · pith:T6KNUNVSnew · submitted 2026-03-02 · 💻 cs.LG · cs.AI· cs.NA· math.NA

One Operator to Rule Them All? On Boundary-Indexed Operator Families in Neural PDE Solvers

Lennon J. Shikhman This is my paper

Pith reviewed 2026-05-21 11:18 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.NAmath.NA

keywords neural operatorsboundary conditionsPDE solversoperator learninggeneralizationPoisson equationconditional risk minimizationidentifiability

0 comments

The pith

Neural PDE solvers learn a family of operators indexed by training boundary conditions rather than one boundary-agnostic operator.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that when boundary conditions vary, the standard interpretation of neural PDE solvers as learning a single solution operator is incorrect. Training instead produces a boundary-indexed family of operators whose behavior is conditioned on the distribution of boundaries seen during optimization. This view follows from casting operator learning as conditional risk minimization over boundary conditions, which immediately implies that the learned mapping cannot be identified outside the training support. Experiments on the Poisson equation confirm the prediction: performance degrades sharply under boundary shifts, cross-boundary ensembles fail to transfer, and removing boundary cues collapses outputs to conditional expectations. The result means that success on varying forcings or resolutions gives no guarantee of success on new boundaries.

Core claim

We show that standard neural operator training implicitly learns a boundary-indexed family of operators, rather than a single boundary-agnostic operator, with the learned mapping fundamentally conditioned on the boundary-condition distribution seen during training. Framing operator learning as conditional risk minimization over boundary conditions leads to a non-identifiability result outside the support of the training boundary distribution. As a consequence, generalization in forcing terms or resolution does not imply generalization across boundary conditions.

What carries the argument

Boundary-indexed family of operators arising from conditional risk minimization over the training boundary distribution.

If this is right

Generalization to new forcing terms or grid resolutions does not extend to unseen boundary conditions.
Models trained on one boundary ensemble exhibit large errors when evaluated on a different boundary ensemble.
Omitting boundary information during inference causes the network to converge to the conditional expectation of the solution given the training boundary distribution.
Sharp performance degradation appears under controlled boundary-condition shifts on the Poisson equation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Explicit boundary encoding or conditioning mechanisms will likely be required before neural operators can serve as reliable foundation models for PDEs with variable boundaries.
Evaluation protocols for operator networks should treat boundary-condition shifts as a first-class generalization test alongside resolution and forcing changes.
The non-identifiability result suggests that broader sampling of boundary distributions during training can reduce but not eliminate gaps outside the observed support.

Load-bearing premise

Operator learning is correctly described as minimizing risk conditional on the boundary-condition distribution present in the training data.

What would settle it

A neural operator trained only on one ensemble of boundary conditions that nevertheless produces accurate solutions on an entirely disjoint ensemble of boundary conditions without receiving any explicit boundary input or retraining.

read the original abstract

Neural PDE solvers are often described as learning solution operators that map problem data to PDE solutions. In this work, we argue that this interpretation is generally incorrect when boundary conditions vary. We show that standard neural operator training implicitly learns a boundary-indexed family of operators, rather than a single boundary-agnostic operator, with the learned mapping fundamentally conditioned on the boundary-condition distribution seen during training. We formalize this perspective by framing operator learning as conditional risk minimization over boundary conditions, which leads to a non-identifiability result outside the support of the training boundary distribution. As a consequence, generalization in forcing terms or resolution does not imply generalization across boundary conditions. We support our theoretical analysis with controlled experiments on the Poisson equation, demonstrating sharp degradation under boundary-condition shifts, cross-distribution failures between distinct boundary ensembles, and convergence to conditional expectations when boundary information is removed. Our results clarify a core limitation of current neural PDE solvers and highlight the need for explicit boundary-aware modeling in the pursuit of foundation models for PDEs.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The paper claims that standard neural operator training for PDEs implicitly learns a boundary-indexed family of operators conditioned on the training boundary-condition distribution, rather than a single boundary-agnostic operator. This is formalized by recasting operator learning as conditional risk minimization over a joint distribution including boundary conditions, yielding a non-identifiability result outside the support of the training boundary distribution. Consequently, generalization in forcing terms or resolution does not imply generalization across boundary conditions. The argument is supported by controlled Poisson-equation experiments showing degradation under BC shifts, cross-ensemble failures, and convergence to conditional expectations when boundary information is withheld.

Significance. If the central claim holds, the work identifies a fundamental limitation in current neural PDE solvers that has direct consequences for building generalizable or foundation models for PDEs. The conditional-risk framing provides a clean theoretical lens, and the Poisson experiments are consistent with the predicted non-identifiability. The result encourages explicit boundary-aware modeling rather than assuming boundary-agnostic operators.

major comments (1)

§4 (Poisson experiments): the reported sharp degradation under boundary-condition shifts is presented without error bars, multiple random seeds, or statistical tests; this weakens the ability to assess whether the observed failures are robust or could be mitigated by standard training variations, which is load-bearing for the empirical support of the non-identifiability claim.

minor comments (2)

Abstract and §3: the non-identifiability statement would be clearer if the precise measure-theoretic support condition on the boundary distribution were stated explicitly rather than left implicit.
Notation in §2: the distinction between the learned operator G and the conditional expectation E[·|BC] should be introduced with a short table or explicit equation to avoid reader confusion when the boundary information is withheld.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the positive evaluation and recommendation of minor revision. The feedback on strengthening the statistical presentation of the empirical results is appreciated and will be incorporated.

read point-by-point responses

Referee: §4 (Poisson experiments): the reported sharp degradation under boundary-condition shifts is presented without error bars, multiple random seeds, or statistical tests; this weakens the ability to assess whether the observed failures are robust or could be mitigated by standard training variations, which is load-bearing for the empirical support of the non-identifiability claim.

Authors: We agree that reporting results from multiple random seeds with error bars would improve the robustness assessment of the Poisson experiments. In the revised manuscript we will rerun the key boundary-shift experiments across five independent random seeds, report mean values with standard deviations, and include error bars on the relevant figures. This addition will allow readers to evaluate consistency across training variations while preserving the controlled nature of the setup. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's central derivation frames standard neural operator training as conditional risk minimization over a joint distribution that includes boundary conditions. This framing directly produces the stated non-identifiability result outside the support of the training boundary distribution, without any reduction to fitted parameters, self-citations, or ansatzes that are defined in terms of the target claim. The Poisson experiments (degradation on BC shifts, cross-ensemble failure, convergence to conditional expectation) are presented as consistency checks for the framing rather than as independent derivations. No load-bearing step equates a prediction to its own inputs by construction, and the argument remains self-contained against external operator-learning objectives.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The argument rests on standard statistical-learning concepts applied to the neural-operator setting; no new free parameters, ad-hoc axioms, or invented entities are introduced in the abstract.

axioms (1)

domain assumption Operator learning can be framed as conditional risk minimization over boundary conditions
This framing is invoked to derive the non-identifiability result and is treated as a modeling choice rather than a derived fact.

pith-pipeline@v0.9.0 · 5713 in / 1284 out tokens · 60132 ms · 2026-05-21T11:18:03.340927+00:00 · methodology

One Operator to Rule Them All? On Boundary-Indexed Operator Families in Neural PDE Solvers

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)