Neural Preconditioned Born Series: A Metric-Matched Framework for Learning-based Preconditioners

Jiwei Jia; Juntao Wang; Xinliang Liu

arxiv: 2603.18527 · v4 · pith:75HRBPA7new · submitted 2026-03-19 · 🧮 math.NA · cs.NA

Neural Preconditioned Born Series: A Metric-Matched Framework for Learning-based Preconditioners

Juntao Wang , Jiwei Jia , Xinliang Liu This is my paper

Pith reviewed 2026-05-21 11:24 UTC · model grok-4.3

classification 🧮 math.NA cs.NA

keywords neural preconditionersBorn seriesHelmholtz equationmetric matchingiterative solverspreconditioned coordinatesnumerical PDEslearned preconditioning

0 comments

The pith

A learned preconditioner trained in Born-series residual coordinates cuts high-frequency Helmholtz iteration counts by up to 1.9 times over direct residual learning.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents Neural Preconditioned Born Series as a way to build learned preconditioners for the Helmholtz equation in heterogeneous media by working inside the coordinate system created by the classical Convergent Born Series. It replaces the fixed CBS correction with a neural residual-to-correction map and trains that map using an objective whose geometry matches the left-preconditioning metric used at inference time. This alignment produces faster convergence than both classical CBS and standard learned preconditioners, with the relative gain growing as the wavenumber increases. The same construction also accelerates linear solves that arise in convection-diffusion-reaction problems and in Newton iterations for nonlinear PDEs.

Core claim

NPBS recasts the Convergent Born Series iteration as shifted-Laplacian left preconditioning, substitutes a learned residual-to-correction map for the classical CBS preconditioner inside the resulting Born-preconditioned coordinates, and optimizes that map with a metric-matched loss that aligns training with the geometry seen at inference.

What carries the argument

The metric-matched training objective defined in the residual coordinates induced by the Convergent Born Series left preconditioner.

If this is right

On heterogeneous Helmholtz benchmarks the method reduces iteration counts by up to 1.9 times versus direct residual learning, with the factor rising from 1.2 times to 1.9 times as wavenumber increases.
Learned NPBS cuts stationary iteration counts by more than 20 times compared with classical CBS.
When used inside FGMRES, NPBS yields the lowest wall-clock time among all tested preconditioners.
The same metric-matched formulation improves convergence for convection-diffusion-reaction systems and for Newton linear systems arising from nonlinear PDEs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The metric-matching principle could be transferred to preconditioners derived from other classical iterative schemes for wave equations.
A single trained NPBS model might handle families of media that vary in both heterogeneity and frequency range, lowering the cost of repeated solves in imaging or design applications.
The framework's performance on three-dimensional domains would indicate how well the residual-metric alignment scales beyond the two-dimensional benchmarks shown.

Load-bearing premise

The residual-to-correction map learned on one collection of heterogeneous media will continue to deliver the reported speedups on new media and at higher wavenumbers without retraining or suffering distribution shift.

What would settle it

Apply the trained NPBS model without retraining to a fresh suite of heterogeneous media whose wavenumbers lie 20 percent above the training range and measure whether the iteration-count reduction relative to direct residual learning remains above 1.5 times.

read the original abstract

High-frequency Helmholtz problems in heterogeneous media remain challenging for both classical iterative methods and end-to-end neural PDE solvers. We propose Neural Preconditioned Born Series (NPBS), a learned iterative preconditioning framework that operates in preconditioned residual coordinates induced by the Convergent Born Series (CBS). Existing learned Born-series methods primarily use Born-style unrolling for forward wavefield prediction, while learned Helmholtz preconditioners are usually formulated in physical residual coordinates. NPBS fills this gap by recasting Born-series iteration as shifted-Laplacian left preconditioning, and replacing the CBS preconditioner with a learned residual-to-correction map in the Born-preconditioned coordinates. The left preconditioner further induces a residual metric, which yields a metric-matched training objective that aligns optimization with the preconditioned geometry used at inference. On heterogeneous Helmholtz benchmarks, metric-matched NPBS reduces iteration counts by up to $1.9\times$ over direct residual learning, with gains increasing from $1.2\times$ to $1.9\times$ as the wavenumber rises. Compared to classical CBS, learned NPBS reduces stationary iteration counts by over $20\times$; when used as a preconditioner for FGMRES, it further achieves the lowest wall-clock time among all evaluated methods. The same metric-matched formulation also improves convergence on convection--diffusion--reaction systems and Newton linear systems for nonlinear PDEs, indicating that residual-metric matching is a general design principle for neural preconditioners.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

NPBS recasts the convergent Born series as left preconditioning and trains a residual-to-correction map under the induced metric, yielding reported iteration gains on Helmholtz benchmarks that grow with wavenumber.

read the letter

The main point on this paper is that it trains a neural map to serve as a preconditioner inside the convergent Born series by working directly in the preconditioned residual coordinates and using a loss that matches the metric induced by the left preconditioner. This is presented as bridging the gap between unrolled Born-style forward models and residual-based learned preconditioners for high-frequency Helmholtz problems in heterogeneous media. They achieve this by treating the classical CBS iteration as shifted-Laplacian left preconditioning and replacing the fixed preconditioner with the learned correction step. The metric-matched objective is the concrete design choice that aligns training with inference geometry. On the tested heterogeneous Helmholtz cases the numbers show up to 1.9 times fewer iterations than direct residual learning, with the advantage increasing from 1.2 to 1.9 times as wavenumber rises, plus more than 20 times fewer stationary iterations than classical CBS and the lowest wall-clock time when used inside FGMRES. The same formulation is also shown to help on convection-diffusion-reaction problems and Newton linear systems for nonlinear PDEs, which suggests the metric idea is not tied to one equation. The experiments therefore give some evidence that the synthesis works on the problems they ran. The soft spot is generalization. The headline claims rest on the learned map transferring to new heterogeneous media and higher wavenumbers without retraining, yet the abstract supplies no details on the training distribution of media, the wavenumber range covered, or whether the test cases are drawn from the same distribution. If the network has fitted to particular heterogeneity statistics, the reported scaling of gains with wavenumber and the FGMRES wall-clock edge would not necessarily hold on fresh problems. Without the full experimental protocol, data splits, architecture description, and any measure of variability it is difficult to judge how robust the numbers are. This work is aimed at researchers building iterative solvers for wave problems in geophysics, acoustics, or electromagnetics who are already exploring learned components inside classical iterations. A reader focused on preconditioner design for high-frequency PDEs would find the specific recasting and the metric objective worth examining. The paper has a clear technical idea plus supporting numerics, so it deserves a serious referee even though revisions on robustness and experimental controls would be needed.

Referee Report

2 major / 3 minor

Summary. The manuscript introduces Neural Preconditioned Born Series (NPBS), a learned iterative preconditioning framework for high-frequency Helmholtz problems in heterogeneous media. It recasts the classical Convergent Born Series as shifted-Laplacian left preconditioning and substitutes the CBS preconditioner with a neural residual-to-correction map operating in Born-preconditioned coordinates. A metric-matched training objective is derived from the induced residual metric to align optimization with inference geometry. On heterogeneous Helmholtz benchmarks the method reports up to 1.9× fewer iterations than direct residual learning (gains increasing with wavenumber), more than 20× fewer stationary iterations than classical CBS, and the lowest wall-clock time when used inside FGMRES. The same formulation is shown to accelerate convergence on convection-diffusion-reaction problems and Newton linear systems arising from nonlinear PDEs.

Significance. If the reported performance gains and generalization behavior are robust, the metric-matching principle constitutes a useful design guideline for neural preconditioners that respects the geometry of the preconditioned operator. The work bridges classical Born-series analysis with learned iterative methods and demonstrates applicability beyond the Helmholtz setting, which could influence the development of hybrid numerical-ML solvers for wave and transport problems.

major comments (2)

[§5.1 and Table 2] §5.1 and Table 2: the headline claims of 1.9× iteration reduction versus direct residual learning and gains that increase from 1.2× to 1.9× with wavenumber rest on a single trained network; the manuscript must explicitly state the wavenumber range and heterogeneity statistics used for training versus testing, together with whether any test cases involve extrapolation outside the training distribution, otherwise the generalization assumption underlying the scaling result cannot be verified.
[§5.3, Figure 4] §5.3, Figure 4: the FGMRES wall-clock-time comparison reports NPBS as fastest, yet the iteration counts and timings are presented without error bars or the number of independent media realizations; this weakens the claim that the observed advantage is statistically reliable across heterogeneous media.

minor comments (3)

[§3.2] The notation for the residual metric induced by the left preconditioner (introduced around Eq. (8)) should be cross-referenced more clearly when the metric-matched loss is defined in §3.2.
[Figure 3] Several figure captions (e.g., Figure 3) omit the precise definition of the heterogeneous media used; adding a short description or reference to the generation procedure would improve reproducibility.
[Abstract] The abstract states concrete speed-up numbers but the corresponding experimental protocol (network architecture, optimizer, data splits) is only described later; moving a concise summary of these controls into the abstract or a dedicated “Experimental Setup” paragraph would aid readers.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the positive evaluation and the constructive comments that help clarify the experimental setup. We address each major comment below and will revise the manuscript to incorporate the requested details on training/testing distributions and statistical reporting.

read point-by-point responses

Referee: [§5.1 and Table 2] §5.1 and Table 2: the headline claims of 1.9× iteration reduction versus direct residual learning and gains that increase from 1.2× to 1.9× with wavenumber rest on a single trained network; the manuscript must explicitly state the wavenumber range and heterogeneity statistics used for training versus testing, together with whether any test cases involve extrapolation outside the training distribution, otherwise the generalization assumption underlying the scaling result cannot be verified.

Authors: We agree that explicit documentation of the training and test distributions is necessary to support the generalization claims. In the revised manuscript we will add a new paragraph in §5.1 that states the wavenumber range and heterogeneity parameters (variance, correlation length) used for training the network, the corresponding ranges for the test cases reported in Table 2, and whether any reported test points lie outside the training distribution. This addition will allow readers to assess the extent of extrapolation in the observed scaling with wavenumber. revision: yes
Referee: [§5.3, Figure 4] §5.3, Figure 4: the FGMRES wall-clock-time comparison reports NPBS as fastest, yet the iteration counts and timings are presented without error bars or the number of independent media realizations; this weakens the claim that the observed advantage is statistically reliable across heterogeneous media.

Authors: We acknowledge that reporting the number of realizations and variability measures would strengthen the statistical interpretation of the wall-clock results. In the revised §5.3 and updated Figure 4 we will state the number of independent heterogeneous media realizations used for each data point and include error bars (or shaded regions) that reflect the observed variation across those realizations. These changes will be made without altering the underlying experimental protocol or the reported ordering of methods. revision: yes

Circularity Check

0 steps flagged

No circularity: framework combines external classical CBS with explicit training

full rationale

The derivation recasts the classical Convergent Born Series (CBS) as shifted-Laplacian left preconditioning and replaces the preconditioner with a learned residual-to-correction map trained under a metric-matched objective. CBS convergence properties are treated as external classical results rather than derived within the paper. The central claims rest on empirical benchmarks comparing iteration counts and wall-clock times, not on any equation that reduces by construction to a fitted parameter or self-citation chain. The learned component is explicitly trained rather than presented as a prediction forced by the inputs.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the classical properties of the Convergent Born Series providing a stable coordinate system and on the assumption that a neural network can learn an effective map inside that system; no new physical entities are introduced.

free parameters (1)

neural network weights
The residual-to-correction map is obtained by training a neural network on data; these weights are fitted parameters.

axioms (1)

domain assumption The Convergent Born Series supplies a convergent left preconditioner for the Helmholtz operator that can be recast as a shifted-Laplacian operator.
Invoked when the paper states that Born-series iteration is recast as shifted-Laplacian left preconditioning.

pith-pipeline@v0.9.0 · 5796 in / 1591 out tokens · 63750 ms · 2026-05-21T11:24:47.406443+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Proposition 2.1 (Left-preconditioned equivalence... I−GηVη = GηA = L⁻¹η A) and metric Rη := G*η Gη with LRη_bs objective
IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean costAlphaLog_high_calibrated_iff unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

NPBS iteration um+1 = um + Mθ(Gη(f−Aum)) and metric-matched training

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.