Parametrization of subgrid scales in long-term simulations of the shallow-water equations using machine learning and convex limiting

Ilya Timofeyev; Md Amran Hossan Mojamder; Min Wang; Zhihang Xu

arxiv: 2602.00378 · v2 · submitted 2026-01-30 · ⚛️ physics.flu-dyn · cs.LG· physics.ao-ph· physics.comp-ph

Parametrization of subgrid scales in long-term simulations of the shallow-water equations using machine learning and convex limiting

Md Amran Hossan Mojamder , Zhihang Xu , Min Wang , Ilya Timofeyev This is my paper

Pith reviewed 2026-05-16 08:48 UTC · model grok-4.3

classification ⚛️ physics.flu-dyn cs.LGphysics.ao-phphysics.comp-ph

keywords shallow-water equationssubgrid parametrizationneural networksmachine learningturbulent simulationsflux limitinglong-term stability

0 comments

The pith

A local neural-network parametrization learns subgrid fluxes for the shallow-water equations and remains stable in flow regimes absent from training data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a feed-forward neural network that predicts sub-grid fluxes from local spatial averages of coarse variables in the shallow-water equations. This produces a parametrization that operates on a small four-point stencil and can be inserted directly into existing numerical schemes. In long simulations that reach statistical steady states, the approach improves overall energy balance and matches individual flow solutions. It also pairs with convex flux limiting to control oscillations near discontinuities. The trained network continues to generate reliable results when the underlying flow statistics change in ways not represented in the training set.

Core claim

We present a method for parametrizing sub-grid processes in the Shallow Water equations. We define coarse variables and local spatial averages and use a feed-forward neural network to learn sub-grid fluxes. Our method results in a local parametrization that uses a four-point computational stencil, which has several advantages over globally coupled parametrizations. We demonstrate numerically that our method improves energy balance in long-term turbulent simulations and also accurately reproduces individual solutions. The neural network parametrization can be easily combined with flux limiting to reduce oscillations near shocks. More importantly, our method provides reliable parametrizations,

What carries the argument

Feed-forward neural network that maps local spatial averages of coarse variables to sub-grid fluxes on a four-point stencil.

If this is right

Energy balance improves in long-term turbulent simulations of the shallow-water equations.
Individual flow solutions are reproduced with higher fidelity than the coarse model alone.
The parametrization combines directly with flux limiting to suppress oscillations near shocks.
Stable results appear even when the flow enters dynamical regimes absent from the training data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The four-point local stencil may allow the same network to be reused across different grid resolutions without retraining.
The observed generalization suggests the network has captured scale-invariant features of the subgrid flux rather than regime-specific statistics.
Similar local averaging plus network training could be tested on other hyperbolic systems where subgrid closure is needed.

Load-bearing premise

The neural network trained on a limited set of coarse-grained simulations will continue to produce stable and physically plausible fluxes when the underlying flow statistics change substantially.

What would settle it

A long-term simulation in a dynamical regime whose turbulence statistics differ markedly from the training set, checked for whether the energy balance improvement disappears or unphysical oscillations appear.

read the original abstract

We present a method for parametrizing sub-grid processes in the Shallow Water equations. We define coarse variables and local spatial averages and use a feed-forward neural network to learn sub-grid fluxes. Our method results in a local parametrization that uses a four-point computational stencil, which has several advantages over globally coupled parametrizations. We demonstrate numerically that our method improves energy balance in long-term turbulent simulations and also accurately reproduces individual solutions. The long-term simulations refer to numerical studies where a fluid flow is simulated over a duration long enough to reach a statistical steady state. The neural network parametrization can be easily combined with flux limiting to reduce oscillations near shocks. More importantly, our method provides reliable parametrizations, even in dynamical regimes that are not included in the training data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Local four-point NN closure with convex limiting stabilizes long-term shallow-water runs but leans on the limiter for out-of-sample behavior rather than proven NN extrapolation.

read the letter

The main takeaway is a local feed-forward neural net that predicts sub-grid fluxes in the shallow-water equations using a four-point stencil, then pairs it with convex limiting to keep long turbulent simulations stable and energy-balanced. This setup is presented as working even for dynamical regimes outside the training data. The locality is the clearest practical step forward compared with global ML closures in the literature, since it slots directly into existing finite-volume codes without coupling distant points. The numerical examples show improved energy conservation once the flow reaches statistical steady state and fewer blow-ups when the limiter is active. That combination is worth noting for anyone running operational shallow-water or related models. The soft spot is the generalization claim. The stress-test note is right that the limiter appears to be carrying much of the load when the NN sees new forcing or Reynolds-like numbers; without reported training-data statistics, error bars on the fluxes, or systematic out-of-distribution tests, it is hard to separate genuine extrapolation from simple clipping of bad outputs. The paper would be stronger with an ablation that turns the limiter off on the unseen cases and with direct comparison to a standard eddy-viscosity closure on the same long-time metrics. This is aimed at computational geophysicists and CFD people who already use limiting schemes and want a drop-in ML replacement for sub-grid terms. A reader looking for implementable local closures will get concrete code-level ideas even if the extrapolation evidence stays empirical. I would send it to peer review. The local stencil plus limiting is a usable contribution that deserves expert eyes on the numerical setup and training details.

Referee Report

3 major / 2 minor

Summary. The paper proposes a local, data-driven parametrization of sub-grid fluxes for the shallow-water equations. Coarse variables are defined via spatial averages, a feed-forward neural network is trained on coarse-grained simulation data to predict the fluxes using a four-point stencil, and the resulting scheme is combined with convex limiting. Numerical experiments claim improved energy balance and statistical steady-state accuracy in long-term turbulent runs, together with stable performance in dynamical regimes absent from the training set.

Significance. If the reported generalization holds, the method supplies a practical, locally computable closure that avoids global coupling and can be paired with existing limiters. This would be a useful addition to the toolkit for long-time integration of geophysical models, provided the extrapolation behavior is placed on firmer footing than the current numerical demonstrations.

major comments (3)

[§5.2] §5.2 and Table 2: the out-of-sample energy-balance improvement is shown for a single Reynolds-like number and forcing; no quantitative measure (e.g., relative L2 deviation of the NN-predicted flux from the filtered truth or distance of NN inputs to the training distribution) is supplied to demonstrate that the network is extrapolating rather than being rescued by the limiter.
[§4.1] §4.1, Eq. (8): the training loss is defined solely on flux reconstruction error; no term enforces discrete conservation properties (momentum or energy) that the shallow-water system requires, leaving open the possibility that the learned fluxes violate the underlying balance laws even when the limiter keeps the solution bounded.
[§5.3] §5.3: the convex-limiting procedure is invoked to guarantee positivity and reduce oscillations, yet the manuscript provides no diagnostic showing the frequency or magnitude of limiter activations in the unseen regimes; without this, it is impossible to separate genuine NN generalization from limiter masking of non-physical outputs.

minor comments (2)

The four-point stencil is repeatedly described as an advantage, but the precise stencil geometry and its relation to the coarse-grid spacing are never illustrated; a small diagram would clarify the locality claim.
Notation for the coarse-grained variables (e.g., overbars versus angle brackets) is introduced inconsistently between §2 and §3; a single consistent symbol set would improve readability.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed comments. We address each major point below and have revised the manuscript to incorporate additional quantitative diagnostics and clarifications that strengthen the evidence for generalization and the role of the neural-network parametrization.

read point-by-point responses

Referee: [§5.2] §5.2 and Table 2: the out-of-sample energy-balance improvement is shown for a single Reynolds-like number and forcing; no quantitative measure (e.g., relative L2 deviation of the NN-predicted flux from the filtered truth or distance of NN inputs to the training distribution) is supplied to demonstrate that the network is extrapolating rather than being rescued by the limiter.

Authors: We agree that explicit quantitative measures would strengthen the extrapolation claim. In the revised manuscript we have added, in §5.2, a comparison of the empirical distributions of the four-point NN inputs between the training and out-of-sample regimes, together with the relative L2 norm of the NN-predicted sub-grid fluxes versus the filtered truth. These diagnostics show that the inputs lie outside the training support while the flux errors remain comparable to the in-sample case. The sustained energy-balance improvement is therefore not attributable solely to limiter intervention. Table 2 has been updated with the new error metrics. revision: yes
Referee: [§4.1] §4.1, Eq. (8): the training loss is defined solely on flux reconstruction error; no term enforces discrete conservation properties (momentum or energy) that the shallow-water system requires, leaving open the possibility that the learned fluxes violate the underlying balance laws even when the limiter keeps the solution bounded.

Authors: The underlying finite-volume discretization is written in strict conservation form; discrete momentum is therefore conserved to machine precision for any consistent numerical fluxes, independent of how those fluxes are obtained. The training data are extracted from high-resolution simulations that already satisfy the discrete balance laws, so the learned mapping approximates sub-grid contributions that are consistent with those laws. We did not augment the loss with explicit conservation penalties in order to preserve the strictly local, stencil-based character of the parametrization. The long-term energy diagnostics already reported in the paper indicate that no systematic violation occurs. We have added a short clarifying paragraph in §4.1 that makes this reasoning explicit. revision: partial
Referee: [§5.3] §5.3: the convex-limiting procedure is invoked to guarantee positivity and reduce oscillations, yet the manuscript provides no diagnostic showing the frequency or magnitude of limiter activations in the unseen regimes; without this, it is impossible to separate genuine NN generalization from limiter masking of non-physical outputs.

Authors: We accept that a quantitative record of limiter activity is needed to separate the contributions of the neural network and the limiter. In the revised §5.3 we have inserted a new diagnostic figure that reports, for both in-sample and out-of-sample runs, the time-averaged fraction of cells in which the convex limiter is active and the typical magnitude of the correction. The activation rates remain low and statistically indistinguishable between the two regimes, indicating that the neural-network fluxes are largely admissible without heavy limiter intervention. This supports the claim of genuine generalization. revision: yes

Circularity Check

0 steps flagged

No circularity: data-driven NN parametrization is independent of target outputs

full rationale

The paper trains a feed-forward neural network on coarse-grained simulation data to learn local sub-grid fluxes for the shallow-water equations, then combines the result with convex limiting. This is a standard supervised learning setup where the NN outputs are fitted to external data rather than defined in terms of the quantities they are later used to predict. No self-definitional loop, fitted-input-renamed-as-prediction, or load-bearing self-citation chain appears in the derivation; the claim of generalization to unseen regimes is presented as an empirical numerical result, not a mathematical reduction to the training inputs. The method remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The approach assumes that sub-grid fluxes can be adequately represented by a feed-forward network acting on local averages and that convex limiting suffices to control oscillations; no new physical entities are introduced.

free parameters (1)

neural-network weights and biases
Learned from training data; central to the parametrization.

axioms (1)

domain assumption Sub-grid fluxes depend only on local four-point stencil averages
Stated as the basis for the local parametrization.

pith-pipeline@v0.9.0 · 5459 in / 1218 out tokens · 43797 ms · 2026-05-16T08:48:09.780111+00:00 · methodology

Parametrization of subgrid scales in long-term simulations of the shallow-water equations using machine learning and convex limiting

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)