pith. sign in

arxiv: 2605.05540 · v1 · submitted 2026-05-07 · 💻 cs.LG · physics.flu-dyn

Towards Scalable One-Step Generative Modeling for Autoregressive Dynamical System Forecasting

Pith reviewed 2026-05-08 15:09 UTC · model grok-4.3

classification 💻 cs.LG physics.flu-dyn
keywords melisaautoregressivegenerativelong-horizonmeanflowmodelsscalablestatistical
0
0 comments X

The pith

MeLISA delivers one-step blockwise generative forecasting for dynamical systems that improves short-term accuracy and long-horizon statistical fidelity over neural operators while matching or exceeding their inference speed.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces a new type of model for predicting how complex physical systems evolve over time, focusing on things like fluid turbulence. Existing approaches either lose accuracy over long periods or take many steps to generate each prediction, which is slow. The proposed MeLISA model works in the original pixel space of the data and generates blocks of future states all at once with just one pass through the model. It avoids the need for separate latent space models or repeated denoising processes used in diffusion models. To make sure the predictions stay realistic over many steps, the training uses two special losses: one that ensures consistency within observed time windows and another that matches the way quantities change over different time lags, preserving things like energy spectra in the flow. The model was tested on two challenging fluid flow simulations at high resolution using both small and large neural network architectures. Results show it beats standard neural operator methods in both immediate accuracy and in keeping the long-term statistics of the turbulence correct, all while running at similar or better speeds. This suggests a promising direction for building fast, accurate simulators for physical phenomena.

Core claim

MeLISA outperforms neural-operator baselines on short-term forecasting accuracy and long-horizon statistical metrics, including energy spectra, turbulent kinetic energy, and mixing-rate-related dynamics, while achieving inference speeds comparable to, and in some cases faster than, neural operators.

Load-bearing premise

That the Window-Consistency MeanFlow objective combined with the Time Increment Consistency loss will stabilize long-horizon rollouts and preserve statistical structure without introducing artifacts or requiring additional post-hoc corrections.

read the original abstract

Fast surrogate modeling for high-dimensional physical dynamics requires more than low short-term error: useful models must roll out efficiently while preserving the statistical structure of long trajectories. Neural operators provide inexpensive autoregressive forecasts but can drift in turbulent regimes, whereas rolling diffusion and latent generative surrogates can represent stochastic transitions at the cost of multi-step denoising, noise-schedule design, or auxiliary compression models. We propose MeanFlow Long-term Invariant Spatiotemporal Consistency Autoregressive Models (MeLISA), a latent-free autoregressive generative surrogate built on pixel-space MeanFlow. MeLISA defines a blockwise stochastic transition kernel that generates each forecast block with a single model evaluation, avoiding latent encoders and iterative diffusion solvers at inference time. To stabilize long-horizon rollouts, MeLISA combines a Window-Consistency MeanFlow objective that learns conditional spatiotemporal generation from partially observed temporal windows with a Time Increment Consistency loss that constrains multi-lag finite increments and targets temporal-correlation structure. We evaluate MeLISA with compact UNet and scalable DiT backbones on two high-resolution benchmarks, extended 2D Kolmogorov flow at $256 \times 256$ and turbulent channel-flow slice at $192 \times 192$. MeLISA outperforms neural-operator baselines on short-term forecasting accuracy and long-horizon statistical metrics, including energy spectra, turbulent kinetic energy, and mixing-rate-related dynamics, while achieving inference speeds comparable to, and in some cases faster than, neural operators. Compact 3.7-5.7M-parameter variants already deliver strong parameter efficiency, and DiT variants provide a scalable path up to 150M parameters. Overall, MeLISA benefits both rollout efficiency and long-horizon statistical accuracy.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities beyond standard neural network assumptions; the two consistency losses are introduced as training objectives without further decomposition.

pith-pipeline@v0.9.0 · 5611 in / 1155 out tokens · 31189 ms · 2026-05-08T15:09:45.214445+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.