Flow marching for a generative PDE foundation model
Pith reviewed 2026-05-18 13:45 UTC · model grok-4.3
The pith
Joint sampling of noise levels and physical time steps lets a flow-matching model learn a unified velocity field that transports noisy states to clean successors in PDE trajectories.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By jointly sampling the noise level and the physical time step between adjacent states, the model learns a unified velocity field that transports a noisy current state toward its clean successor, reducing long-term rollout drift while enabling uncertainty-aware ensemble generations.
What carries the argument
Flow Marching: the scheme that jointly samples noise level and physical time step to train a single velocity field bridging noisy and clean states in latent space.
If this is right
- The model produces stable rollouts over hundreds of steps on unseen Kolmogorov turbulence after few-shot adaptation.
- Ensemble generations become uncertainty-aware without separate models for each member.
- Pretraining on 2.5 million trajectories across 12 PDE families becomes feasible at 15 times lower cost than full video diffusion.
- The latent temporal pyramids and diffusion-forcing scheme in the Flow Marching Transformer support efficient scaling.
Where Pith is reading between the lines
- The same joint-sampling idea might extend to other sequence models that suffer compounding errors, such as autoregressive video or climate simulators.
- The compact latent space from the Physics-Pretrained VAE could support downstream tasks like parameter estimation or control that were not tested here.
- Uncertainty stratification in the ensembles might identify which regions of state space are most sensitive to initial conditions.
Load-bearing premise
The premise that jointly sampling noise and time directly counters error accumulation in physical dynamical systems and thereby reduces long-term rollout drift.
What would settle it
A controlled comparison on the same long-horizon PDE trajectories where the Flow Marching model exhibits equal or larger cumulative error than a deterministic baseline after hundreds of steps.
Figures
read the original abstract
Pretraining on large-scale collections of PDE-governed spatiotemporal trajectories has recently shown promise for building generalizable models of dynamical systems. Yet most existing PDE foundation models rely on deterministic Transformer architectures, which lack generative flexibility for many science and engineering applications. We propose Flow Marching, an algorithm that bridges neural operator learning with flow matching motivated by an analysis of error accumulation in physical dynamical systems, and we build a generative PDE foundation model on top of it. By jointly sampling the noise level and the physical time step between adjacent states, the model learns a unified velocity field that transports a noisy current state toward its clean successor, reducing long-term rollout drift while enabling uncertainty-aware ensemble generations. Alongside this core algorithm, we introduce a Physics-Pretrained Variational Autoencoder (P2VAE) to embed physical states into a compact latent space, and an efficient Flow Marching Transformer (FMT) that combines a diffusion-forcing scheme with latent temporal pyramids, achieving up to 15x greater computational efficiency than full-length video diffusion models and thereby enabling large-scale pretraining at substantially reduced cost. We curate a corpus of ~2.5M trajectories across 12 distinct PDE families and train suites of P2VAEs and FMTs at multiple scales. On downstream evaluation, we benchmark on unseen Kolmogorov turbulence with few-shot adaptation, demonstrate long-term rollout stability over deterministic counterparts, and present uncertainty-stratified ensemble results, highlighting the importance of generative PDE foundation models for real-world applications.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes Flow Marching, an algorithm bridging neural operator learning and flow matching for generative PDE foundation models. By jointly sampling noise level and physical time step between adjacent states, it learns a unified velocity field to transport noisy current states toward clean successors, aiming to reduce long-term rollout drift and enable uncertainty-aware ensembles. The work introduces a Physics-Pretrained Variational Autoencoder (P2VAE) for compact latent embeddings of physical states and an efficient Flow Marching Transformer (FMT) combining diffusion-forcing with latent temporal pyramids. The model is pretrained on a corpus of ~2.5M trajectories across 12 PDE families and evaluated via few-shot adaptation on unseen Kolmogorov turbulence, claiming long-term stability over deterministic baselines and up to 15x computational efficiency relative to full-length video diffusion models.
Significance. If the central claims on drift reduction and efficiency hold after verification, the work would advance generative modeling of dynamical systems by providing a scalable, uncertainty-aware alternative to deterministic PDE foundation models. The curation of a large multi-family PDE trajectory corpus and the efficiency gains from latent pyramids represent concrete strengths that could support broader pretraining efforts in scientific machine learning.
major comments (2)
- [Abstract and §3] Abstract and §3 (Flow Marching motivation): The claim that jointly sampling noise level and physical time step produces a unified velocity field reducing long-term rollout drift is load-bearing for the central contribution, yet no ablation isolates this joint-sampling procedure while holding the P2VAE embedding and FMT components (diffusion-forcing and latent temporal pyramids) fixed. The error-accumulation analysis therefore remains untested as the specific mechanism.
- [§5] §5 (Downstream evaluation): The reported outcomes (long-term rollout stability on Kolmogorov turbulence, 15x efficiency, uncertainty-stratified ensembles) are presented without quantitative metrics, error bars, ablation tables, or explicit dataset splits and baseline comparisons, preventing direct verification of the stability and efficiency claims against deterministic counterparts.
minor comments (2)
- [§3] The notation for the joint noise-and-time sampling distribution in the Flow Marching objective could be clarified with an explicit equation contrasting it to standard flow-matching conditioning.
- [Figures] Figure captions describing ensemble results should specify the number of samples drawn and the exact stratification procedure used for uncertainty quantification.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed feedback. We address each major comment below and commit to revisions that directly strengthen the empirical support for our central claims without altering the core contributions.
read point-by-point responses
-
Referee: [Abstract and §3] Abstract and §3 (Flow Marching motivation): The claim that jointly sampling noise level and physical time step produces a unified velocity field reducing long-term rollout drift is load-bearing for the central contribution, yet no ablation isolates this joint-sampling procedure while holding the P2VAE embedding and FMT components (diffusion-forcing and latent temporal pyramids) fixed. The error-accumulation analysis therefore remains untested as the specific mechanism.
Authors: We agree that isolating the joint-sampling mechanism is important for validating the error-accumulation analysis in §3. While the current experiments compare Flow Marching against deterministic baselines and separate diffusion models, they do not hold P2VAE and FMT fixed while varying only the joint vs. non-joint sampling. In the revised manuscript we will add a targeted ablation in §5 that fixes the P2VAE and FMT architecture and directly compares joint sampling of noise level and physical time against (i) fixed physical time with varying noise and (ii) separate noise and time sampling. Preliminary results from this ablation support the drift-reduction benefit; the full table and rollout curves will be included. revision: yes
-
Referee: [§5] §5 (Downstream evaluation): The reported outcomes (long-term rollout stability on Kolmogorov turbulence, 15x efficiency, uncertainty-stratified ensembles) are presented without quantitative metrics, error bars, ablation tables, or explicit dataset splits and baseline comparisons, preventing direct verification of the stability and efficiency claims against deterministic counterparts.
Authors: We acknowledge that the presentation in §5 can be made more self-contained for verification. The manuscript already reports rollout MSE curves, wall-clock and FLOP comparisons yielding the 15x figure, and ensemble variance statistics, but we agree that error bars from repeated seeds, explicit ablation tables, and precise dataset splits were not sufficiently highlighted. In the revision we will expand §5 with (i) error bars computed over five independent runs, (ii) a dedicated ablation table contrasting Flow Marching against deterministic neural-operator baselines on the same Kolmogorov few-shot splits, and (iii) explicit train/validation/test splits for the 2.5M-trajectory corpus and the downstream Kolmogorov adaptation. These additions will be placed in new tables and figures. revision: yes
Circularity Check
No circularity: Flow Marching derivation stands on independent sampling scheme and error analysis
full rationale
The paper introduces Flow Marching as a new algorithm that jointly samples noise level and physical time step to learn a unified velocity field, motivated by a separate analysis of error accumulation in dynamical systems. No equations or claims reduce the target outcome (reduced rollout drift) to a fitted parameter or self-citation by construction. The P2VAE and FMT components are presented as additional architectural choices, not as definitional inputs that force the main result. The derivation chain remains self-contained against external benchmarks such as Kolmogorov turbulence rollouts.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Flow matching can be extended to jointly condition on noise level and physical time step to produce a unified transport velocity field for dynamical systems.
invented entities (3)
-
Flow Marching algorithm
no independent evidence
-
Physics-Pretrained Variational Autoencoder (P2VAE)
no independent evidence
-
Flow Marching Transformer (FMT)
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
By jointly sampling the noise level and the physical time step between adjacent states, the model learns a unified velocity field that transports a noisy current state toward its clean successor
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanabsolute_floor_iff_bare_distinguishability unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
A comprehensive numerical analysis on the error accumulation comparison between deterministic neural operator and our flow marching scheme
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION format.date year duplicate empty "emp...
-
[2]
\@ifxundefined[1] #1\@undefined \@firstoftwo \@secondoftwo \@ifnum[1] #1 \@firstoftwo \@secondoftwo \@ifx[1] #1 \@firstoftwo \@secondoftwo [2] @ #1 \@temptokena #2 #1 @ \@temptokena \@ifclassloaded agu2001 natbib The agu2001 class already includes natbib coding, so you should not add it explicitly Type <Return> for now, but then later remove the command n...
-
[3]
\@lbibitem[] @bibitem@first@sw\@secondoftwo \@lbibitem[#1]#2 \@extra@b@citeb \@ifundefined br@#2\@extra@b@citeb \@namedef br@#2 \@nameuse br@#2\@extra@b@citeb \@ifundefined b@#2\@extra@b@citeb @num @parse #2 @tmp #1 NAT@b@open@#2 NAT@b@shut@#2 \@ifnum @merge>\@ne @bibitem@first@sw \@firstoftwo \@ifundefined NAT@b*@#2 \@firstoftwo @num @NAT@ctr \@secondoft...
-
[4]
@open @close @open @close and [1] URL: #1 \@ifundefined chapter * \@mkboth \@ifxundefined @sectionbib * \@mkboth * \@mkboth\@gobbletwo \@ifclassloaded amsart * \@ifclassloaded amsbook * \@ifxundefined @heading @heading NAT@ctr thebibliography [1] @ \@biblabel @NAT@ctr \@bibsetup #1 @NAT@ctr @ @openbib .11em \@plus.33em \@minus.07em 4000 4000 `\.\@m @bibit...
work page 1999
-
[5]
Universal physics transformers: A framework for efficiently scaling neural operators
Benedikt Alkin, Andreas F \"u rst, Simon Schmid, Lukas Gruber, Markus Holzleitner, and Johannes Brandstetter. Universal physics transformers: A framework for efficiently scaling neural operators. Advances in Neural Information Processing Systems, 37: 0 25152--25194, 2024
work page 2024
-
[6]
Kushal Arora, Layla El Asri, Hareesh Bahuleyan, and Jackie Chi Kit Cheung. Why exposure bias matters: An imitation learning perspective of error accumulation in language generation, 2023. URL https://arxiv.org/abs/2204.01171
-
[7]
Neural operators for accelerating scientific simulations and design
Kamyar Azizzadenesheli, Nikola Kovachki, Zongyi Li, Miguel Liu-Schiaffini, Jean Kossaifi, and Anima Anandkumar. Neural operators for accelerating scientific simulations and design. Nature Reviews Physics, 6 0 (5): 0 320--328, 2024
work page 2024
-
[8]
Chipilski, Siming Liang, Guannan Zhang, and Jeffrey S
Feng Bao, Hristo G. Chipilski, Siming Liang, Guannan Zhang, and Jeffrey S. Whitaker. Nonlinear ensemble filtering with diffusion models: Application to the surface quasi-geostrophic dynamics, 2024 a . URL https://arxiv.org/abs/2404.00844
-
[9]
A score-based filter for nonlinear data assimilation
Feng Bao, Zezhong Zhang, and Guannan Zhang. A score-based filter for nonlinear data assimilation. Journal of Computational Physics, 514: 0 113207, 2024 b
work page 2024
- [10]
-
[11]
Salva Rühling Cachay, Bo Zhao, Hailey Joren, and Rose Yu. Dyffusion: A dynamics-informed diffusion model for spatiotemporal forecasting, 2023. URL https://arxiv.org/abs/2306.01984
-
[12]
Yadi Cao, Yuxuan Liu, Liu Yang, Rose Yu, Hayden Schaeffer, and Stanley Osher. Vicon: Vision in-context operator networks for multi-physics fluid dynamics prediction, 2025. URL https://arxiv.org/abs/2411.16063
- [13]
-
[14]
Gan-duf: Hierarchical deep generative models for design under free-form geometric uncertainty, 2022
Wei Wayne Chen, Doksoo Lee, Oluwaseyi Balogun, and Wei Chen. Gan-duf: Hierarchical deep generative models for design under free-form geometric uncertainty, 2022. URL https://arxiv.org/abs/2202.10558
-
[15]
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. Empirical evaluation of gated recurrent neural networks on sequence modeling, 2014. URL https://arxiv.org/abs/1412.3555
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[16]
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Tri Dao. Flashattention-2: Faster attention with better parallelism and work partitioning, 2023. URL https://arxiv.org/abs/2307.08691
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[17]
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas Müller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, Kyle Lacey, Alex Goodwin, Yannik Marek, and Robin Rombach. Scaling rectified flow transformers for high-resolution image synthesis, 2024. URL https://arxiv.org/abs/2403.03206
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[18]
Dropout as a bayesian approximation: Representing model uncertainty in deep learning
Yarin Gal and Zoubin Ghahramani. Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In international conference on machine learning, pp.\ 1050--1059. PMLR, 2016
work page 2016
-
[19]
Auto-regressive moving diffusion models for time series forecasting, 2024
Jiaxin Gao, Qinglong Cao, and Yuntian Chen. Auto-regressive moving diffusion models for time series forecasting, 2024. URL https://arxiv.org/abs/2412.09328
-
[20]
Ca2-vdm: Efficient autore- gressive video diffusion model with causal generation and cache sharing,
Kaifeng Gao, Jiaxin Shi, Hanwang Zhang, Chunping Wang, Jun Xiao, and Long Chen. Ca2-vdm: Efficient autoregressive video diffusion model with causal generation and cache sharing, 2025. URL https://arxiv.org/abs/2411.16375
-
[21]
Gupta and Johannes Brandstetter
Jayesh K. Gupta and Johannes Brandstetter. Towards multi-spatiotemporal-scale generalized pde modeling, 2022. URL https://arxiv.org/abs/2209.15616
-
[22]
Learnings from scaling visual tokenizers for reconstruction and generation
Philippe Hansen-Estruch, David Yan, Ching-Yao Chung, Orr Zohar, Jialiang Wang, Tingbo Hou, Tao Xu, Sriram Vishwanath, Peter Vajda, and Xinlei Chen. Learnings from scaling visual tokenizers for reconstruction and generation, 2025. URL https://arxiv.org/abs/2501.09755
-
[23]
Dpot: Auto-regressive denoising operator transformer for large-scale pde pre-training, 2024
Zhongkai Hao, Chang Su, Songming Liu, Julius Berner, Chengyang Ying, Hang Su, Anima Anandkumar, Jian Song, and Jun Zhu. Dpot: Auto-regressive denoising operator transformer for large-scale pde pre-training, 2024. URL https://arxiv.org/abs/2403.03542
-
[24]
Väinö Hatanpää, Eugene Ku, Jason Stock, Murali Emani, Sam Foreman, Chunyong Jung, Sandeep Madireddy, Tung Nguyen, Varuni Sastry, Ray A. O. Sinurat, Sam Wheeler, Huihuo Zheng, Troy Arcomano, Venkatram Vishwanath, and Rao Kotamarthi. Aeris: Argonne earth systems model for reliable and skillful predictions, 2025. URL https://arxiv.org/abs/2509.13523
-
[25]
Jonathan Ho, Tim Salimans, Alexey Gritsenko, William Chan, Mohammad Norouzi, and David J. Fleet. Video diffusion models, 2022. URL https://arxiv.org/abs/2204.03458
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[26]
Diffusionpde: Generative pde-solving under partial observation, 2024
Jiahe Huang, Guandao Yang, Zichen Wang, and Jeong Joon Park. Diffusionpde: Generative pde-solving under partial observation, 2024. URL https://arxiv.org/abs/2406.17763
-
[27]
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Xun Huang, Zhengqi Li, Guande He, Mingyuan Zhou, and Eli Shechtman. Self forcing: Bridging the train-test gap in autoregressive video diffusion, 2025. URL https://arxiv.org/abs/2506.08009
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[28]
Acc-unet: A completely convolutional unet model for the 2020s, 2023
Nabil Ibtehaz and Daisuke Kihara. Acc-unet: A completely convolutional unet model for the 2020s, 2023. URL https://arxiv.org/abs/2308.13680
-
[29]
Zhonglin Jiang, Qian Tang, and Zequn Wang. Generative reliability-based design optimization using in-context learning capabilities of large language models, 2025. URL https://arxiv.org/abs/2503.22401
-
[30]
Pyramidal flow matching for efficient video generative modeling
Yang Jin, Zhicheng Sun, Ningyuan Li, Kun Xu, Kun Xu, Hao Jiang, Nan Zhuang, Quzhe Huang, Yang Song, Yadong Mu, and Zhouchen Lin. Pyramidal flow matching for efficient video generative modeling, 2025. URL https://arxiv.org/abs/2410.05954
-
[31]
Neural operator: Learning maps between function spaces with applications to pdes
Nikola Kovachki, Zongyi Li, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Neural operator: Learning maps between function spaces with applications to pdes. Journal of Machine Learning Research, 24 0 (89): 0 1--97, 2023
work page 2023
-
[32]
Learning nonlinear reduced models from data with operator inference
Boris Kramer, Benjamin Peherstorfer, and Karen E Willcox. Learning nonlinear reduced models from data with operator inference. Annual Review of Fluid Mechanics, 56 0 (1): 0 521--548, 2024
work page 2024
-
[33]
Xingjian Leng, Jaskirat Singh, Yunzhong Hou, Zhenchang Xing, Saining Xie, and Liang Zheng. Repa-e: Unlocking vae for end-to-end tuning with latent diffusion transformers, 2025. URL https://arxiv.org/abs/2504.10483
-
[34]
Generative emulation of weather forecast ensembles with diffusion models
Lizao Li, Robert Carver, Ignacio Lopez-Gomez, Fei Sha, and John Anderson. Generative emulation of weather forecast ensembles with diffusion models. Science Advances, 10 0 (13): 0 eadk4489, 2024 a . doi:10.1126/sciadv.adk4489. URL https://www.science.org/doi/abs/10.1126/sciadv.adk4489
-
[35]
Zijie Li, Kazem Meidani, and Amir Barati Farimani. Transformer for partial differential equations' operator learning, 2023. URL https://arxiv.org/abs/2205.13671
-
[36]
Generative latent neural PDE solver using flow matching.arXiv preprint arXiv:2503.22600, 2025
Zijie Li, Anthony Zhou, and Amir Barati Farimani. Generative latent neural pde solver using flow matching, 2025. URL https://arxiv.org/abs/2503.22600
-
[37]
Fourier Neural Operator for Parametric Partial Differential Equations
Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Fourier neural operator for parametric partial differential equations, 2021. URL https://arxiv.org/abs/2010.08895
work page internal anchor Pith review Pith/arXiv arXiv 2021
-
[38]
Physics-informed neural operator for learning partial differential equations
Zongyi Li, Hongkai Zheng, Nikola Kovachki, David Jin, Haoxuan Chen, Burigede Liu, Kamyar Azizzadenesheli, and Anima Anandkumar. Physics-informed neural operator for learning partial differential equations. ACM/JMS Journal of Data Science, 1 0 (3): 0 1--27, 2024 b
work page 2024
-
[39]
Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, and Matt Le. Flow matching for generative modeling, 2023. URL https://arxiv.org/abs/2210.02747
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[40]
Flow-GRPO: Training Flow Matching Models via Online RL
Jie Liu, Gongye Liu, Jiajun Liang, Yangguang Li, Jiaheng Liu, Xintao Wang, Pengfei Wan, Di Zhang, and Wanli Ouyang. Flow-grpo: Training flow matching models via online rl, 2025. URL https://arxiv.org/abs/2505.05470
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[41]
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
Xingchao Liu, Chengyue Gong, and Qiang Liu. Flow straight and fast: Learning to generate and transfer data with rectified flow, 2022. URL https://arxiv.org/abs/2209.03003
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[42]
Yuxuan Liu, Jingmin Sun, Xinjie He, Griffin Pinney, Zecheng Zhang, and Hayden Schaeffer. Prose-fd: A multimodal pde foundation model for learning multiple operators for forecasting fluid dynamics, 2024. URL https://arxiv.org/abs/2409.09811
-
[43]
Physics informed token transformer for solving partial differential equations
Cooper Lorsung, Zijie Li, and Amir Barati Farimani. Physics informed token transformer for solving partial differential equations. Machine Learning: Science and Technology, 5 0 (1): 0 015032, 2024
work page 2024
-
[44]
Learning nonlinear operators via deeponet based on the universal approximation theorem of operators
Lu Lu, Pengzhan Jin, Guofei Pang, Zhongqiang Zhang, and George Em Karniadakis. Learning nonlinear operators via deeponet based on the universal approximation theorem of operators. Nature machine intelligence, 3 0 (3): 0 218--229, 2021
work page 2021
-
[45]
Nanye Ma, Mark Goldstein, Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden, and Saining Xie. Sit: Exploring flow and diffusion-based generative models with scalable interpolant transformers, 2024. URL https://arxiv.org/abs/2401.08740
-
[46]
Multiple physics pretraining for physical surrogate models.arXiv preprint arXiv:2310.02994, 2023
Michael McCabe, Bruno Régaldo-Saint Blancard, Liam Holden Parker, Ruben Ohana, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Siavash Golkar, Geraud Krawezik, Francois Lanusse, Mariel Pettee, Tiberiu Tesileanu, Kyunghyun Cho, and Shirley Ho. Multiple physics pretraining for physical surrogate models, 2024. URL https://arxiv.org/abs/2310.02994
-
[47]
Agocs, Miguel Beneitez, Marsha Berger, Blakesley Burkhart, Keaton Burns, Stuart B
Ruben Ohana, Michael McCabe, Lucas Meyer, Rudy Morel, Fruzsina J. Agocs, Miguel Beneitez, Marsha Berger, Blakesley Burkhart, Keaton Burns, Stuart B. Dalziel, Drummond B. Fielding, Daniel Fortunato, Jared A. Goldberg, Keiya Hirashima, Yan-Fei Jiang, Rich R. Kerswell, Suryanarayana Maddu, Jonah Miller, Payel Mukhopadhyay, Stefan S. Nixon, Jeff Shen, Romain ...
-
[48]
Vivek Oommen, Aniruddha Bora, Zhen Zhang, and George Em Karniadakis. Integrating neural operators with diffusion models improves spectral representation in turbulence modeling, 2025. URL https://arxiv.org/abs/2409.08477
-
[49]
On calibrating diffusion probabilistic models, 2023
Tianyu Pang, Cheng Lu, Chao Du, Min Lin, Shuicheng Yan, and Zhijie Deng. On calibrating diffusion probabilistic models, 2023. URL https://arxiv.org/abs/2302.10688
-
[50]
Scalable Diffusion Models with Transformers
William Peebles and Saining Xie. Scalable diffusion models with transformers, 2023. URL https://arxiv.org/abs/2212.09748
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[51]
Gencast: Diffusion- based ensemble forecasting for medium-range weather
Ilan Price, Alvaro Sanchez-Gonzalez, Ferran Alet, Tom R. Andersson, Andrew El-Kadi, Dominic Masters, Timo Ewalds, Jacklynn Stott, Shakir Mohamed, Peter Battaglia, Remi Lam, and Matthew Willson. Gencast: Diffusion-based ensemble forecasting for medium-range weather, 2024. URL https://arxiv.org/abs/2312.15796
-
[52]
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. High-resolution image synthesis with latent diffusion models, 2022. URL https://arxiv.org/abs/2112.10752
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[53]
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical image segmentation, 2015. URL https://arxiv.org/abs/1505.04597
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[54]
Mohammed Sardar and Alex Skillen. Turbulent flow data as pytorch tensors for ml: Kolmogorov flow at re=222, and kelvin-helmholtz instability. Dataset, 2025. URL https://doi.org/10.48420/29329565.v1
-
[55]
Jingmin Sun, Yuxuan Liu, Zecheng Zhang, and Hayden Schaeffer. Towards a foundation model for partial differential equations: Multioperator learning and extrapolation. Physical Review E, 111 0 (3): 0 035304, 2025
work page 2025
-
[56]
PDEBench: An extensive benchmark for scientific machine learning
Makoto Takamoto, Timothy Praditia, Raphael Leiteritz, Dan MacKinlay, Francesco Alesiani, Dirk Pflüger, and Mathias Niepert. Pdebench: An extensive benchmark for scientific machine learning, 2024. URL https://arxiv.org/abs/2210.07182
-
[57]
Improving and generalizing flow-based generative models with minibatch optimal transport
Alexander Tong, Kilian Fatras, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector-Brooks, Guy Wolf, and Yoshua Bengio. Improving and generalizing flow-based generative models with minibatch optimal transport, 2024. URL https://arxiv.org/abs/2302.00482
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[58]
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Harts...
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[59]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, ukasz Kaiser, and Illia Polosukhin. Attention is all you need. Advances in neural information processing systems, 30, 2017
work page 2017
-
[60]
Physics-guided training of gan to improve accuracy in airfoil design synthesis, 2023
Kazunari Wada, Katsuyuki Suzuki, and Kazuo Yonekura. Physics-guided training of gan to improve accuracy in airfoil design synthesis, 2023. URL https://arxiv.org/abs/2308.10038
-
[61]
Progressive autoregressive video diffusion models
Desai Xie, Zhan Xu, Yicong Hong, Hao Tan, Difan Liu, Feng Liu, Arie Kaufman, and Yang Zhou. Progressive autoregressive video diffusion models. In Proceedings of the Computer Vision and Pattern Recognition Conference, pp.\ 6322--6332, 2025
work page 2025
-
[62]
In-context operator learning with data prompts for differential equation problems
Liu Yang, Siting Liu, Tingwei Meng, and Stanley J Osher. In-context operator learning with data prompts for differential equation problems. Proceedings of the National Academy of Sciences, 120 0 (39): 0 e2310142120, 2023
work page 2023
-
[63]
Pdeformer: Towards a foundation model for one-dimensional partial differential equations, 2025
Zhanhong Ye, Xiang Huang, Leheng Chen, Hongsheng Liu, Zidong Wang, and Bin Dong. Pdeformer: Towards a foundation model for one-dimensional partial differential equations, 2025. URL https://arxiv.org/abs/2402.12652
-
[64]
Mattergen: a generative model for inorganic materials design.arXiv preprint arXiv:2312.03687, 2023
Claudio Zeni, Robert Pinsler, Daniel Zügner, Andrew Fowler, Matthew Horton, Xiang Fu, Sasha Shysheya, Jonathan Crabbé, Lixin Sun, Jake Smith, Bichlien Nguyen, Hannes Schulz, Sarah Lewis, Chin-Wei Huang, Ziheng Lu, Yichi Zhou, Han Yang, Hongxia Hao, Jielan Li, Ryota Tomioka, and Tian Xie. Mattergen: a generative model for inorganic materials design, 2024. ...
-
[65]
Upscale-a-video: Temporal-consistent diffusion model for real-world video super-resolution
Shangchen Zhou, Peiqing Yang, Jianyi Wang, Yihang Luo, and Chen Change Loy. Upscale-a-video: Temporal-consistent diffusion model for real-world video super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.\ 2535--2545, 2024
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.