An ML model trained only on harmonized gridded observations achieves competitive medium-range weather forecast skill with the IFS for several upper-air and surface headline scores when verified against observations.
Baseline reference
Title resolution pending
Baseline reference. 55% of citing Pith papers use this work as a benchmark or comparison.
citation-role summary
citation-polarity summary
representative citing papers
HO-FNO extends standard FNO with n-linear spectral mixing and shows improved accuracy on nonlinear PDE benchmarks, sometimes with a single layer beating deeper FNO models.
A novel observationally constrained probabilistic trigger for mesoscale convective systems improves spatiotemporal scales of tropical precipitation and ensemble spread in NWP models compared to prior MCSP schemes.
Trains ACE emulator on independent SST-CO2 variations plus energy constraint to improve accuracy in decoupled climate forcing scenarios.
GPROF-IR is a CNN-based retrieval that uses temporal context in geostationary IR observations to produce precipitation estimates with lower error than prior IR methods and climatological consistency with PMW retrievals for integration into IMERG V08.
A multilinear operator learned on PCA coefficients maps time-since-ignition inputs to smoke outputs, matching Monte Carlo accuracy with half the model calls and outperforming prior classifiers on holdout data.
Cast3 translates NWP principles into a data-driven model using cubed-sphere grids, super-ensembles, and generative nudging to achieve state-of-the-art ensemble predictions that outperform baselines.
Time series foundation models match the performance of specialized models for day-ahead load forecasting while providing explanations that match domain knowledge on weather and calendar effects.
SCARFACE provides a harmonized annual panel dataset of over 2,700 socio-economic, environmental, and agricultural indicators for 256 sub-regions in the Po Valley, Italy, spanning 2011 to 2024.
Relaxed unitary convolutions for GNNs on meshes balance smoothness preservation with natural smoothing in dynamics, outperforming unitary convolutions and other models on PDEs and weather tasks.
FPPF uses a learned conditional generative proposal approximating the optimal proposal in particle filters, with tractable likelihoods for Bayesian updates and localization for high dimensions, outperforming baselines on nonlinear non-Gaussian systems.
MTGNN with hybrid adjacency matrix reconstructs GRACE-like TWS anomalies to 1940, reaching basin-mean correlation 0.94 and competitive performance with fewer predictors than baselines.
Online conformal prediction post-processing guarantees calibrated uncertainty coverage for GenCast, NeuralGCM, and AIFS-ENS forecasts of temperature and precipitation including extremes.
A zero-shot pipeline with physics-informed synthetic histories lets time-series foundation models outperform baselines by 1.7-2x in cold-start PV forecasting on 440 sites across four datasets.
NTK-UQ produces 31-37% sharper 90% prediction intervals than split conformal prediction for extreme weather forecasts, with adaptive scaling via architecture-dependent eigenvalue truncation and ICA decomposition of last-layer features.
Distributional autoencoders trained on climate model simulations model full conditional distributions of European temperature fields to enable probabilistic storyline attribution, illustrated by higher intensities and probability ratios for a 2003-like heatwave in 2028 and 2053.
ML climate emulators degrade under seasonal distribution shifts that proxy long-term climate change, but physically motivated compositional decompositions improve out-of-distribution performance with modest in-distribution trade-offs.
AxiomOcean deploys a 3D encoder-backbone-decoder architecture that jointly predicts upper-ocean variables and outperforms prior AI models by 20-35% in day-1 RMSE while preserving eddy kinetic energy and vertical consistency.
Commutativity regularization mitigates transient error amplification in autoregressive neural simulators by penalizing non-normality and non-commutativity of Jacobians, yielding stable long-horizon rollouts.
AIMIP Phase 1 sets up a common experiment and five evaluation criteria for AI atmosphere models forced by historical sea surface temperatures, finding they match conventional models on most metrics but underestimate some warming trends and diverge on out-of-sample tests.
A new optimization algorithm with double machine learning for wildfire spread estimation enables better crew assignments that reduce total area burned.
M-CaStLe generalizes local stencil-based causal discovery to the multivariate case and decomposes resulting graphs into reaction and spatial components for interpretation in space-time gridded data.
ESFM is a single open foundation model that unifies heterogeneous Earth data sources and forecasts missing regions while preserving inter-variable physical relationships.
Probabilistic bias correction doubles AI subseasonal forecast skill and wins a 2025 international competition by correcting biases in ECMWF models for pressure, temperature, and precipitation.
citing papers explorer
-
AIFS-DOP: End-to-End Medium-Range Weather Prediction from Observations Alone with Machine Learning
An ML model trained only on harmonized gridded observations achieves competitive medium-range weather forecast skill with the IFS for several upper-air and surface headline scores when verified against observations.
-
Higher-Order Fourier Neural Operator: Explicit Mode Mixer for Nonlinear PDEs
HO-FNO extends standard FNO with n-linear spectral mixing and shows improved accuracy on nonlinear PDE benchmarks, sometimes with a single layer beating deeper FNO models.
-
An observationally constrained probabilistic trigger for organized deep convection in an NWP ensemble
A novel observationally constrained probabilistic trigger for mesoscale convective systems improves spatiotemporal scales of tropical precipitation and ensemble spread in NWP models compared to prior MCSP schemes.
-
Disentangling the effects of sea surface temperature and CO$_2$ in global machine learned weather-climate emulators
Trains ACE emulator on independent SST-CO2 variations plus energy constraint to improve accuracy in decoupled climate forcing scenarios.
-
GPROF-IR: An Improved Single-Channel Infrared Precipitation Retrieval for Merged Satellite Precipitation Products
GPROF-IR is a CNN-based retrieval that uses temporal context in geostationary IR observations to produce precipitation estimates with lower error than prior IR methods and climatological consistency with PMW retrievals for integration into IMERG V08.
-
Enabling Real-Time Training of a Wildfire-to-Smoke Map with Multilinear Operators
A multilinear operator learned on PCA coefficients maps time-since-ignition inputs to smoke outputs, matching Monte Carlo accuracy with half the model calls and outperforming prior classifiers on holdout data.
-
Cast3: Translating numerical weather prediction principles into data-driven forecasting
Cast3 translates NWP principles into a data-driven model using cubed-sphere grids, super-ensembles, and generative nudging to achieve state-of-the-art ensemble predictions that outperform baselines.
-
Explainable Load Forecasting with Covariate-Informed Time Series Foundation Models
Time series foundation models match the performance of specialized models for day-ahead load forecasting while providing explanations that match domain knowledge on weather and calendar effects.
-
Smoothness Errors in Dynamics Models and How to Avoid Them
Relaxed unitary convolutions for GNNs on meshes balance smoothness preservation with natural smoothing in dynamics, outperforming unitary convolutions and other models on PDEs and weather tasks.
-
Generative Model Proposal based Particle Filtering for Data Assimilation
FPPF uses a learned conditional generative proposal approximating the optimal proposal in particle filters, with tractable likelihoods for Bayesian updates and localization for high dimensions, outperforming baselines on nonlinear non-Gaussian systems.
-
Reconstructing GRACE Terrestrial Water Storage with Spatio-Temporal Graph Neural Networks: An Application to South America
MTGNN with hybrid adjacency matrix reconstructs GRACE-like TWS anomalies to 1940, reaching basin-mean correlation 0.94 and competitive performance with fewer predictors than baselines.
-
Rigorous uncertainty quantification of probabilistic AI weather forecasts with conformal prediction
Online conformal prediction post-processing guarantees calibrated uncertainty coverage for GenCast, NeuralGCM, and AIFS-ENS forecasts of temperature and precipitation including extremes.
-
Time series Foundation Models based on Physics-Informed Synthetic Histories for Cold-Start Photovoltaic Forecasting
A zero-shot pipeline with physics-informed synthetic histories lets time-series foundation models outperform baselines by 1.7-2x in cold-start PV forecasting on 440 sites across four datasets.
-
Scalable Uncertainty Quantification for Extreme Weather Forecasting via Empirical Neural Tangent Kernels
NTK-UQ produces 31-37% sharper 90% prediction intervals than split conformal prediction for extreme weather forecasts, with adaptive scaling via architecture-dependent eigenvalue truncation and ICA decomposition of last-layer features.
-
Probabilistic storyline attribution using machine learning
Distributional autoencoders trained on climate model simulations model full conditional distributions of European temperature fields to enable probabilistic storyline attribution, illustrated by higher intensities and probability ratios for a 2003-like heatwave in 2028 and 2053.
-
AxiomOcean: Forecasting the Three-Dimensional Structure of the Upper Ocean
AxiomOcean deploys a 3D encoder-backbone-decoder architecture that jointly predicts upper-ocean variables and outperforms prior AI models by 20-35% in day-1 RMSE while preserving eddy kinetic energy and vertical consistency.
-
AIMIP Phase 1: systematic evaluations of AI weather and climate models
AIMIP Phase 1 sets up a common experiment and five evaluation criteria for AI atmosphere models forced by historical sea surface temperatures, finding they match conventional models on most metrics but underestimate some warming trends and diverge on out-of-sample tests.
-
Predictive and Prescriptive AI toward Optimizing Wildfire Suppression
A new optimization algorithm with double machine learning for wildfire spread estimation enables better crew assignments that reduce total area burned.
-
M-CaStLe: Uncovering Local Causal Structures in Multivariate Space-Time Gridded Data
M-CaStLe generalizes local stencil-based causal discovery to the multivariate case and decomposes resulting graphs into reaction and spatial components for interpretation in space-time gridded data.
-
Earth System Foundation Model (ESFM): A unified framework for heterogeneous data integration and forecasting
ESFM is a single open foundation model that unifies heterogeneous Earth data sources and forecasts missing regions while preserving inter-variable physical relationships.
-
Enhancing AI and Dynamical Subseasonal Forecasts with Probabilistic Bias Correction
Probabilistic bias correction doubles AI subseasonal forecast skill and wins a 2025 international competition by correcting biases in ECMWF models for pressure, temperature, and precipitation.
-
Non-stationary time series attribution for heatwaves over Europe
A non-stationary Markov process with bivariate extreme value theory attributes full heatwave time series over Europe to anthropogenic forcing via likelihood ratios between ERA5 and CMIP6 runs, finding strong evidence since the 1970s but no signal beyond mean temperature increase.
-
Otter Weather: Skillful and Computationally Efficient Medium-Range Weather Forecasting
Otter Weather is a spatiotemporal model that outperforms NWP baselines by 9.6% at 24h lead with under 3.5 A100-days training and extends efficiency gains to probabilistic forecasting via CRPS.
-
Towards Fair Comparisons of AI- and Physics-Based Weather Models for Extreme Events via the Weighted Potential CRPS
Extends Potential CRPS with weights and IDR post-processing to enable fair comparisons of AIWP and NWP models on extreme weather, finding AI models more informative across most variables and thresholds.
-
CMIP-Forge: An Agentic System that Retrieves, Computes, and Self-Reviews Climate Science
CMIP-Forge presents a retrieval-augmented agentic system with automated guardrails and adversarial self-review for autonomous execution of climate research tasks on CMIP6 literature and ESGF data.
-
Water vapor buoyancy and the African easterly jet
The negative meridional moisture gradient reduces African easterly jet magnitude by 30% through vapor buoyancy counteracting the temperature gradient in thermal wind balance, with the effect strengthening under global warming in CMIP6 data.
-
Prediction and Predictability of the Wet-Season Rainfall over Southeast India
Wet-season rainfall over southeast India is increasing in amount and variability but shows potential predictability up to 10 months ahead from tropical sea surface temperature networks.
-
Dynamics of East Atlantic seed vortex populations in global km-scale models
The explicit-convection km-scale simulation produces fewer and weaker Atlantic hurricanes than parameterized coarser runs because seed vortices fail to amplify after crossing the West African coast due to weaker top-heavy mass flux profiles and underestimated MCS stratiform components.
-
Mechanistic Interpretability Tool for AI Weather Models
An open-source tool is developed for mechanistic interpretability of AI weather models, demonstrated on GraphCast by identifying latent directions corresponding to interpretable weather features.
-
CycloneMAE: A Scalable Multi-Task Learning Model for Global Tropical Cyclone Probabilistic Forecasting
CycloneMAE uses a TC structure-aware masked autoencoder with discrete probabilistic gridding and pre-train/fine-tune to deliver both deterministic and probabilistic forecasts, outperforming NWP systems in pressure and wind up to 120 hours and track up to 24 hours across five basins.
-
Structural Gating and Effect-aligned Lag-resolved Temporal Causal Discovery Framework with Application to Heat-Pollution Extremes
SGED-TCD is a lag-resolved causal discovery framework that uses structural gating and perturbation-effect alignment to infer interpretable weighted causal networks from complex time series, shown on heat-pollution extremes in China.
-
Sampling Parallelism for Fast and Efficient Bayesian Learning
Sampling parallelism distributes Bayesian sample evaluations across GPUs for near-perfect scaling, lower memory use, and faster convergence via per-GPU data augmentations, outperforming pure data parallelism in diversity.
-
Event-Aware Loss Design for Forecasting of Convective Precipitation and Lightning
A multi-task Patch-cGAN with lightning-derived spatial loss weighting improves post-processed forecasts of intense precipitation and lightning occurrence over the Korean Peninsula in summer 2025.
-
AI and physics-based weather forecasting: A comparative study
Raw IFS forecasts outperform raw AIFS for wind speed at all horizons, but post-processing with EMOS or QR reduces the gap, leaving IFS ahead mainly at short leads.
-
Achieving Skilled and Reliable Daily Probabilistic Forecasts of Wind Power at Subseasonal-to-Seasonal Timescales over France
A post-processing pipeline applied to ECMWF subseasonal ensembles produces calibrated daily wind power forecasts for France that improve on climatology by 5-15% in CRPS up to 16 days ahead.
-
Modelling convective cell occurrence in proximity to cold fronts using extreme gradient boosting
An XGBoost model reproduces convective cell frequency near cold fronts with high skill but underestimates counts at the surface front, depending most on CAPE and time of day.
-
Comparison of Two Operational Microphysics Schemes Across Various Regional-MPAS Simulations
Comparison of NSSL and TEMPO microphysics in MPAS-A shows structural differences in convection but both schemes produce less organized storms and poorer rainfall matches to observations than to each other.
-
Reviewing methods and assumptions for high-resolution large-scale onshore wind energy potential assessments
A critical review of methods for estimating onshore wind energy potentials at multiple levels, with an attempt to derive best practice recommendations.