Using machine learning to downscale coarse-resolution environmental variables for understanding the spatial frequency of convective storms

Hungjui Yu; Imme Ebert-Uphoff; Kristen L. Rasmussen; Lander Ver Hoef

arxiv: 2509.08802 · v1 · submitted 2025-09-10 · ⚛️ physics.ao-ph

Using machine learning to downscale coarse-resolution environmental variables for understanding the spatial frequency of convective storms

Hungjui Yu , Lander Ver Hoef , Kristen L. Rasmussen , Imme Ebert-Uphoff This is my paper

Pith reviewed 2026-05-18 17:09 UTC · model grok-4.3

classification ⚛️ physics.ao-ph

keywords machine learningconvective stormsdownscalingclimate modelingneural networksenvironmental variablesstorm frequency

0 comments

The pith

Pixel-based neural networks can predict convective storm frequency from coarse environmental variables with SSIM above 0.8 while capturing diurnal and orographic patterns.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper trains simple neural networks that treat each grid point independently to map environmental variables from a convection-permitting model onto maps of convective storm frequency. These models reach structural similarity values over 0.8 and reproduce the daily cycle of storms plus mountain-driven convection even when given no explicit time or location information. The work shows that machine learning can act as an efficient link between the large-scale conditions resolved by global climate models and the kilometer-scale convective processes those models cannot simulate directly. Results weaken when fewer input variables are used or when training excludes certain regions, indicating that diverse physical drivers must be represented for reliable performance.

Core claim

We train simple, pixel-based neural networks to predict convective storm frequency from environmental variables produced by a regional convection-permitting model. The ML models achieve promising results, with structural similarity index measure (SSIM) values exceeding 0.8, capturing the diurnal cycle and orographic convection without explicit temporal or spatial coordinates as input. Unlike convolutional neural networks, the pixel-based approach treats each grid point independently, enabling value-to-value prediction without spatial context and supporting adaptation to different resolutions.

What carries the argument

A feedforward neural network applied independently at each grid point to map environmental inputs directly to storm frequency values, without using neighboring pixels or explicit coordinates.

If this is right

Machine learning can supply a low-cost way to include convective effects inside large-scale climate simulations.
Performance improves when the training data include a wide range of physical mechanisms from multiple regions.
The networks can highlight which environmental conditions most strongly control local storm frequency.
The pixel-independent design lets the same model be used on grids of any resolution without retraining the architecture.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could be retrained directly on observational datasets to produce downscaled storm statistics for historical climate analysis.
Feature-importance analysis on the trained networks might reveal previously under-appreciated links between large-scale moisture or wind patterns and storm occurrence.
Testing the networks on climate-model output from future scenarios would show whether the learned relationships remain stable under changed mean conditions.

Load-bearing premise

The relationships between environmental variables and storm frequency learned from one regional high-resolution model will hold when the same networks are applied to coarser global climate model output or to other geographic domains.

What would settle it

Apply the trained networks to output from a different convection-permitting model or to real observational records of storm frequency and measure whether SSIM remains above 0.8 across multiple regions and seasons.

read the original abstract

Global climate models (GCMs), typically run at ~100-km resolution, capture large-scale environmental conditions but cannot resolve convection and cloud processes at kilometer scales. Convection-permitting models offer higher-resolution simulations that explicitly simulate convection but are computationally expensive and impractical for large ensemble runs. This study explores machine learning (ML) as a bridge between these approaches. We train simple, pixel-based neural networks to predict convective storm frequency from environmental variables produced by a regional convection-permitting model. The ML models achieve promising results, with structural similarity index measure (SSIM) values exceeding 0.8, capturing the diurnal cycle and orographic convection without explicit temporal or spatial coordinates as input. Model performance declines when fewer input features are used or specific regions are excluded, underscoring the role of diverse physical mechanisms in convective activity. These findings highlight ML potential as a computationally efficient tool for representing convection and as a means of scientific discovery, offering insights into convective processes. Unlike convolutional neural networks, which depend on spatial structure and grid size, the pixel-based model treats each grid point independently, enabling value-to-value prediction without spatial context. This design enhances adaptability to resolution changes and supports generalization to unseen environmental regimes, making it particularly suited for linking environmental conditions to convective features and for application across diverse model grids or climate scenarios.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Pixel-wise NN gets SSIM above 0.8 on held-out CPM points and surfaces diurnal/orographic signals without coordinates, but the transfer to GCM output remains untested.

read the letter

The paper shows that a simple pixel-based neural network can map environmental fields to convective storm frequency inside a regional convection-permitting simulation, reaching SSIM values over 0.8 while reproducing the diurnal cycle and orographic patterns without any explicit time or location inputs. Performance drops when features are removed or regions are withheld, which at least indicates the network is using a mix of physical signals rather than latching onto one artifact.

Referee Report

2 major / 2 minor

Summary. The manuscript trains simple pixel-based neural networks on environmental variables (e.g., CAPE, shear, humidity) from a regional convection-permitting model to predict convective storm frequency. It reports SSIM values exceeding 0.8 on held-out CPM grid points, with the models reproducing diurnal cycles and orographic signals without explicit spatial or temporal inputs. The authors highlight the pixel-independent design for potential application to coarser GCM grids and different domains, and note performance degradation with reduced features or withheld regions.

Significance. If the learned mappings prove transferable, the work offers a computationally lightweight, resolution-agnostic route to represent sub-grid convection in GCMs and to extract physical insights from environmental predictors. The avoidance of convolutional architectures is a clear methodological advantage for cross-grid adaptability.

major comments (2)

[Abstract and §4] Abstract and §4 (results on generalization): The assertion that the pixel-based approach 'supports generalization to unseen environmental regimes' and is 'particularly suited for application across diverse model grids or climate scenarios' is not supported by any experiment that applies the trained network to actual GCM output fields (different resolution, bias structure, and large-scale forcing) and compares against independent high-resolution truth or observations; all quantitative metrics remain within the training CPM distribution.
[§3 and §4] §3 (methods) and §4 (results): No quantitative cross-validation protocol, uncertainty estimates, or error bars are supplied for the reported SSIM values >0.8; the text only states that performance declines with fewer features or excluded regions, without specifying the exact ablation design or statistical significance of those declines.

minor comments (2)

[Figure captions and §4] Figure captions and §4: Add explicit labels for the number of input features used in each ablation experiment and the geographic extent of the withheld regions to allow readers to reproduce the sensitivity tests.
[Throughout] Throughout: Define acronyms (SSIM, CPM, GCM) at first use and ensure consistent capitalization of 'convection-permitting model'.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which help clarify the scope and presentation of our results. We respond to each major comment below and outline the corresponding revisions.

read point-by-point responses

Referee: [Abstract and §4] The assertion that the pixel-based approach 'supports generalization to unseen environmental regimes' and is 'particularly suited for application across diverse model grids or climate scenarios' is not supported by any experiment that applies the trained network to actual GCM output fields (different resolution, bias structure, and large-scale forcing) and compares against independent high-resolution truth or observations; all quantitative metrics remain within the training CPM distribution.

Authors: We agree that the manuscript contains no direct experiments transferring the trained model to GCM fields. The statements in the abstract and §4 are motivated by the pixel-independent architecture, which avoids dependence on fixed spatial neighborhoods or grid resolution, together with the observed robustness to feature ablation and regional withholding within the CPM domain. These elements provide a methodological basis for expecting transferability, but we acknowledge that this remains an untested extrapolation. We will revise the abstract and §4 to replace the stronger claims with language that describes the potential for such applications based on the design, while explicitly noting that validation against GCM output or independent observations is left for future work. revision: yes
Referee: [§3 and §4] No quantitative cross-validation protocol, uncertainty estimates, or error bars are supplied for the reported SSIM values >0.8; the text only states that performance declines with fewer features or excluded regions, without specifying the exact ablation design or statistical significance of those declines.

Authors: We accept that the current text lacks sufficient detail on the evaluation protocol and statistical characterization. In the revised manuscript we will add an explicit description of the train–test partitioning (random 80/20 split of grid points, with temporal blocks preserved to avoid leakage), report SSIM as mean ± standard deviation across five independent training runs with different random seeds, and include error bars on all relevant figures. For the ablation experiments we will specify the precise feature subsets removed and the geographic masks applied, together with the quantitative SSIM changes and a brief note on whether the differences exceed the run-to-run variability. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained

full rationale

The paper trains pixel-based neural networks to map environmental variables to convective storm frequency using output from a single regional convection-permitting model, then reports SSIM performance on held-out grid points from the same simulation. This constitutes a standard supervised learning evaluation against an independent test split; the metric is computed directly from model predictions versus withheld target fields and is not algebraically forced by any fitted parameter or self-referential definition. No load-bearing self-citation, uniqueness theorem, or ansatz is invoked to justify the central result. Claims regarding future application to GCMs or new domains are aspirational and do not participate in the reported derivation chain.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that high-resolution model output supplies a reliable training target and that the learned mapping will transfer to coarser models; no new physical entities are introduced.

free parameters (1)

Neural-network architecture and training hyperparameters
Number of layers, learning rate, and feature selection are chosen to optimize fit on the training data.

axioms (1)

domain assumption Environmental variables at coarse resolution contain sufficient information to statistically predict convective-storm occurrence frequency.
Invoked when the networks are trained to map environmental fields directly to storm-frequency fields.

pith-pipeline@v0.9.0 · 5784 in / 1292 out tokens · 43914 ms · 2026-05-18T17:09:04.825754+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

3 extracted references · 3 canonical work pages

[1]

Data and methodology The primary objective of this study is to establish relationships between the environmental conditions and sub-grid-scale cloud features. To achieve this, we utilize long-term, convection-permitting regional climate model simulations in which both environmental variables and cloud characteristics are reasonably represented. The four-k...

work page 2023
[2]

We apply the trained models to the hourly environmental variables on the coarsened 28-km grid to generate hourly predictions of convective spatial frequency for the testing period

Model Performance We use hourly convective frequency data from the CONUS404 CTRL simulation from January to September 2022 as the testing target (ground truth) to evaluate the model performance. We apply the trained models to the hourly environmental variables on the coarsened 28-km grid to generate hourly predictions of convective spatial frequency for t...

work page 2022
[3]

Conclusions In this study we demonstrate that machine learning (ML) techniques such as multilayer perceptron (MLP) models can effectively predict sub-grid-scale convective cloud information, such as convective frequency, from environmental variables alone. MLP models trained on coarsened 28-km grid data exhibit robust performance across different initiali...

work page doi:10.5065/zyy0-y036 2002

[1] [1]

Data and methodology The primary objective of this study is to establish relationships between the environmental conditions and sub-grid-scale cloud features. To achieve this, we utilize long-term, convection-permitting regional climate model simulations in which both environmental variables and cloud characteristics are reasonably represented. The four-k...

work page 2023

[2] [2]

We apply the trained models to the hourly environmental variables on the coarsened 28-km grid to generate hourly predictions of convective spatial frequency for the testing period

Model Performance We use hourly convective frequency data from the CONUS404 CTRL simulation from January to September 2022 as the testing target (ground truth) to evaluate the model performance. We apply the trained models to the hourly environmental variables on the coarsened 28-km grid to generate hourly predictions of convective spatial frequency for t...

work page 2022

[3] [3]

Conclusions In this study we demonstrate that machine learning (ML) techniques such as multilayer perceptron (MLP) models can effectively predict sub-grid-scale convective cloud information, such as convective frequency, from environmental variables alone. MLP models trained on coarsened 28-km grid data exhibit robust performance across different initiali...

work page doi:10.5065/zyy0-y036 2002