Modeling Nonstationary Extremal Dependence via Deep Spatial Deformations

Jordan Richards; Raphael Huser; Xuanjie Shao

arxiv: 2505.12548 · v2 · submitted 2025-05-18 · 📊 stat.ME · stat.ML

Modeling Nonstationary Extremal Dependence via Deep Spatial Deformations

Xuanjie Shao , Jordan Richards , Raphael Huser This is my paper

Pith reviewed 2026-05-22 13:50 UTC · model grok-4.3

classification 📊 stat.ME stat.ML

keywords nonstationary extremal dependencespatial warpingdeep compositional modelsr-Pareto processesprecipitation extremesspatial statisticsthreshold exceedances

0 comments

The pith

Deep compositional warps turn nonstationary spatial extremes into stationary latent processes.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops deep compositional spatial models that warp the observed domain into a latent space where standard stationary and isotropic assumptions hold for extremal dependence. By focusing on high threshold exceedances modeled via limiting r-Pareto processes, the approach enables efficient inference that scales to large numbers of locations. Simulations show improved recovery of the warping function compared to earlier methods, and the technique is demonstrated on UK precipitation extremes observed at thousands of sites.

Core claim

We overcome these challenges by developing deep compositional spatial models to capture nonstationarity in extremal dependence. Specifically, we focus on modeling high threshold exceedances of process functionals by leveraging efficient inference methods for limiting r-Pareto processes. A detailed high-dimensional simulation study demonstrates the superior performance of our model in estimating the warped space. We illustrate our method by modeling UK precipitation extremes and show that we can efficiently estimate the extremal dependence structure of data observed at thousands of locations.

What carries the argument

Deep compositional warping function that deforms the physical domain into a latent stationary isotropic space for r-Pareto process modeling.

Load-bearing premise

The deep compositional warping function remains bijective and produces physically realistic transformations without domain folding.

What would settle it

A fitted warping function that maps two distinct observed locations to the same latent point or produces crossing trajectories in the transformed coordinates would indicate the central claim fails for that dataset.

read the original abstract

Modeling nonstationarity that often prevails in extremal dependence of spatial data can be challenging, and typically requires bespoke or complex spatial models that are difficult to estimate. Inference for stationary and isotropic models is considerably easier, but the assumptions that underpin these models are rarely met by data observed over large or topographically complex domains. A possible approach for accommodating nonstationarity in a spatial model is to warp the spatial domain to a latent space where stationarity and isotropy can be reasonably assumed. Although this approach is very flexible, estimating the warping function can be computationally expensive, and the transformation is not always guaranteed to be bijective, which may lead to physically unrealistic transformations when the domain folds onto itself. We overcome these challenges by developing deep compositional spatial models to capture nonstationarity in extremal dependence. Specifically, we focus on modeling high threshold exceedances of process functionals by leveraging efficient inference methods for limiting r-Pareto processes. A detailed high-dimensional simulation study demonstrates the superior performance of our model in estimating the warped space. We illustrate our method by modeling UK precipitation extremes and show that we can efficiently estimate the extremal dependence structure of data observed at thousands of locations.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes deep compositional spatial models to model nonstationary extremal dependence by learning a warping function that maps the observed spatial domain to a latent space in which an r-Pareto process is stationary and isotropic. Efficient inference is developed for high-threshold exceedances, with claims of superior performance in a high-dimensional simulation study for recovering the warped space and successful application to UK precipitation extremes observed at thousands of locations.

Significance. If the bijectivity and physical realism of the learned warping can be rigorously established, the approach would offer a flexible, scalable alternative to bespoke nonstationary extremal models, enabling reliable inference for large spatial datasets. The high-dimensional simulation study and the real-data illustration on UK precipitation provide concrete evidence of computational feasibility and practical applicability.

major comments (2)

[Abstract and Section 3] Abstract and Section 3 (warping construction): The claim that the deep compositional construction 'overcomes' non-bijectivity and folding is not supported by an explicit mechanism (e.g., strictly monotone radial basis functions, invertible residual blocks with positive Jacobian constraint, or post-hoc verification). Without such a guarantee, the r-Pareto process in latent space is misspecified whenever the map folds, undermining consistency of the extremal dependence estimates.
[Section 4] Section 4 (simulation study): The reported superior performance in estimating the warped space is shown only via visual or aggregate metrics; no quantitative assessment of bijectivity (e.g., minimum Jacobian determinant or folding detection) is provided for the fitted maps, leaving open whether the stationarity assumption holds in the reported high-dimensional cases.

minor comments (2)

[Sections 2-4] Notation for the compositional layers and the r-Pareto process parameters should be unified across the methods and simulation sections to avoid ambiguity.
[Section 5] The real-data application would benefit from a brief discussion of how the estimated warping respects known topographic features of the UK domain.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed comments on our manuscript. The feedback highlights important aspects of bijectivity that warrant clarification and additional supporting material. We address each major comment below and describe the revisions we will incorporate.

read point-by-point responses

Referee: [Abstract and Section 3] Abstract and Section 3 (warping construction): The claim that the deep compositional construction 'overcomes' non-bijectivity and folding is not supported by an explicit mechanism (e.g., strictly monotone radial basis functions, invertible residual blocks with positive Jacobian constraint, or post-hoc verification). Without such a guarantee, the r-Pareto process in latent space is misspecified whenever the map folds, undermining consistency of the extremal dependence estimates.

Authors: We thank the referee for this observation. The manuscript states that the deep compositional models overcome challenges including potential non-bijectivity, but does not supply an explicit mechanism such as a positive Jacobian constraint or formal invertibility proof. In practice the architecture employs residual blocks and activations chosen to promote smooth, non-folding deformations, and the simulation results indicate that the learned maps recover the target warping without visible folding. To strengthen the presentation we will revise Section 3 to describe these architectural choices in greater detail and add a short discussion of conditions under which the composition remains bijective. We will also include post-hoc Jacobian-determinant checks in the numerical experiments to verify that the stationarity assumption holds for the reported fits. revision: yes
Referee: [Section 4] Section 4 (simulation study): The reported superior performance in estimating the warped space is shown only via visual or aggregate metrics; no quantitative assessment of bijectivity (e.g., minimum Jacobian determinant or folding detection) is provided for the fitted maps, leaving open whether the stationarity assumption holds in the reported high-dimensional cases.

Authors: We agree that quantitative verification of bijectivity would provide stronger support for the validity of the latent-space stationarity assumption. The current simulation study relies on visual inspection of the recovered warping and aggregate performance metrics. In the revised manuscript we will augment Section 4 with explicit quantitative diagnostics: the minimum Jacobian determinant evaluated on a dense grid of test points, together with a simple folding-detection procedure that flags any regions where the map is non-injective. These additions will be reported for all high-dimensional simulation scenarios. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation relies on novel model construction validated on independent data

full rationale

The paper proposes deep compositional spatial models to warp the domain for stationary r-Pareto processes in extremal dependence modeling. The abstract outlines the approach to overcome non-bijectivity issues via compositional construction and reports performance on a high-dimensional simulation study plus UK precipitation application at thousands of locations. No quoted equations or steps reduce by construction to inputs, fitted parameters renamed as predictions, or load-bearing self-citations; the central claims depend on external empirical validation rather than tautological redefinition.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based solely on the abstract; no explicit free parameters, axioms, or invented entities are enumerated. The approach implicitly relies on the validity of the r-Pareto limiting approximation after warping.

axioms (1)

domain assumption High-threshold exceedances of the process can be approximated by a limiting r-Pareto process once the domain is warped to stationarity and isotropy.
Stated as the focus of the modeling approach in the abstract.

pith-pipeline@v0.9.0 · 5735 in / 1191 out tokens · 64079 ms · 2026-05-22T13:50:39.463224+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We overcome these challenges by developing deep compositional spatial models to capture nonstationarity in extremal dependence... leveraging efficient inference methods for limiting r-Pareto processes.
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

the warping function f(·) = f_n ◦ f_{n−1} ◦ · · · ◦ f_1(·) of simple bijective warping functions
IndisputableMonolith/Foundation/AlexanderDuality.lean alexander_duality_circle_linking unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Möbius transformation units... K=2

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.