pith. sign in

arxiv: 1907.05750 · v1 · pith:N4JKQQ2Jnew · submitted 2019-07-12 · 📊 stat.AP

A Regionalisation Approach for Rainfall based on Extremal Dependence

Pith reviewed 2026-05-24 22:05 UTC · model grok-4.3

classification 📊 stat.AP
keywords regionalisationextremal dependenceclusteringmax-stable processesrainfall extremesspatial dependenceAustralia
0
0 comments X

The pith

Clustering stations by extremal dependence produces regions where max-stable models fit reliably.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a regionalisation method that groups rainfall stations into clusters sharing similar patterns of extremal dependence. This tackles the issue that dependence structures change across large areas, which prevents direct fitting of models like max-stable processes over entire continents. The clustering creates smaller regions inside which dependence can be treated as stationary. Applied to Australian stations, the resulting partitions are compared against known climate and topographic features. Max-stable models are then fitted within each region to assess whether the partitions support reliable modeling.

Core claim

The central claim is that a clustering procedure based on extremal dependence measures successfully partitions rainfall stations into regions of comparable tail dependence, such that a stationary dependence structure holds inside each region and max-stable processes can be fitted to the data within those regions even when the full domain is geographically large.

What carries the argument

The extremal dependence clustering algorithm that assigns stations to groups according to similarity in tail dependence measures between pairs of stations.

If this is right

  • Max-stable processes become feasible to fit on large domains by applying them separately inside each derived region.
  • The resulting regions correspond to known climate and topographic divisions across Australia.
  • The partitioning supplies a practical route for handling non-stationary spatial dependence by reducing it to a collection of stationary sub-problems.
  • Model fitting inside the regions serves as a direct check on whether the clusters have captured similar dependence structures.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same clustering step could be applied to other extreme variables such as wind or temperature to produce analogous regional models.
  • Computational cost of fitting spatial extreme models drops when the domain is broken into smaller stationary pieces.
  • If the clusters align with physical drivers of rainfall, the regions may improve risk mapping for infrastructure planning.

Load-bearing premise

That stations grouped together by extremal dependence will exhibit a dependence structure stable enough inside each group for max-stable models to be fitted without major bias.

What would settle it

Finding that fitted max-stable models within the derived regions produce poor matches to the observed joint tail probabilities for pairs of stations that fall inside the same cluster.

read the original abstract

To mitigate the risk posed by extreme rainfall events, we require statistical models that reliably capture extremes in continuous space with dependence. However, assuming a stationary dependence structure in such models is often erroneous, particularly over large geographical domains. Furthermore, there are limitations on the ability to fit existing models, such as max-stable processes, to a large number of locations. To address these modelling challenges, we present a regionalisation method that partitions stations into regions of similar extremal dependence using clustering. To demonstrate our regionalisation approach, we consider a study region of Australia and discuss the results with respect to known climate and topographic features. To visualise and evaluate the effectiveness of the partitioning, we fit max-stable models to each of the regions. This work serves as a prelude to how one might consider undertaking a project where spatial dependence is non-stationary and is modelled on a large geographical scale.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The paper presents a regionalisation method that uses clustering on a measure of extremal dependence to partition rainfall stations into regions, with the goal of enabling reliable fits of max-stable processes within each region; the approach is demonstrated on Australian rainfall stations and evaluated by fitting max-stable models per region, motivated by the need to handle non-stationary dependence over large domains.

Significance. If the clustering produces regions with sufficiently stationary dependence, the method would offer a practical preprocessing step for applying max-stable processes to large spatial datasets where global stationarity fails, addressing both modeling assumptions and computational scalability in spatial extremes analysis.

major comments (2)
  1. [Abstract] Abstract (method motivation paragraph): the central assumption that clustering stations by similarity of extremal dependence produces regions inside which a stationary dependence structure holds for max-stable processes is load-bearing but receives no direct diagnostic; pairwise extremal coefficient similarity does not imply that the dependence function is constant with respect to distance or topography inside each cluster, and the manuscript provides only the visual fit of the max-stable model itself rather than a stationarity check such as parameter stability across sub-regions or constancy of the extremal coefficient function.
  2. [Abstract] Abstract (validation paragraph): the effectiveness of the partitioning is evaluated solely by fitting max-stable models to each region, but without reported quantitative diagnostics (e.g., comparison of fitted parameters or extremal coefficients across sub-clusters or against a null of non-stationarity), it is unclear whether the clustering step has achieved the required intra-region stationarity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments on the motivation and validation sections of the abstract. We address each major comment below.

read point-by-point responses
  1. Referee: [Abstract] Abstract (method motivation paragraph): the central assumption that clustering stations by similarity of extremal dependence produces regions inside which a stationary dependence structure holds for max-stable processes is load-bearing but receives no direct diagnostic; pairwise extremal coefficient similarity does not imply that the dependence function is constant with respect to distance or topography inside each cluster, and the manuscript provides only the visual fit of the max-stable model itself rather than a stationarity check such as parameter stability across sub-regions or constancy of the extremal coefficient function.

    Authors: We agree that pairwise extremal coefficient similarity does not automatically guarantee a constant dependence function within clusters. The clustering groups stations by extremal dependence similarity, and the resulting partitions align with known Australian climate and topographic features; the fitted max-stable models provide visual support for the approach. To strengthen the manuscript we will add explicit quantitative stationarity diagnostics (parameter stability across sub-regions and constancy checks on the extremal coefficient function) in a revised version. revision: yes

  2. Referee: [Abstract] Abstract (validation paragraph): the effectiveness of the partitioning is evaluated solely by fitting max-stable models to each region, but without reported quantitative diagnostics (e.g., comparison of fitted parameters or extremal coefficients across sub-clusters or against a null of non-stationarity), it is unclear whether the clustering step has achieved the required intra-region stationarity.

    Authors: The current evaluation relies on the quality of the fitted max-stable models and their consistency with known regional climate features. We accept that additional quantitative diagnostics would make the validation more rigorous. In revision we will report comparisons of fitted parameters and extremal coefficients across sub-clusters within each region, together with checks against a non-stationarity null. revision: yes

Circularity Check

0 steps flagged

No circularity: clustering method is independent preprocessing step

full rationale

The paper presents a clustering procedure on extremal dependence measures as an independent regionalisation step, followed by separate max-stable model fitting per region solely for visualisation and evaluation. No equations, fitted parameters, or self-citations are shown that would make any claimed result equivalent to its inputs by construction. The derivation chain remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract supplies no explicit free parameters, axioms or invented entities; all technical details are absent.

pith-pipeline@v0.9.0 · 5679 in / 926 out tokens · 19716 ms · 2026-05-24T22:05:44.524620+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.