Reflecting on a Decade of Formalized Tornado Emergencies

Edward C. Wolff; Elijah M. Conklin; Evan T. Gustafson; James S. Goodnight; Joseph E. Trujillo-Falc\'on; Leanne Blind-Doskocil

arxiv: 2510.25011 · v2 · submitted 2025-10-28 · ⚛️ physics.ao-ph

Reflecting on a Decade of Formalized Tornado Emergencies

Edward C. Wolff , James S. Goodnight , Leanne Blind-Doskocil , Elijah M. Conklin , Evan T. Gustafson , Joseph E. Trujillo-Falc\'on This is my paper

Pith reviewed 2026-05-18 03:20 UTC · model grok-4.3

classification ⚛️ physics.ao-ph

keywords tornado emergenciesimpact-based warningstornado forecastingwarning criteriaNWS severe weatherrotational velocitytornado ratingsfatality statistics

0 comments

The pith

Forecasters issue most tornado emergencies without meeting both suggested velocity and risk thresholds, yet these warnings still cover the majority of the strongest tornadoes and many of the deadliest ones.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines the 89 tornado emergencies issued under the formal impact-based warning system between 2014 and 2023. It shows that only a small number of these met both the rotational velocity and significant tornado parameter thresholds recommended in training guidance. Despite the mismatch, the emergencies were used for 70 percent of EF-3 and stronger tornadoes and captured 55 percent of EF-4 events along with 41 percent of tornadoes that caused three or more deaths. The authors therefore suggest revisions to existing training materials so that guidance better matches observed forecaster practice. A reader would care because clearer criteria could improve how rare catastrophic warnings are decided and communicated during severe weather.

Core claim

Analysis of all 89 IBW tornado emergencies issued from 2014-2023 found that forecasters do not appear to follow the suggested criteria for issuance in the majority of cases, with only a handful of tornado emergencies meeting both the VROT and STP thresholds. Regardless, 70% of tornado emergencies were issued for EF-3+ tornadoes, and tornado emergencies covered 55% of all EF-4 tornadoes as well as 41% of all tornadoes resulting in 3 or more fatalities. Based on these results, several updates to current training materials for impact-based tornado warnings are proposed.

What carries the argument

Direct comparison of each issued tornado emergency against the two WDTD-suggested thresholds for rotational velocity and significant tornado parameter, then cross-checked against official tornado damage ratings and fatality counts.

If this is right

Forecasters rely on additional factors beyond the two main thresholds when deciding to issue a tornado emergency.
The special warnings still successfully identify most EF-3 and stronger tornadoes despite the threshold mismatch.
Tornado emergencies cover more than half of all EF-4 tornadoes and 41 percent of those causing multiple fatalities.
Current training materials may need revision to reflect how decisions are actually made in operations.
The protective value of these warnings may depend more on overall situational awareness than on strict threshold compliance.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If thresholds are routinely exceeded in practice, training could shift emphasis toward observed storm behavior and local impacts rather than parameter cutoffs alone.
The pattern may point to a need for expanded criteria that include additional radar signatures or environmental context forecasters already use.
Similar analyses for other high-impact warning phrases could reveal whether the same gap between guidance and practice exists elsewhere.
Testing revised criteria in real-time exercises would show whether updated guidance changes issuance rates or warning outcomes.

Load-bearing premise

The analysis assumes the WDTD-suggested VROT and STP thresholds represent the correct standard against which to judge whether a tornado emergency should have been issued.

What would settle it

Reprocessing the same 89 events with consistent radar and parameter data sources that shows a majority meeting both thresholds at issuance time would undermine the claim of widespread non-adherence to the criteria.

read the original abstract

In 1999 the NWS began using the phrase "tornado emergency" to denote tornado warnings for storms with the potential to cause rare, catastrophic damage. After years of informal usage, tornado emergencies were formally introduced to 46 weather forecasting offices in 2014 as part of the impact-based warning (IBW) program, with a nationwide rollout occurring over the following years. In concert with the new tiered warning approach, the Warning Decision Training Division (WDTD) also introduced suggested criteria for when forecasters should consider upgrading a tornado warning to a tornado emergency, which includes thresholds of rotational velocity (VROT) and significant tornado parameter (STP). Although significant research has studied both tornado forecasting and tornado warning dissemination in the decade since, relatively little work has examined the effectiveness of the tornado emergency specifically. Our analysis of all 89 IBW tornado emergencies issued from 2014-2023 found that forecasters do not appear to follow the suggested criteria for issuance in the majority of cases, with only a handful of tornado emergencies meeting both the VROT and STP thresholds. Regardless, 70% of tornado emergencies were issued for EF-3+ tornadoes, and tornado emergencies covered 55% of all EF-4 tornadoes as well as 41% of all tornadoes resulting in 3 or more fatalities. Based on these results, we propose several updates to the current NWS training materials for impact-based tornado warnings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript analyzes all 89 impact-based warning tornado emergencies issued from 2014-2023. It reports that forecasters do not appear to follow the WDTD-suggested VROT and STP thresholds in the majority of cases, with only a handful meeting both. Nevertheless, 70% of these emergencies were issued for EF-3+ tornadoes, covering 55% of all EF-4 tornadoes and 41% of all tornadoes resulting in 3 or more fatalities. The authors propose updates to NWS training materials for impact-based tornado warnings.

Significance. If the findings hold, this work delivers a useful empirical baseline on the practical use and effectiveness of the formalized tornado emergency designation over a decade. The direct count against external NWS records and published guidelines, without fitted parameters or circular derivations, provides a transparent assessment of whether the program is highlighting rare catastrophic events as intended. The coverage statistics for high-impact tornadoes could inform targeted revisions to forecaster guidance.

major comments (2)

[Methods] Methods section: The manuscript must specify whether the VROT and STP values for each of the 89 events were drawn from real-time operational radar scans and SPC model fields available at issuance or from post-event reanalyses. VROT is sensitive to scan timing, elevation, and range, while STP differs between real-time and later products; using non-contemporaneous values directly affects the count of cases meeting both thresholds and thus the claim that forecasters do not follow the criteria in the majority of cases.
[Results] Results section: Provide the exact number (rather than 'a handful') of the 89 events that met both VROT and STP thresholds. Also detail the matching procedure between the official emergency list, EF ratings, and fatality counts, including any exclusions or potential mismatches, to substantiate the reported 70%, 55%, and 41% figures.

minor comments (2)

[Abstract] Abstract: Replace the qualitative phrase 'a handful' with the precise count of compliant events for clarity and reproducibility.
[Methods] Ensure all data sources (NWS records, WDTD guidelines) are cited with specific references or URLs in the methods or data section.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed comments, which help strengthen the clarity and rigor of our empirical analysis of tornado emergencies. We address each major comment below and will incorporate revisions to the manuscript.

read point-by-point responses

Referee: [Methods] Methods section: The manuscript must specify whether the VROT and STP values for each of the 89 events were drawn from real-time operational radar scans and SPC model fields available at issuance or from post-event reanalyses. VROT is sensitive to scan timing, elevation, and range, while STP differs between real-time and later products; using non-contemporaneous values directly affects the count of cases meeting both thresholds and thus the claim that forecasters do not follow the criteria in the majority of cases.

Authors: We appreciate this important clarification request. The VROT and STP values in our analysis were obtained from post-event reanalyses using the most complete and consistent datasets available (including archived radar data and SPC environmental fields) to enable uniform comparison across all 89 events. Real-time operational values are not uniformly archived or accessible for every case, which motivated this approach. We will revise the Methods section to explicitly describe the data sources, note the distinction from strictly contemporaneous operational data, and discuss how this choice supports a retrospective assessment of guideline adherence while acknowledging potential differences in forecaster decision-making context. This addition will not change the reported counts or conclusions but will improve transparency. revision: yes
Referee: [Results] Results section: Provide the exact number (rather than 'a handful') of the 89 events that met both VROT and STP thresholds. Also detail the matching procedure between the official emergency list, EF ratings, and fatality counts, including any exclusions or potential mismatches, to substantiate the reported 70%, 55%, and 41% figures.

Authors: We agree that greater precision and methodological detail will strengthen the Results section. Exactly 4 of the 89 events met both the suggested VROT and STP thresholds; we will replace the phrase 'a handful' with this specific count. We will also add a dedicated paragraph describing the matching procedure: The official list of 89 IBW tornado emergencies was obtained directly from NWS records covering 2014–2023. Each event was matched to EF-scale ratings and fatality counts using the NWS Storm Data publication, with cross-verification against SPC tornado event logs by date, time, and geographic coordinates. Manual review resolved any ambiguous cases (e.g., multi-county warnings). No events were excluded from the primary analysis; the 70% EF-3+ coverage, 55% EF-4 coverage, and 41% coverage of 3+ fatality tornadoes were computed directly from these matches. We will note minor potential sources of mismatch, such as delayed fatality reporting or boundary uncertainties, which affect fewer than 5% of cases and do not alter the overall statistics. revision: yes

Circularity Check

0 steps flagged

No circularity: direct empirical counts from external records

full rationale

The paper performs a retrospective enumeration of all 89 IBW tornado emergencies (2014-2023) against published WDTD criteria and official NWS tornado ratings/fatality data. No equations, fitted parameters, derived predictions, or self-referential derivations appear. The central claims (compliance rates, 70% EF-3+ coverage, 55% EF-4 coverage) are simple tallies from independent external sources, with no reduction to the paper's own inputs or prior self-citations. This is a standard self-contained empirical study.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the assumption that the WDTD thresholds constitute the authoritative benchmark and that the NWS database of emergencies, EF ratings, and fatality counts is exhaustive and correctly linked. No free parameters or invented entities are introduced.

axioms (2)

domain assumption The WDTD-suggested VROT and STP thresholds are the appropriate standard for evaluating whether a tornado emergency was issued correctly.
The paper uses non-adherence to these thresholds as its primary negative finding.
domain assumption The official NWS list of 89 tornado emergencies from 2014-2023 together with associated tornado ratings and fatality data is complete and accurately matched.
All reported percentages depend on this completeness.

pith-pipeline@v0.9.0 · 5817 in / 1442 out tokens · 55343 ms · 2026-05-18T03:20:21.720628+00:00 · methodology

Reflecting on a Decade of Formalized Tornado Emergencies

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)