pith. machine review for the scientific record. sign in

arxiv: 2601.14013 · v2 · submitted 2026-01-20 · 🧮 math.ST · stat.TH

Recognition: 2 theorem links

· Lean Theorem

Robustness for free: asymptotic size and power of max-tests in high dimensions

Authors on Pith no claims yet

Pith reviewed 2026-05-16 12:45 UTC · model grok-4.3

classification 🧮 math.ST stat.TH
keywords high-dimensional mean testingadversarial contaminationquantile winsorizingmax-testasymptotic powerheavy tailsbootstrap critical valuesrobustness
0
0 comments X

The pith

A quantile-winsorized max-test achieves robustness to adversarial contamination in high dimensions without sacrificing asymptotic power.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Standard max-tests based on sample averages lose size control when observations suffer heavy tails or adversarial contamination. The paper replaces averages with quantile-winsorized versions inside the max statistic. This version controls asymptotic size with only m>2 moments and lets dimension grow exponentially in sample size. Its asymptotic power equals that of the ordinary max-test precisely when the ordinary test is valid. Bootstrap critical values never reduce power and can raise it in strongly correlated settings.

Core claim

The max-test based on quantile-winsorized observations controls asymptotic size under adversarial contamination and heavy tails while requiring only m>2 moments and allowing the dimension to grow exponentially with sample size. Its asymptotic power function coincides with that of the standard max-test under the stricter conditions that validate the standard test, so that robustness is obtained at no first-order asymptotic power cost.

What carries the argument

The quantile-winsorized max-test statistic, formed by replacing each coordinate average with a winsorized version at fixed quantiles before taking the coordinate-wise maximum.

If this is right

  • The robust test maintains asymptotic size control under arbitrary adversarial contamination that respects the winsorizing bounds.
  • Asymptotic power equals the standard max-test's power exactly when the standard test is valid.
  • Bootstrap critical values never lower first-order asymptotic power and strictly raise it in designs with extreme coordinate correlations.
  • Dimension may grow exponentially with sample size while size and power characterizations remain valid.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same winsorizing device could be inserted into other coordinate-wise high-dimensional statistics without changing their limiting power under clean data.
  • In practice, analysts facing possible contamination can default to the robust version with no detectable power penalty on uncontaminated data.
  • The result suggests that careful truncation can neutralize worst-case contamination while preserving the exact local power envelope of the untruncated procedure.

Load-bearing premise

The data have at least three moments and any adversarial contamination stays inside the quantile bounds used for winsorizing.

What would settle it

A high-dimensional Monte Carlo study with m=2.5 moments and fixed adversarial contamination in which the robust test's empirical rejection rate under the alternative falls measurably below the standard max-test's rate while both control size.

read the original abstract

Allowing for adversarial contamination and heavy tails, we study testing whether the mean of a high-dimensional random vector equals zero. Because standard max-tests based on sample averages are highly non-robust, we propose a max-test based on quantile-winsorized observations. The test controls asymptotic size under adversarial contamination and only requires $m>2$ moments, while allowing dimension to grow exponentially with sample size. We fully characterize its asymptotic power function. Comparing with the standard max-test, for which we also derive a power characterization as a benchmark, we show that robustness is obtained for free: under the stronger conditions that make the standard max-test valid, our robust test has identical asymptotic power. We also study the role of bootstrap critical values, showing that their use never decreases power, can strictly improve asymptotic power in extremely correlated designs, but often has no first-order asymptotic effect.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The manuscript studies testing the mean of a high-dimensional vector equal to zero under adversarial contamination and heavy tails. It proposes a max-test based on quantile-winsorized observations that controls asymptotic size with only m>2 moments while allowing dimension p to grow exponentially in sample size n. The paper fully characterizes the asymptotic power function of the proposed test and, as a benchmark, derives the corresponding characterization for the standard (non-robust) max-test. It concludes that robustness is obtained for free: under the stronger conditions validating the standard max-test, the quantile-winsorized version has identical asymptotic power. Bootstrap critical values are also analyzed, with the result that they never decrease power and can strictly improve it in highly correlated designs.

Significance. If the derivations hold, the result is significant for high-dimensional inference. It supplies explicit power characterizations that permit direct comparison between robust and non-robust procedures, shows that exponential dimension growth is compatible with size control under contamination, and demonstrates that a simple winsorization step can restore robustness without first-order power loss. The bootstrap analysis adds practical insight. These features address a genuine gap between theoretical robustness guarantees and the power requirements of high-dimensional testing.

major comments (1)
  1. [§3] §3 (asymptotic power characterization): the claim that the robust test has identical limiting power to the standard max-test under the stronger (no-contamination, higher-moment) regime requires that any centering shift induced by quantile winsorization at fixed levels α>0 is asymptotically negligible uniformly over p=exp(o(n)) coordinates. The tail-integral bias per coordinate remains after normalization and can perturb the extreme-value limit of the maximum; the manuscript must either let the winsorizing proportion vanish with n or supply the uniform negligibility argument that keeps the limiting power function unchanged.
minor comments (2)
  1. [§2] The precise definition of the winsorized observations (including how the quantiles are estimated) and the exact normalization constants used for the max-statistic should be stated explicitly before the main theorems.
  2. [Simulation section] Figure 1 (or the corresponding simulation panel) would benefit from an additional curve showing the standard max-test under contamination to visually confirm the size distortion that the robust version corrects.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. We address the major comment below and believe the point can be resolved by elaborating the existing argument.

read point-by-point responses
  1. Referee: [§3] §3 (asymptotic power characterization): the claim that the robust test has identical limiting power to the standard max-test under the stronger (no-contamination, higher-moment) regime requires that any centering shift induced by quantile winsorization at fixed levels α>0 is asymptotically negligible uniformly over p=exp(o(n)) coordinates. The tail-integral bias per coordinate remains after normalization and can perturb the extreme-value limit of the maximum; the manuscript must either let the winsorizing proportion vanish with n or supply the uniform negligibility argument that keeps the limiting power function unchanged.

    Authors: We appreciate the referee's observation on the potential centering bias. Under the stronger regime (no contamination and m>4 moments) that validates the standard max-test, the fixed-α quantile-winsorization bias per coordinate is of smaller order than (log p)^{-1/2} uniformly in p=exp(o(n)). This follows from the higher-moment integrability controlling the tail integrals beyond the fixed quantiles; after centering and scaling, the bias term vanishes in the extreme-value limit and does not alter the Gumbel distribution or the power function. We will add an explicit lemma establishing this uniform negligibility in the revised §3 to make the argument fully transparent. revision: yes

Circularity Check

0 steps flagged

No significant circularity: derivations rely on standard extreme-value asymptotics without self-referential reductions

full rationale

The paper derives asymptotic size and power functions for the standard max-test and the proposed quantile-winsorized version from first principles under high-dimensional regimes with m>2 moments. The 'robustness for free' claim is obtained by direct comparison of the two independently characterized limiting distributions under the stronger conditions that validate the non-robust test; no parameter is fitted to data and then relabeled as a prediction, no ansatz is smuggled via self-citation, and no uniqueness theorem is imported from the authors' prior work to force the result. The equivalence holds by explicit limiting expressions rather than by construction or renaming of known patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The claims rest on standard moment and asymptotic-regime assumptions common to high-dimensional statistics; no new free parameters or invented entities are introduced.

axioms (2)
  • domain assumption Observations possess at least m>2 moments
    Invoked to guarantee asymptotic size control under heavy tails and adversarial contamination.
  • domain assumption High-dimensional regime with dimension growing exponentially in sample size
    Required for the max-test limiting behavior described in the abstract.

pith-pipeline@v0.9.0 · 5447 in / 1321 out tokens · 61337 ms · 2026-05-16T12:45:50.640885+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.