Intermediate efficiency of some weighted goodness-of-fit statistics
Pith reviewed 2026-05-25 18:20 UTC · model grok-4.3
The pith
Weighted goodness-of-fit statistics permit analytic efficiency comparisons to the Kolmogorov-Smirnov test when alternatives place moderate mass in the tails.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
When the alternative distribution allocates a moderately large portion of probability mass toward the tails, the intermediate-efficiency approach yields explicit analytic comparisons between weighted statistics and the unweighted Kolmogorov-Smirnov benchmark, together with reliable quantitative rankings; a slight modification of the Borovkov-Sycheva construction produces an Eicker-Jaeschke statistic competitive with Anderson-Darling.
What carries the argument
Intermediate-efficiency comparison of weighted supremum-type statistics (Anderson-Darling and Eicker-Jaeschke) against the Kolmogorov-Smirnov benchmark under tail-heavy alternatives.
If this is right
- Weighted tests such as Anderson-Darling become preferable to Kolmogorov-Smirnov for detecting moderate tail departures.
- Analytic rather than purely numerical evaluation of relative performance becomes feasible for a range of weighted statistics.
- The modified Eicker-Jaeschke statistic offers a practical alternative to the Anderson-Darling statistic with comparable or better properties.
- Finite-sample behavior aligns with the intermediate-efficiency ordering derived from the analytic comparison.
Where Pith is reading between the lines
- The same intermediate-efficiency technique could be applied to other weighting functions or to alternatives concentrated in the center rather than the tails.
- Practitioners facing data with suspected tail discrepancies could adopt the modified Eicker-Jaeschke statistic as a default over Anderson-Darling without loss of power.
- The approach supplies a concrete benchmark that future efficiency studies of goodness-of-fit procedures could use to calibrate new weighting schemes.
Load-bearing premise
The comparison applies specifically when under the alternative a moderately large portion of probability mass is allocated towards the tails.
What would settle it
A direct calculation or simulation for an alternative that places only a very small or very large fraction of mass in the tails, showing that the predicted efficiency ordering between the weighted statistics and Kolmogorov-Smirnov reverses or disappears.
Figures
read the original abstract
This paper compares the Anderson-Darling and some Eicker-Jaeschke statistics to the classical unweighted Kolmogorov-Smirnov statistic. The goal is to provide a quantitative comparison of such tests and to study real possibilities of using them to detect departures from the hypothesized distribution that occur in the tails. This contribution covers the case when under the alternative a moderately large portion of probability mass is allocated towards the tails. It is demonstrated that the approach allows for tractable, analytic comparison between the given test and the benchmark, and for reliable quantitative evaluation of weighted statistics. Finite sample results illustrate the proposed approach and confirm the theoretical findings. In the course of the investigation we also prove that a slight and natural modification of the solution proposed by Borovkov and Sycheva (1968) leads to a statistic which is a member of Eicker-Jaeschke class and can be considered an attractive competitor of the very popular supremum-type Anderson-Darling statistic.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper develops an intermediate-efficiency framework for comparing the Anderson-Darling statistic and Eicker-Jaeschke weighted goodness-of-fit statistics against the classical Kolmogorov-Smirnov statistic. The analysis is restricted to alternatives that place a moderately large portion of probability mass in the tails; it claims that this framework yields tractable analytic comparisons and reliable quantitative evaluations. Finite-sample simulations are presented to illustrate the results, and a slight modification of the Borovkov-Sycheva (1968) construction is shown to belong to the Eicker-Jaeschke class and to serve as a competitive alternative to the supremum-type Anderson-Darling statistic.
Significance. If the analytic comparisons hold under the stated regime, the work supplies a concrete, quantitative tool for assessing relative performance of weighted tests when tail departures matter. The explicit identification of a modified Borovkov-Sycheva statistic as an Eicker-Jaeschke competitor and the supporting finite-sample results are clear strengths that enhance the practical value of the contribution.
minor comments (3)
- The abstract refers to 'tractable, analytic comparison' without previewing the precise form of the intermediate-efficiency functional; a one-sentence indication of the efficiency measure (e.g., the ratio of sample sizes needed to achieve a given power) would improve readability.
- Notation for the weight functions defining the Eicker-Jaeschke class and the precise statement of the 'slight and natural modification' of Borovkov-Sycheva should be introduced before the main theorems to avoid forward references.
- In the finite-sample section, the number of Monte Carlo replications and the exact sample sizes used should be stated explicitly so that the reported power curves can be reproduced.
Simulated Author's Rebuttal
We thank the referee for the positive evaluation of our manuscript and the recommendation of minor revision. The referee's summary correctly reflects the scope and contributions of the work on intermediate efficiency comparisons for weighted goodness-of-fit statistics under tail alternatives.
Circularity Check
No significant circularity; derivation self-contained in asymptotic theory
full rationale
The paper applies the intermediate-efficiency framework to derive analytic comparisons between Anderson-Darling/Eicker-Jaeschke weighted statistics and the Kolmogorov-Smirnov benchmark under tail-heavy alternatives. All load-bearing steps rest on standard large-sample theory and a new proof that a minor modification of the 1968 Borovkov-Sycheva construction belongs to the Eicker-Jaeschke class; neither step reduces to a fitted parameter renamed as prediction, a self-definitional loop, nor a load-bearing self-citation. The central quantitative evaluation claim therefore remains independent of the paper's own inputs.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Standard regularity conditions of asymptotic statistics underlying intermediate efficiency calculations
Reference graph
Works this paper leans on
-
[1]
Adler, R.J. (1990). An Introduction to Continuity, Extrema, and Related Topics for General Gaussian Processes . Institute of Mathematical Statistics. Lecture Notes- Monograph Series. Vol. 12, Hayward, California
work page 1990
-
[2]
Anderson, T. W. and Darling, D. A. (1952). Asymptotic theory of certain “goodness of fit” criteria based on stochastic processes. Ann. Math. Statist. 23 193-212
work page 1952
-
[3]
Billingsley, P. (1968). Convergence of Probability Measures. Wiley
work page 1968
-
[4]
Borovkov, A. A. and Sycheva, N. M. (1968). On asymptotically optimal non-parametric criteria. Theory Probab. Appl. 13 359-393
work page 1968
-
[5]
Borovkov, A. A. and Sycheva, N. M. (1970). On asymptotically optimal nonparametric criteria. In Nonparametric Techniques in Statistical Inference(M. L. Puri, ed.) 259-266. Cambridge Univ. Press
work page 1970
-
[6]
Cai, T. T., J. Jeng and J. Jin (2011). Optimal detection of heterogeneous and het- eroscedastic mixtures. J. R. Statist. Soc. B 73 629-662
work page 2011
-
[7]
Chicheportiche, R. and Bouchaud, J.-F. (2012). Weighted Kolmogorov-Smirnov test: Accounting for the tails. Physical Review E 86 041115-1 - 041115-6
work page 2012
-
[8]
Cont, R. (2001). Empirical properties of assets: stylized facts and statistical issues. Quant. Fin. 1 223-236
work page 2001
-
[9]
Weighted Kolmogorov Smirnov testing: an alternative for Gene Set Enrichment Analysis
Chrampi, K. and Ycart, B. (2015). Weighted Kolmogorov Smirnov testing: and alter- native for Gene Set Enrichment Analysis. Stat. Appl. Genet. Mol. Biol. 14 279-293; arXiv:1410.1620v1. REFERENCES 33
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[10]
Cs¨ org˝ o, M., Cs¨ org˝ o, S., Horvath, L. and Mason, D. M. (1986). Weighted empirical and quantile processes. Ann. Probab. 14, 31-85
work page 1986
-
[11]
Cs¨ org˝ o, M. and Horvath, L. (1993).Weighted Approximations in Probability and Statis- tics. Wiley
work page 1993
-
[12]
Ditzhaus, M. (2018). Signal detection via Phi-divergences for general mixtures. arXiv: 1803.06519v1
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[13]
Donoho, D. and Jin, J. (2004). Higher criticism for detecting sparse heterogeneous mixtures. Ann. Statist. 32 962-994
work page 2004
-
[14]
Donoho, D. and Jin, J. (2015). Higher criticism for large-scale inference, especially for rare and weak effects. Stat. Sci. 30, 1-25
work page 2015
-
[15]
Eicker, F. (1979). The asymptotic distribution of the suprema of the standardized empirical process. Ann. Statist. 7 116-138
work page 1979
-
[16]
Ermakov, M.S. (2004). On asymptotically efficient statistical inference for moderate deviation probabilities. Theory Probab. Appl. 48, 622-641
work page 2004
-
[17]
Gontscharuk, V., Landwehr, S. and Finner, H. (2016). Goodness of fit tests in terms of local levels with special emphasis on higher criticism tests. Bernoulli 22 1331-1363
work page 2016
-
[18]
Grabchak, M. and Samorodnitsky, G. (2010). Do financial returns have finite or infinite variance? A paradox and an explanation. Quant. Finance 10 883-893
work page 2010
-
[19]
Greenshtein, E. and Park, J. (2012). Robust test for detecting a signal in a high di- mensional sparse normal vector. J. Statist. Plann. Inference 142 1445-1456
work page 2012
-
[20]
Handcock, M. S. and Morris, M. (1999). Relative Distribution Methods in the Social Sciences. Springer, New York
work page 1999
-
[21]
Inglot, T. (1999). Generalized intermediate efficiency of goodness of fit tests. Math. Methods Statist. 8 487-509
work page 1999
-
[22]
Inglot, T. (2019). Intermediate efficiency of tests under heavy-tailed alternatives. arXiv:1902.06622v1 [math.ST]
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[23]
Inglot, T., Kallenberg, W.C.M. and Ledwina, T. (1998). Vanishing shortcoming and asymptotic relative efficiency. Memorandum 1467, Faculty of Mathematical Sciences, Univ. Twente
work page 1998
-
[24]
Inglot, T., Kallenberg, W.C.M. and Ledwina, T. (2000). Vanishing shortcoming and asymptotic relative efficiency. Ann. Statist. 28, 215-238. REFERENCES 34
work page 2000
-
[25]
Inglot, T. and Ledwina, T. (1990). On probabilities of excessive deviations for Kolmogorov-Smirnov, Cram´ er-von Mises and chi-square statistics. Ann. Statist. 18 1491-1495
work page 1990
-
[26]
Inglot, T. and Ledwina, T. (1993). Moderately large deviations and expansions of large deviations for some functionals of weighted empirical process. Ann. Probab. 21 1691- 1705
work page 1993
-
[27]
Inglot, T. and Ledwina, T. (1996). Asymptotic optimality of data driven Neyman’s tests for uniformity. Ann. Statist. 24 1982-2019
work page 1996
-
[28]
Inglot, T. and Ledwina, T. (2006). Intermediate efficiency of some max-type statistics. J. Statist. Plan. Inference. 136 2918-2935
work page 2006
-
[29]
Inglot, T., Ledwina, T. and ´Cmiel, B. (2018). Intermediate efficiency in nonparametric testing problems with an application to some weighted statistics. ESAIM: Probab. Stat. accepted; for a preliminary variant see arXiv:1806.02020v1 [math.ST]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[30]
Ingster, Y. (1997). Some problems of hypothesis testing leading to infinitely divisible distributions. Math. Methods Statist. 6, 47-69
work page 1997
-
[31]
Ivchenko, G. I. and Mirakhmedov Sh. A. (1995). Large deviations and intermediate efficiency of decomposable statistics in a multinomial scheme. Math. Methods Statist. 4 294-311
work page 1995
-
[32]
Jager, L. and Wellner, J. A. (2004). On the “Poisson boundaries” of the family of weighted Kolmogorov statistics. In Festschrift for Herman Rubin (A. DasGupta, ed.) 319-331. IMS, Beachwood, OH
work page 2004
-
[33]
Jager, L. and Wellner, J. A. (2007). Goodness-of-fit tests via phi-divergences. Ann. Statist. 35 2018-2053
work page 2007
-
[34]
Janssen A. (1995). Principal component decomposition of non-parametric tests. Probab. Theory Relat. Fields 101 193-209
work page 1995
-
[35]
Jin, J., Starck, J. -L., Donoho, D. L., Aghanim, N., and Forni, O. (2005), Cosmological non-Gaussian signature detection: Comparing performance of different statistical tests, EURASIP J. Appl. Signal Processing 15 2470-2485
work page 2005
-
[36]
Kallenberg, W. C. M. (1983). Intermediate efficiency, theory and examples. Ann. Statist. 11 1401-1420
work page 1983
-
[37]
Lehmann, E. L. (1953). The power of rank tests. Ann. Math. Staist. 24 23-43
work page 1953
-
[38]
Ledwina, T. and Wy lupek, G. (2012). Nonparametric tests for first order stochastic dominance. TEST 21, 730-756. REFERENCES 35
work page 2012
-
[39]
Li, J. and Siegmund D. (2015). Higher criticism: p-values and criticism. Ann. Statist. 43 1323-1350
work page 2015
-
[40]
Lifshits, M. A. (1995). Gaussian Random Functions. Springer Science+Business Media, Dordrecht
work page 1995
-
[41]
Lockhart, R. A. (1991). Overweight tails are inefficient. Ann. Statist. 19 2254-2258
work page 1991
-
[42]
Mason, D. M. (1985). Some large deviation results for weighted empirical processes. Statistics & Decisions 2 89-98
work page 1985
-
[43]
Mason, D. M. and Schuenemeyer, J. H. (1983). A modified Kolmogorov-Smirnov test sensitive to tail alternatives. Ann. Statist. 11 933-946
work page 1983
-
[44]
Mason, D. M. and Schuenemeyer, J. H. (1992). Correction: A modified Kolmogorov- Smirnov test sensitive to tail alternatives. Ann. Statist. 20 620-621
work page 1992
-
[45]
Merlev` ede, F. and Peligrad, M. (2009). Functional moderate deviations for triangular arrays and applications. ALEA Lat. Am. J. Probab. Math. Stat. 5 3-20
work page 2009
-
[46]
Milbrodt, H. and Strasser, H. (1990). On the asymptotic power of the two-sided Kolmogorov-Smirnov test. J. Statist. Plann. Inference 26 1-23
work page 1990
-
[47]
Moscovich, A., Nadler, B. and Spiegelman, C. (2016). On the exact Berk-Jones statistics and their p-value calculation. Electron. J. Stat. 10 2329-2354
work page 2016
-
[48]
Pearson, E. S., D’Agostino, R. B. and Bowman, K. O. (1977). Tests for departures from normality: Comparison of powers. Biometrika 64 231-246
work page 1977
-
[49]
R´ ev´ esz, P. (1982). A joint study of the Kolmogorov-Smirnov and the Eicker-Jaeschke statistics. Statistics & Decisions 1, 57-65
work page 1982
-
[50]
Stepanova, N. and Pavlenko, T. (2018). Goodness-of-fit tests based on sup-functionals of weighted empirical processess. Theory Probab. Appl. 63, 292-317
work page 2018
-
[51]
Thas, O. (2010). Comparing Distributions. Springer, New York
work page 2010
-
[52]
van der Vaart, W. and Wellner, J.A. (2000). Weak Convergence and Empirical Pro- cesses. Springer Series in Statistics
work page 2000
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.