pith. sign in

arxiv: 2606.22712 · v1 · pith:7LKJ27RQnew · submitted 2026-06-21 · 📊 stat.ME

Unbiased estimation of normalized scale-invariant indices under the gamma distribution

Pith reviewed 2026-06-26 09:29 UTC · model grok-4.3

classification 📊 stat.ME
keywords unbiased estimationgamma distributionnormalized scale-invariant indicesU-statisticGini coefficientDirichlet proportionsinequality measures
0
0 comments X

The pith

A U-statistic based on sum-proportion independence yields an unbiased estimator for any normalized scale-invariant index when data follow a gamma distribution.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper defines normalized scale-invariant indices generated by homogeneous functions, covering the Gini coefficient, generalized Gini indices, entropy measures, and variability indices. Explicit formulas for these indices are derived when the underlying population is gamma. Exploiting the exact independence of the total sum and the Dirichlet proportions that arises under gamma, a U-statistic is constructed that remains unbiased for every index in the class. Monte Carlo experiments illustrate the estimator's behavior under gamma and generalized gamma data, and the method is applied to GDP per capita observations.

Core claim

When observations are drawn from a gamma distribution, the independence between the total sum and the associated Dirichlet proportions permits construction of a U-statistic that is unbiased for every normalized scale-invariant index generated by a homogeneous function.

What carries the argument

The U-statistic that exploits independence of the total sum and Dirichlet proportions under gamma populations.

If this is right

  • The same estimator applies without modification to the Gini coefficient, generalized Gini indices, entropy-based measures, and variability indices.
  • Closed-form expressions for the indices themselves exist under gamma populations.
  • Monte Carlo results show the estimator maintains low bias and stable variance across choices of index.
  • The construction continues to perform well in simulations when the data follow a generalized gamma distribution.
  • The indices and their unbiased estimates can be computed directly on real datasets such as GDP per capita across countries.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If gamma is only an approximation, the estimator may still reduce finite-sample bias relative to plug-in methods that ignore the independence structure.
  • The same independence device could be examined for other scale families where analogous decompositions hold approximately.
  • Direct comparison of the U-statistic against the sample index on deliberately non-gamma data would measure how quickly unbiasedness degrades.
  • The framework may extend to other economic or ecological indices that are homogeneous and scale-invariant.

Load-bearing premise

The data must be drawn from a gamma distribution so that the total sum and Dirichlet proportions are exactly independent.

What would settle it

Calculate the expected value of the proposed U-statistic on gamma-distributed samples and verify whether it equals the true index value for several different homogeneous functions.

Figures

Figures reproduced from arXiv: 2606.22712 by Felipe Quintino, Helton Saulo, Roberto Vila.

Figure 1
Figure 1. Figure 1: Spatial distribution of income levels in the Americas. [PITH_FULL_IMAGE:figures/full_fig_p011_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Histograms with fitted Gamma density (left), empirical CDF with fitted Gamma distribution [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗
read the original abstract

We introduce a broad class of normalized scale-invariant indices (NPRIs) generated by homogeneous functions and encompassing several well-known measures, including the Gini coefficient, generalized Gini indices, entropy-based measures, and variability indices. Explicit expressions are obtained for these indices under gamma populations. Exploiting the independence between the total sum and the associated Dirichlet proportions, we derive a simple unbiased estimator based on a U-statistic. The resulting estimator is shown to be unbiased for any NPRI when the underlying population follows a gamma distribution. Several examples are provided to illustrate the general theory. A Monte Carlo simulation study is carried out that shows the good performance of the unbiased estimator in several scenarios of index choices. We also present a simulation study that goes beyond the established theory by examining the estimator's applicability in settings characterized by a generalized gamma distribution. We evaluate the effectiveness of the NPRIs and their estimates in modeling a real-world dataset related to gross domestic product per capita in the Americas.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 2 minor

Summary. The manuscript defines a class of normalized scale-invariant indices (NPRIs) via homogeneous functions (encompassing Gini, generalized Gini, entropy, and variability measures), derives explicit expressions for these indices when the population is gamma, and constructs an unbiased U-statistic estimator that exploits the classical independence of the sample sum and the Dirichlet vector of proportions. The central result is that this estimator is unbiased for any NPRI under i.i.d. gamma sampling. The paper supplies examples, Monte Carlo illustrations under gamma and generalized gamma, and an application to GDP-per-capita data.

Significance. If the derivation holds, the work supplies a parameter-free, unbiased estimator for an entire family of scale-invariant functionals under the gamma model by direct application of the gamma-Dirichlet independence and U-statistic theory. This is a clean methodological contribution that avoids plug-in estimation of shape or scale parameters and directly yields unbiasedness for the listed indices once the homogeneous-function representation is verified.

minor comments (2)
  1. The Monte Carlo section examines performance under the generalized gamma distribution, which lies outside the gamma theory; a short paragraph explaining the motivation for this extension (e.g., robustness check) would improve readability.
  2. Notation for the homogeneous-function class and the associated kernel of the U-statistic could be introduced with one additional displayed equation in the general-theory section to make the transition from the abstract definition to the estimator fully explicit.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of the manuscript, the accurate summary of its contributions, and the recommendation for minor revision. No specific major comments were listed in the report.

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's central derivation establishes an unbiased U-statistic estimator for any NPRI (defined via homogeneous functions) under i.i.d. gamma sampling by invoking the standard gamma property that the total sum is independent of the Dirichlet vector of proportions; this independence directly implies that the expectation of any scale-invariant functional of the proportions equals the expectation of a symmetric kernel, so the U-statistic is unbiased by the usual U-statistic theory. No step reduces to a fitted parameter renamed as prediction, a self-definitional loop, or a load-bearing self-citation; the gamma independence is an external, well-known fact independent of the present work, and the Monte Carlo and data sections are presented only as illustrations. The logical chain (homogeneous function → scale invariance → Dirichlet representation → U-statistic kernel) is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Central claim depends on the gamma distributional assumption and the definition of NPRIs via homogeneous functions; no free parameters or invented entities are introduced in the abstract.

axioms (2)
  • domain assumption Observations are i.i.d. from a gamma distribution, enabling independence of total sum and Dirichlet proportions.
    Invoked to derive the unbiased U-statistic estimator.
  • domain assumption The indices are generated by homogeneous functions of a certain degree.
    Used to define the broad class of NPRIs.

pith-pipeline@v0.9.1-grok · 5694 in / 1165 out tokens · 18137 ms · 2026-06-26T09:29:59.419345+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

19 extracted references · 1 canonical work pages

  1. [1]

    A. B. Atkinson, On the measurement of inequality, Journal of Economic Theory, 2(3):244--263, 1970

  2. [2]

    P. S. Bullen, Handbook of Means and Their Inequalities, Springer, Dordrecht, 2003

  3. [3]

    T. M. Cover and J. A. Thomas, Elements of Information Theory, 2nd ed., Wiley, Hoboken, 2006

  4. [4]

    F. A. Cowell. Measuring Inequality. 3rd ed., Oxford University Press, Oxford, 2011

  5. [5]

    H. A. David and H. N. Nagaraja, Order Statistics, 3rd ed., Wiley, Hoboken, 2003

  6. [6]

    J. M. Gavilan-Ruiz, \'A. Ruiz-G\'andara, F.J. Ortega-Irizo and L. Gonzalez-Abril, Some Notes on the Gini Index and New Inequality Measures: The nth Gini Index. Stats , 7:1354--1365, 2024

  7. [7]

    Gini, Variabilit\`a e mutabilit\`a, Studi Economico-Giuridici della R

    C. Gini, Variabilit\`a e mutabilit\`a, Studi Economico-Giuridici della R. Universit\`a di Cagliari, 1912

  8. [8]

    C. Gini. On the measure of concentration with special reference to income and statistics. Colorado College Publication, General Series, 208 , 73--79, 1936

  9. [9]

    N. Henze. Asymptotic Stochastics: An Introduction with a View towards Statistics. Springer Berlin, Heidelberg, 2024

  10. [10]

    Hoeffding

    W. Hoeffding. A class of statistics with asymptotically normal distribution. Annals of Mathematical Statistics , 19, 293–325, 1948

  11. [11]

    A. J. Lee. U-Statistics. Marcel Dekker, New York, 1990

  12. [12]

    A. W. Marshall, I. Olkin and B. C. Arnold, Inequalities: Theory of Majorization and Its Applications, 2nd ed., Springer, New York, 2011

  13. [13]

    J. E. Mosimann, On the Compound Multinomial Distribution, the Multivariate Beta-Distribution, and Correlations among Proportions. Biometrika , 49:65--82, 1962

  14. [14]

    Shaked and J

    M. Shaked and J. G. Shanthikumar, Stochastic Orders, Springer, New York, 2007

  15. [15]

    C. E. Shannon, A mathematical theory of communication, Bell System Technical Journal, 27:379--423, 623--656, 1948

  16. [16]

    Vila and H

    R. Vila and H. Saulo, The mth Gini index estimator: Unbiasedness for gamma populations. The Journal of Economic Inequality , 2026. https://doi.org/10.1007/s10888-025-09715-3

  17. [17]

    Vila and H

    R. Vila and H. Saulo, An unbiased estimator of a novel extended mth Gini index for gamma distributed populations. Journal of Computational and Applied Mathematics , 482:117320, 2026

  18. [18]

    Vila and H

    R. Vila and H. Saulo, Bias analysis of a linear order-statistic inequality index estimator: Unbiasedness under gamma populations. Preprint , 2026. https://arxiv.org/abs/2602.14861

  19. [19]

    Yitzhaki and E

    S. Yitzhaki and E. Schechtman, The Gini Methodology: A Primer on a Statistical Methodology, Springer, New York, 2013