pith. sign in

arxiv: 2605.11147 · v3 · pith:I2IYMHMYnew · submitted 2026-05-11 · ✦ hep-ph

An approximate formula for the entropy of the negative binomial distribution

Pith reviewed 2026-05-20 21:34 UTC · model grok-4.3

classification ✦ hep-ph
keywords negative binomial distributionShannon entropyapproximate formulacharged particle multiplicitieshigh energy physicsmultiplicity distributionsentropy approximationparticle production
0
0 comments X

The pith

An approximate formula estimates the Shannon entropy of the negative binomial distribution within 20 percent for extreme parameter values.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper investigates series and integral representations of the Shannon entropy for the negative binomial distribution, which has no known closed form. It derives a practical approximation that stays within roughly 20 percent of the exact value when the distribution parameters reach extreme regimes. This matters for studies of charged particle multiplicities in high-energy collisions, where entropy quantifies the spread of multiplicity data and exact calculations can be costly. A sympathetic reader would see it as a tool to handle limiting cases without repeated numerical work.

Core claim

By approximating one of the existing series or integral representations, the authors obtain a formula for the entropy of the negative binomial distribution that deviates by at most about 20 percent from the true value when the parameters are taken to their extreme limits.

What carries the argument

An approximate closed-form expression obtained by simplifying a series or integral representation of the Shannon entropy for the negative binomial distribution.

If this is right

  • Entropy estimates become feasible for multiplicity data in limiting regimes without full numerical summation.
  • Model comparisons that rely on entropy differences can be performed more rapidly across wide parameter ranges.
  • Theoretical predictions for particle production in extreme kinematic conditions gain a lightweight entropy observable.
  • Parameter fitting routines for negative binomial distributions can incorporate entropy constraints at lower computational cost.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same approximation strategy could be tested on related distributions such as the Poisson or binomial to see if similar accuracy holds.
  • In experimental analyses, the formula might allow real-time entropy monitoring during multiplicity unfolding procedures.
  • If the 20 percent tolerance proves acceptable in practice, it opens the door to analytic derivatives of entropy with respect to the NBD parameters.

Load-bearing premise

One of the existing series or integral representations of the NBD entropy admits an approximation accurate to within 20 percent for extreme parameter values.

What would settle it

Compute the exact entropy numerically from the full series representation at several extreme parameter points and check whether the proposed formula stays inside 20 percent of those values.

Figures

Figures reproduced from arXiv: 2605.11147 by S\'andor L\"ok\"os.

Figure 1
Figure 1. Figure 1: FIG. 1. The [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗
read the original abstract

Recent theoretical developments revived the interest in charged particle multiplicities and their wide-spread parametrization, the negative binomial distribution (NBD). The central observable of the studies is the Shannon entropy of the NBD. A closed form is not known, however, there are representations with special series and integrals. In this note, we will investigate one of these and give an approximate formula for the entropy that is valid up to $\sim$20\% deviation from the exact value for extreme values of the NBD parameters.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript investigates one of the series or integral representations of the Shannon entropy of the negative binomial distribution (NBD) and derives an approximate formula claimed to remain within ∼20% of the exact value for extreme values of the NBD parameters.

Significance. If the approximation and its error bound are established, the result would provide a practical tool for entropy calculations in high-energy physics analyses that employ the NBD to parametrize charged-particle multiplicity distributions, where closed-form expressions are unavailable.

major comments (2)
  1. [Abstract] Abstract: the central claim that the approximation is 'valid up to ∼20% deviation from the exact value for extreme values of the NBD parameters' does not define the extreme regime (e.g., explicit ranges or limiting behavior for parameters r and p, or equivalently the mean and variance). This definition is load-bearing for assessing whether the stated accuracy holds throughout the claimed domain.
  2. [Derivation] Derivation section: the step that produces the approximate formula from the chosen series/integral representation must be accompanied by an explicit error bound or asymptotic analysis demonstrating that the truncation error remains ≤20% for all extreme parameter values; numerical checks at isolated points are insufficient to establish the bound over the full regime (e.g., r→0 at fixed mean or r→∞).
minor comments (1)
  1. Add a plot or table of relative error versus the NBD parameters across the extreme regime to make the 20% claim visually verifiable.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading of our manuscript and the constructive comments. We address each major comment below and outline the revisions we intend to implement.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that the approximation is 'valid up to ∼20% deviation from the exact value for extreme values of the NBD parameters' does not define the extreme regime (e.g., explicit ranges or limiting behavior for parameters r and p, or equivalently the mean and variance). This definition is load-bearing for assessing whether the stated accuracy holds throughout the claimed domain.

    Authors: We agree that a precise definition of the 'extreme' regime is necessary. In the revised version we will explicitly state the parameter domain, for example 0.01 ≤ r ≤ 1 with mean multiplicity μ ≥ 10 (corresponding to p = r/(r+μ) approaching the relevant limits), and confirm that the stated 20% deviation holds throughout this region by additional targeted checks. revision: yes

  2. Referee: [Derivation] Derivation section: the step that produces the approximate formula from the chosen series/integral representation must be accompanied by an explicit error bound or asymptotic analysis demonstrating that the truncation error remains ≤20% for all extreme parameter values; numerical checks at isolated points are insufficient to establish the bound over the full regime (e.g., r→0 at fixed mean or r→∞).

    Authors: The approximation arises from truncating the integral representation after the leading contribution that dominates when r is small or large at fixed mean. While the original manuscript relied on numerical verification at representative points, we acknowledge that this is not a substitute for an explicit bound. In the revision we will add a short asymptotic analysis of the truncation error in the limits r→0 (fixed mean) and r→∞, together with a denser numerical scan confirming that the relative deviation stays below 20% across the full extreme regime. revision: yes

Circularity Check

0 steps flagged

No circularity: approximation derived from existing representations without reduction to fitted inputs or self-citations

full rationale

The paper starts from known series and integral representations of the NBD entropy (external to the present work), investigates one of them, and produces an approximate closed-form expression whose claimed accuracy is checked against the exact entropy. No step equates the output formula to a parameter fitted from the target result itself, renames a known pattern, or relies on a load-bearing self-citation or uniqueness theorem imported from the authors' prior work. The derivation chain remains independent of the final approximation and is therefore self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the existence of series and integral representations for NBD entropy and on the premise that one such representation admits a useful approximation at extreme parameters.

axioms (1)
  • domain assumption The Shannon entropy of the negative binomial distribution admits representations via special series and integrals.
    Invoked in the abstract as the starting point for the investigation.

pith-pipeline@v0.9.0 · 5600 in / 1160 out tokens · 91503 ms · 2026-05-20T21:34:11.966453+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

23 extracted references · 23 canonical work pages · 11 internal anchors

  1. [1]

    Charged Particle Multiplicities in Deep Inelastic Scattering at HERA

    S. Aidet al.(H1), Charged particle multiplicities in deep inelastic scattering at HERA, Z. Phys. C72, 573 (1996), arXiv:hep-ex/9608011

  2. [2]

    Andreevet al.(H1), Measurement of charged particle multiplicity distributions in DIS at HERA and its implication to entanglement entropy of partons, Eur

    V. Andreevet al.(H1), Measurement of charged particle multiplicity distributions in DIS at HERA and its implication to entanglement entropy of partons, Eur. Phys. J. C81, 212 (2021), arXiv:2011.01812 [hep-ex]

  3. [3]

    Measurement of charged particle multiplicities and densities in $pp$ collisions at $\sqrt{s}=7\;$TeV in the forward region

    R. Aaijet al.(LHCb), Measurement of charged particle multiplicities and densities inppcol- lisions at √s= 7 TeV in the forward region, Eur. Phys. J. C74, 2888 (2014), arXiv:1402.4430 [hep-ex]

  4. [4]

    Charged-particle multiplicities in proton-proton collisions at $\sqrt{s}$ = 0.9 to 8 TeV

    J. Adamet al.(ALICE), Charged-particle multiplicities in proton–proton collisions at √s= 0.9 to 8 TeV, Eur. Phys. J. C77, 33 (2017), arXiv:1509.07541 [nucl-ex]

  5. [5]

    Charged-particle multiplicity distributions over a wide pseudorapidity range in proton-proton collisions at $\mathbf{\sqrt{s}=}$ 0.9, 7 and 8 TeV

    S. Acharyaet al.(ALICE), Charged-particle multiplicity distributions over a wide pseudora- pidity range in proton-proton collisions at √s= 0.9, 7, and 8 TeV, Eur. Phys. J. C77, 852 (2017), arXiv:1708.01435 [hep-ex]

  6. [6]

    Charged-particle multiplicities in pp interactions measured with the ATLAS detector at the LHC

    G. Aadet al.(ATLAS), Charged-particle multiplicities in pp interactions measured with the ATLAS detector at the LHC, New J. Phys.13, 053033 (2011), arXiv:1012.5104 [hep-ex]

  7. [7]

    Charged-particle distributions in $pp$ interactions at $\sqrt{s}=8$ TeV measured with the ATLAS detector

    G. Aadet al.(ATLAS), Charged-particle distributions inppinteractions at √s= 8 TeV measured with the ATLAS detector, Eur. Phys. J. C76, 403 (2016), arXiv:1603.02439 [hep- ex]

  8. [8]

    Charged-particle distributions in $\sqrt{s}=13$ TeV $pp$ interactions measured with the ATLAS detector at the LHC

    G. Aadet al.(ATLAS), Charged-particle distributions in √s= 13 TeV pp interactions mea- sured with the ATLAS detector, Phys. Lett. B758, 67 (2016), arXiv:1602.01633 [hep-ex]

  9. [9]

    Charged particle multiplicities in pp interactions at sqrt(s) = 0.9, 2.36, and 7 TeV

    V. Khachatryanet al.(CMS), Charged Particle Multiplicities inppInteractions at √s= 0.9, 2.36, and 7 TeV, JHEP01, 079, arXiv:1011.5531 [hep-ex]

  10. [10]

    R. E. Ansorgeet al.(UA5), Charged Particle Multiplicity Distributions at 200-GeV and 900- GeV Center-Of-Mass Energy, Z. Phys. C43, 357 (1989)

  11. [11]

    Charged hadron multiplicity fluctuations in Au+Au and Cu+Cu collisions from sqrt(s_NN) = 22.5 to 200 GeV

    A. Adareet al.(PHENIX), Charged hadron multiplicity fluctuations in Au+Au and Cu+Cu collisions from √sN N = 22.5 to 200 GeV, Phys. Rev. C78, 044902 (2008), arXiv:0805.1521 [nucl-ex]. 6

  12. [12]

    D. E. Kharzeev and E. M. Levin, Deep inelastic scattering as a probe of entanglement, Phys. Rev. D95, 114008 (2017), arXiv:1702.03489 [hep-ph]

  13. [13]

    D. E. Kharzeev, Quantum information approach to high energy interactions, Phil. Trans. A. Math. Phys. Eng. Sci.380, 20210063 (2021), arXiv:2108.08792 [hep-ph]

  14. [14]

    D. E. Kharzeev and E. Levin, Deep inelastic scattering as a probe of entanglement: Con- fronting experimental data, Phys. Rev. D104, L031503 (2021), arXiv:2102.09773 [hep-ph]

  15. [15]

    Kutak, Entanglement entropy of proton and its relation to thermodynamics entropy (2023), arXiv:2310.18510 [hep-ph]

    K. Kutak, Entanglement entropy of proton and its relation to thermodynamics entropy (2023), arXiv:2310.18510 [hep-ph]

  16. [16]

    Kutak and M

    K. Kutak and M. Prasza lowicz, Entropy, purity and gluon cascades at high energies with recombinations and transitions to vacuum (2025), arXiv:2508.13781 [hep-ph]

  17. [17]

    Hentschinski and K

    M. Hentschinski and K. Kutak, Evidence for the maximally entangled low x proton in Deep Inelastic Scattering from H1 data, Eur. Phys. J. C82, 111 (2022), [Erratum: Eur.Phys.J.C 83, 1147 (2023)], arXiv:2110.06156 [hep-ph]

  18. [18]

    Kutak and S

    K. Kutak and S. L¨ ok¨ os, Entropy and multiplicity of hadrons in the high energy limit within dipole cascade models, Phys. Rev. D112, 096017 (2025), arXiv:2509.07898 [hep-ph]

  19. [19]

    L¨ ok¨ os, Charged particle multiplicity distributions derived from the Principle of Maximal Entropy, Acta Phys

    S. L¨ ok¨ os, Charged particle multiplicity distributions derived from the Principle of Maximal Entropy, Acta Phys. Pol. B Proc. Suppl18, 5 (2025), arXiv:2505.23491 [hep-ph]

  20. [20]

    Expressions for the Entropy of Binomial-Type Distributions

    M. Cheraghchi, Expressions for the entropy of binomial-type distributions, CoRR abs/1708.06394(2017), 1708.06394

  21. [21]

    Brun and F

    R. Brun and F. Rademakers, ROOT: An object oriented data analysis framework, Nucl. Instrum. Meth. A389, 81 (1997)

  22. [22]

    Gough, M

    B. Gough, M. Galassi,et al.,GNU scientific library reference manual(Network Theory Ltd, 2009)

  23. [23]

    Scipy gamma function documentation (2025). 7