An approximate formula for the entropy of the negative binomial distribution
Pith reviewed 2026-05-20 21:34 UTC · model grok-4.3
The pith
An approximate formula estimates the Shannon entropy of the negative binomial distribution within 20 percent for extreme parameter values.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By approximating one of the existing series or integral representations, the authors obtain a formula for the entropy of the negative binomial distribution that deviates by at most about 20 percent from the true value when the parameters are taken to their extreme limits.
What carries the argument
An approximate closed-form expression obtained by simplifying a series or integral representation of the Shannon entropy for the negative binomial distribution.
If this is right
- Entropy estimates become feasible for multiplicity data in limiting regimes without full numerical summation.
- Model comparisons that rely on entropy differences can be performed more rapidly across wide parameter ranges.
- Theoretical predictions for particle production in extreme kinematic conditions gain a lightweight entropy observable.
- Parameter fitting routines for negative binomial distributions can incorporate entropy constraints at lower computational cost.
Where Pith is reading between the lines
- The same approximation strategy could be tested on related distributions such as the Poisson or binomial to see if similar accuracy holds.
- In experimental analyses, the formula might allow real-time entropy monitoring during multiplicity unfolding procedures.
- If the 20 percent tolerance proves acceptable in practice, it opens the door to analytic derivatives of entropy with respect to the NBD parameters.
Load-bearing premise
One of the existing series or integral representations of the NBD entropy admits an approximation accurate to within 20 percent for extreme parameter values.
What would settle it
Compute the exact entropy numerically from the full series representation at several extreme parameter points and check whether the proposed formula stays inside 20 percent of those values.
Figures
read the original abstract
Recent theoretical developments revived the interest in charged particle multiplicities and their wide-spread parametrization, the negative binomial distribution (NBD). The central observable of the studies is the Shannon entropy of the NBD. A closed form is not known, however, there are representations with special series and integrals. In this note, we will investigate one of these and give an approximate formula for the entropy that is valid up to $\sim$20\% deviation from the exact value for extreme values of the NBD parameters.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript investigates one of the series or integral representations of the Shannon entropy of the negative binomial distribution (NBD) and derives an approximate formula claimed to remain within ∼20% of the exact value for extreme values of the NBD parameters.
Significance. If the approximation and its error bound are established, the result would provide a practical tool for entropy calculations in high-energy physics analyses that employ the NBD to parametrize charged-particle multiplicity distributions, where closed-form expressions are unavailable.
major comments (2)
- [Abstract] Abstract: the central claim that the approximation is 'valid up to ∼20% deviation from the exact value for extreme values of the NBD parameters' does not define the extreme regime (e.g., explicit ranges or limiting behavior for parameters r and p, or equivalently the mean and variance). This definition is load-bearing for assessing whether the stated accuracy holds throughout the claimed domain.
- [Derivation] Derivation section: the step that produces the approximate formula from the chosen series/integral representation must be accompanied by an explicit error bound or asymptotic analysis demonstrating that the truncation error remains ≤20% for all extreme parameter values; numerical checks at isolated points are insufficient to establish the bound over the full regime (e.g., r→0 at fixed mean or r→∞).
minor comments (1)
- Add a plot or table of relative error versus the NBD parameters across the extreme regime to make the 20% claim visually verifiable.
Simulated Author's Rebuttal
We thank the referee for the careful reading of our manuscript and the constructive comments. We address each major comment below and outline the revisions we intend to implement.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central claim that the approximation is 'valid up to ∼20% deviation from the exact value for extreme values of the NBD parameters' does not define the extreme regime (e.g., explicit ranges or limiting behavior for parameters r and p, or equivalently the mean and variance). This definition is load-bearing for assessing whether the stated accuracy holds throughout the claimed domain.
Authors: We agree that a precise definition of the 'extreme' regime is necessary. In the revised version we will explicitly state the parameter domain, for example 0.01 ≤ r ≤ 1 with mean multiplicity μ ≥ 10 (corresponding to p = r/(r+μ) approaching the relevant limits), and confirm that the stated 20% deviation holds throughout this region by additional targeted checks. revision: yes
-
Referee: [Derivation] Derivation section: the step that produces the approximate formula from the chosen series/integral representation must be accompanied by an explicit error bound or asymptotic analysis demonstrating that the truncation error remains ≤20% for all extreme parameter values; numerical checks at isolated points are insufficient to establish the bound over the full regime (e.g., r→0 at fixed mean or r→∞).
Authors: The approximation arises from truncating the integral representation after the leading contribution that dominates when r is small or large at fixed mean. While the original manuscript relied on numerical verification at representative points, we acknowledge that this is not a substitute for an explicit bound. In the revision we will add a short asymptotic analysis of the truncation error in the limits r→0 (fixed mean) and r→∞, together with a denser numerical scan confirming that the relative deviation stays below 20% across the full extreme regime. revision: yes
Circularity Check
No circularity: approximation derived from existing representations without reduction to fitted inputs or self-citations
full rationale
The paper starts from known series and integral representations of the NBD entropy (external to the present work), investigates one of them, and produces an approximate closed-form expression whose claimed accuracy is checked against the exact entropy. No step equates the output formula to a parameter fitted from the target result itself, renames a known pattern, or relies on a load-bearing self-citation or uniqueness theorem imported from the authors' prior work. The derivation chain remains independent of the final approximation and is therefore self-contained.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The Shannon entropy of the negative binomial distribution admits representations via special series and integrals.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
an approximate formula for the entropy that is valid up to ~20% deviation ... replacing the square bracket ... by a single saturating exponential ... Japprox = -D ln[Gamma(k+lambda)/(Gamma(k)Gamma(1+lambda))]
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
No closed form ... representations with special series and integrals
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Charged Particle Multiplicities in Deep Inelastic Scattering at HERA
S. Aidet al.(H1), Charged particle multiplicities in deep inelastic scattering at HERA, Z. Phys. C72, 573 (1996), arXiv:hep-ex/9608011
work page internal anchor Pith review Pith/arXiv arXiv 1996
-
[2]
V. Andreevet al.(H1), Measurement of charged particle multiplicity distributions in DIS at HERA and its implication to entanglement entropy of partons, Eur. Phys. J. C81, 212 (2021), arXiv:2011.01812 [hep-ex]
-
[3]
R. Aaijet al.(LHCb), Measurement of charged particle multiplicities and densities inppcol- lisions at √s= 7 TeV in the forward region, Eur. Phys. J. C74, 2888 (2014), arXiv:1402.4430 [hep-ex]
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[4]
Charged-particle multiplicities in proton-proton collisions at $\sqrt{s}$ = 0.9 to 8 TeV
J. Adamet al.(ALICE), Charged-particle multiplicities in proton–proton collisions at √s= 0.9 to 8 TeV, Eur. Phys. J. C77, 33 (2017), arXiv:1509.07541 [nucl-ex]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[5]
S. Acharyaet al.(ALICE), Charged-particle multiplicity distributions over a wide pseudora- pidity range in proton-proton collisions at √s= 0.9, 7, and 8 TeV, Eur. Phys. J. C77, 852 (2017), arXiv:1708.01435 [hep-ex]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[6]
Charged-particle multiplicities in pp interactions measured with the ATLAS detector at the LHC
G. Aadet al.(ATLAS), Charged-particle multiplicities in pp interactions measured with the ATLAS detector at the LHC, New J. Phys.13, 053033 (2011), arXiv:1012.5104 [hep-ex]
work page internal anchor Pith review Pith/arXiv arXiv 2011
-
[7]
G. Aadet al.(ATLAS), Charged-particle distributions inppinteractions at √s= 8 TeV measured with the ATLAS detector, Eur. Phys. J. C76, 403 (2016), arXiv:1603.02439 [hep- ex]
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[8]
G. Aadet al.(ATLAS), Charged-particle distributions in √s= 13 TeV pp interactions mea- sured with the ATLAS detector, Phys. Lett. B758, 67 (2016), arXiv:1602.01633 [hep-ex]
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[9]
Charged particle multiplicities in pp interactions at sqrt(s) = 0.9, 2.36, and 7 TeV
V. Khachatryanet al.(CMS), Charged Particle Multiplicities inppInteractions at √s= 0.9, 2.36, and 7 TeV, JHEP01, 079, arXiv:1011.5531 [hep-ex]
work page internal anchor Pith review Pith/arXiv arXiv
-
[10]
R. E. Ansorgeet al.(UA5), Charged Particle Multiplicity Distributions at 200-GeV and 900- GeV Center-Of-Mass Energy, Z. Phys. C43, 357 (1989)
work page 1989
-
[11]
A. Adareet al.(PHENIX), Charged hadron multiplicity fluctuations in Au+Au and Cu+Cu collisions from √sN N = 22.5 to 200 GeV, Phys. Rev. C78, 044902 (2008), arXiv:0805.1521 [nucl-ex]. 6
work page internal anchor Pith review Pith/arXiv arXiv 2008
-
[12]
D. E. Kharzeev and E. M. Levin, Deep inelastic scattering as a probe of entanglement, Phys. Rev. D95, 114008 (2017), arXiv:1702.03489 [hep-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2017
- [13]
- [14]
-
[15]
K. Kutak, Entanglement entropy of proton and its relation to thermodynamics entropy (2023), arXiv:2310.18510 [hep-ph]
-
[16]
K. Kutak and M. Prasza lowicz, Entropy, purity and gluon cascades at high energies with recombinations and transitions to vacuum (2025), arXiv:2508.13781 [hep-ph]
-
[17]
M. Hentschinski and K. Kutak, Evidence for the maximally entangled low x proton in Deep Inelastic Scattering from H1 data, Eur. Phys. J. C82, 111 (2022), [Erratum: Eur.Phys.J.C 83, 1147 (2023)], arXiv:2110.06156 [hep-ph]
-
[18]
K. Kutak and S. L¨ ok¨ os, Entropy and multiplicity of hadrons in the high energy limit within dipole cascade models, Phys. Rev. D112, 096017 (2025), arXiv:2509.07898 [hep-ph]
-
[19]
S. L¨ ok¨ os, Charged particle multiplicity distributions derived from the Principle of Maximal Entropy, Acta Phys. Pol. B Proc. Suppl18, 5 (2025), arXiv:2505.23491 [hep-ph]
-
[20]
Expressions for the Entropy of Binomial-Type Distributions
M. Cheraghchi, Expressions for the entropy of binomial-type distributions, CoRR abs/1708.06394(2017), 1708.06394
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[21]
R. Brun and F. Rademakers, ROOT: An object oriented data analysis framework, Nucl. Instrum. Meth. A389, 81 (1997)
work page 1997
- [22]
-
[23]
Scipy gamma function documentation (2025). 7
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.