On the Golomb-Dickman constant under Ewens sampling
Pith reviewed 2026-05-22 10:12 UTC · model grok-4.3
The pith
The generalized Golomb-Dickman constant under the Ewens measure admits an explicit integral representation in terms of the exponential integral.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We define a generalized Golomb--Dickman constant λ_θ as the limiting expected proportion of the longest cycle in random permutations under the Ewens measure with parameter θ > 0. Exploiting the independence properties of Kingman's Poisson process construction of the Poisson--Dirichlet distribution, we obtain an explicit integral representation for λ_θ in terms of the exponential integral. The dependence of λ_θ on θ reflects the transition between regimes dominated by long cycles (small θ) and those with many small cycles (large θ). We also derive the asymptotic behavior of λ_θ for small and large θ.
What carries the argument
Kingman's Poisson process construction of the Poisson-Dirichlet distribution, whose independent increments permit direct derivation of an integral for the expected longest-cycle proportion.
If this is right
- The dependence of λ_θ on θ captures the transition from long-cycle dominated regimes at small θ to many-small-cycles regimes at large θ.
- Explicit asymptotic expressions for λ_θ are derived as θ approaches 0 and as θ approaches infinity.
- Numerical computations and Monte Carlo simulations of the Hoppe urn confirm the integral representation.
- The results are illustrated by an application to a concrete combinatorial setting.
Where Pith is reading between the lines
- The integral form may allow direct numerical quadrature for λ_θ at any θ without repeated simulation of permutations.
- The same Poisson-process approach could be applied to other functionals such as the length of the second-longest cycle.
- The change in scaling with θ may connect to critical phenomena in other models of random integer partitions.
Load-bearing premise
The independence properties of Kingman's Poisson process construction of the Poisson-Dirichlet distribution can be directly exploited to produce the integral representation for the longest-cycle proportion under the Ewens measure.
What would settle it
Numerical evaluation of the integral for a fixed θ, say θ=1, must reproduce the classical Golomb-Dickman value near 0.624 within Monte Carlo error from Ewens-sampled permutations; a clear mismatch would falsify the claimed representation.
Figures
read the original abstract
We define a generalized Golomb--Dickman constant $\lambda_{\theta}$ as the limiting expected proportion of the longest cycle in random permutations under the Ewens measure with parameter $\theta > 0$. Exploiting the independence properties of Kingman's Poisson process construction of the Poisson--Dirichlet distribution, we obtain an explicit integral representation for $\lambda_{\theta}$ in terms of the exponential integral. The dependence of $\lambda_{\theta}$ on $\theta$ reflects the transition between regimes dominated by long cycles (small $\theta$) and those with many small cycles (large $\theta$). We also derive the asymptotic behavior of $\lambda_{\theta}$ for small and large $\theta$ and illustrate our results with numerical computations, Monte Carlo simulations of the Hoppe urn, and an application.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper defines a generalized Golomb-Dickman constant λ_θ as the limiting expected proportion of the longest cycle in random permutations under the Ewens measure with parameter θ > 0. Exploiting the independence properties of Kingman's Poisson process construction of the Poisson-Dirichlet distribution, the authors obtain an explicit integral representation for λ_θ in terms of the exponential integral. They also derive the asymptotic behavior of λ_θ for small and large θ, provide numerical computations and Monte Carlo simulations of the Hoppe urn, and discuss an application.
Significance. If the central derivation holds, the work supplies a clean, explicit integral formula for the expected longest-cycle proportion under Ewens sampling that generalizes the classical Golomb-Dickman constant. The use of the independent Poisson-point-process construction is a genuine strength, yielding a representation free of auxiliary fitting parameters. The accompanying asymptotics, reproducible Monte Carlo checks, and application further increase the result's utility for probabilistic combinatorics.
major comments (2)
- [§3] §3, the step deriving the intensity of the residual process after conditioning on a large component: the manuscript must explicitly record the modified intensity measure on the residual interval so that readers can verify that normalization does not re-introduce dependence that invalidates the subsequent integration.
- [§4, Eq. (15)] §4, Eq. (15) (or the displayed integral for λ_θ): the representation must recover the known numerical value of the Golomb-Dickman constant (≈0.62433) at θ=1; an explicit numerical or analytic check of this special case is required to confirm that the exponential-integral term arises correctly from the PPP construction.
minor comments (2)
- [§5] The Monte Carlo section should state the number of independent Hoppe-urn realizations and report standard errors or confidence intervals on the simulated values of λ_θ.
- [Introduction] A short paragraph recalling the definition of the Ewens sampling measure and its link to the Poisson-Dirichlet distribution would improve accessibility for readers outside the immediate subfield.
Simulated Author's Rebuttal
We thank the referee for the positive evaluation and the constructive comments that improve the clarity of our presentation. We address each major comment in turn.
read point-by-point responses
-
Referee: [§3] §3, the step deriving the intensity of the residual process after conditioning on a large component: the manuscript must explicitly record the modified intensity measure on the residual interval so that readers can verify that normalization does not re-introduce dependence that invalidates the subsequent integration.
Authors: We agree that an explicit statement of the residual intensity will aid verification. In the revised manuscript we insert, immediately after the conditioning argument in §3, the following clarification: conditioning on the presence of a point of size x removes that atom; the residual process on (0,1-x] remains an inhomogeneous Poisson point process with the original intensity measure θ y^{-1} dy (now restricted to y<x), and the normalization factor is simply the total mass on the reduced interval. Because the original construction is a Poisson point process, the residual is independent of the conditioned atom and the subsequent integral representation continues to hold without re-introducing dependence. We thank the referee for highlighting this point. revision: yes
-
Referee: [§4, Eq. (15)] §4, Eq. (15) (or the displayed integral for λ_θ): the representation must recover the known numerical value of the Golomb-Dickman constant (≈0.62433) at θ=1; an explicit numerical or analytic check of this special case is required to confirm that the exponential-integral term arises correctly from the PPP construction.
Authors: We have performed the requested numerical check. Substituting θ=1 into the integral representation (15) and evaluating the resulting expression via quadrature yields approximately 0.624329, which agrees with the accepted value of the classical Golomb–Dickman constant to the reported precision. In the revised version we add a short remark in §4 (immediately after Eq. (15)) stating this numerical agreement and briefly describing the quadrature method used, thereby confirming that the exponential-integral form is consistent with the θ=1 case. revision: yes
Circularity Check
Derivation from independent Kingman PPP construction shows no circularity
full rationale
The paper derives the explicit integral representation for λ_θ by exploiting the independence properties of Kingman's Poisson process construction of the Poisson-Dirichlet distribution (intensity θ e^{-x}/x dx). This is a standard external result in probability theory, not a self-citation or author-specific ansatz. The survival function P(largest > u) follows from void probabilities, and the expectation is obtained by integrating the residual process after removing large points, with the exponential integral arising from the change of variables. No equation reduces to a fitted input, self-definition, or prior result by the same authors. The θ=1 consistency check is a validation against an independently known value, not a load-bearing input. The derivation is self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Kingman's Poisson process construction yields the Poisson-Dirichlet distribution with the required independence properties for cycle lengths under Ewens sampling
Reference graph
Works this paper leans on
-
[1]
D. J. Aldous,Exchangeability and related topics, In: P. L. Hennequin (Ed.), ´Ecole d’Et´ e de Probabilit´ es de Saint-Flour XIII, 1983, Lecture Notes in Mathematics1117, Springer, Berlin, 1985, pp. 1–198. MR 0883646
work page 1983
-
[2]
R. Arratia, A. D. Barbour, and S. Tavar´ e,Logarithmic Combinatorial Structures: A Proba- bilistic Approach, European Mathematical Society, Z¨ urich, 2003. MR 2032426
work page 2003
-
[3]
R. Arratia and S. Tavar´ e,The cycle structure of random permutations, The Annals of Prob- ability20(1992), no. 3, 1567–1591. MR 1175278 10 J. R. G. MENDONC ¸ A AND L. J. NEGRET
work page 1992
-
[4]
Campisi,Lectures on the Mechanical Foundations of Thermodynamics, 2nd ed
M. Campisi,Lectures on the Mechanical Foundations of Thermodynamics, 2nd ed. Springer, Cham, 2024
work page 2024
-
[5]
Crane,The ubiquitous Ewens sampling formula, Statistical Science31(2016), no
H. Crane,The ubiquitous Ewens sampling formula, Statistical Science31(2016), no. 1, 1–19. MR 3458585
work page 2016
-
[6]
N. M. Ercolani and D. Ueltschi,Cycle structure of random permutations with cycle weights, Random Structures & Algorithms44(2014), no. 1, 109–133. MR 3143592
work page 2014
-
[7]
W. J. Ewens,The sampling theory of selectively neutral alleles, Theoretical Population Biol- ogy3(1972), 87–112. MR 0325177
work page 1972
-
[8]
Ford,Cycle type of random permutations: A toolkit, Discrete Analysis (2022), art
K. Ford,Cycle type of random permutations: A toolkit, Discrete Analysis (2022), art. 9 (36pp). MR 4481406
work page 2022
-
[9]
V. Gonˇ carov,On the field of combinatory analysis, In: Twelve Papers on Number The- ory and Function Theory, American Mathematical Society Translations (2)19(1962), 1–46. MR 0131369 [Originally published in Izvestiya Akademii Nauk SSSR, Seriya Matematich- eskaya8(1944), no. 1, 3–48, in Russian. MR 0010922]
work page 1962
-
[10]
L. Holst,The Poisson-Dirichlet distribution and its relatives revisited, preprint (2001), Royal Institute of Technology, Stockholm, Sweden, URL: https://www.math.kth.se/matstat/fofu/ reports/PoiDir.pdf
work page 2001
-
[11]
F. M. Hoppe,P´ olya-like urns and the Ewens’ sampling formula, Journal of Mathematical Biology20(1984), no. 1, 91–94
work page 1984
-
[12]
H. Jeffreys and B. S. Jeffreys,Methods of Mathematical Physics, 3rd ed., Cambridge Univer- sity Press, Cambridge, 1972. MR 1744997
work page 1972
-
[13]
J. F. C. Kingman,Random discrete distributions, Journal of the Royal Statistical Society. Series B (Methodological)37(1975), no. 1, 1–22. MR 0368264
work page 1975
-
[14]
Pitman,Combinatorial Stochastic Processes, In: J
J. Pitman,Combinatorial Stochastic Processes, In: J. Picard (Ed.), ´Ecole d’Et´ e de Proba- bilit´ es de Saint-Flour XXXII, 2002, Lecture Notes in Mathematics1875, Springer, Heidel- berg, 2006. MR 2245368
work page 2002
-
[15]
L. A. Shepp and S. Lloyd,Ordered cycle lengths in a random permutation, Transactions of the American Mathematical Society121(1966), 340–357. MR 195117
work page 1966
-
[16]
T. Tao,The Poisson–Dirichlet process, and large prime factors of a random number, What’s New Daily Archive, URL: https://terrytao.wordpress.com/2013/09/21/
work page 2013
-
[17]
S. Tavar´ e,The magical Ewens sampling formula, Bulletin of the London Mathematical So- ciety53(2021), no. 6, 1563–1582. MR 4368686
work page 2021
-
[18]
Touchard,Sur les cycles des substitutions, Acta Mathematica70(1939), no
J. Touchard,Sur les cycles des substitutions, Acta Mathematica70(1939), no. 1, 243–297. MR 1555449
work page 1939
-
[19]
Escola de Artes, Ciˆencias e Humanidades, Universidade de S ˜ao Paulo, SP, Brazil
Wolfram Research, Mathematica, Version 14.3.Wolfram Research, Inc., Champaign, IL, 2025. Escola de Artes, Ciˆencias e Humanidades, Universidade de S ˜ao Paulo, SP, Brazil. Email address:jricardo@usp.br Instituto de Matem´atica e Estat´ıstica, Universidade de S ˜ao Paulo, SP, Brazil Email address:ljnegrett@ime.usp.br
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.