pith. sign in

arxiv: 2603.23175 · v3 · pith:VWKT6KLRnew · submitted 2026-03-24 · 🧮 math.PR · cond-mat.stat-mech· math.ST· stat.TH

On the Golomb-Dickman constant under Ewens sampling

Pith reviewed 2026-05-22 10:12 UTC · model grok-4.3

classification 🧮 math.PR cond-mat.stat-mechmath.STstat.TH
keywords Golomb-Dickman constantEwens measurePoisson-Dirichlet distributionlongest cyclerandom permutationsexponential integralKingman construction
0
0 comments X

The pith

The generalized Golomb-Dickman constant under the Ewens measure admits an explicit integral representation in terms of the exponential integral.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper defines a generalized Golomb-Dickman constant λ_θ as the limiting expected proportion of the longest cycle when random permutations are drawn from the Ewens measure with parameter θ. It derives an explicit integral formula for this constant by using the independence properties in Kingman's Poisson process construction of the Poisson-Dirichlet distribution. A sympathetic reader would care because the formula makes visible how the typical longest cycle shrinks as θ grows, marking a shift from regimes with a few long cycles to those with many short ones. The work supplies asymptotic formulas for λ_θ when θ is small or large and supports the result with numerical checks and simulations.

Core claim

We define a generalized Golomb--Dickman constant λ_θ as the limiting expected proportion of the longest cycle in random permutations under the Ewens measure with parameter θ > 0. Exploiting the independence properties of Kingman's Poisson process construction of the Poisson--Dirichlet distribution, we obtain an explicit integral representation for λ_θ in terms of the exponential integral. The dependence of λ_θ on θ reflects the transition between regimes dominated by long cycles (small θ) and those with many small cycles (large θ). We also derive the asymptotic behavior of λ_θ for small and large θ.

What carries the argument

Kingman's Poisson process construction of the Poisson-Dirichlet distribution, whose independent increments permit direct derivation of an integral for the expected longest-cycle proportion.

If this is right

  • The dependence of λ_θ on θ captures the transition from long-cycle dominated regimes at small θ to many-small-cycles regimes at large θ.
  • Explicit asymptotic expressions for λ_θ are derived as θ approaches 0 and as θ approaches infinity.
  • Numerical computations and Monte Carlo simulations of the Hoppe urn confirm the integral representation.
  • The results are illustrated by an application to a concrete combinatorial setting.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The integral form may allow direct numerical quadrature for λ_θ at any θ without repeated simulation of permutations.
  • The same Poisson-process approach could be applied to other functionals such as the length of the second-longest cycle.
  • The change in scaling with θ may connect to critical phenomena in other models of random integer partitions.

Load-bearing premise

The independence properties of Kingman's Poisson process construction of the Poisson-Dirichlet distribution can be directly exploited to produce the integral representation for the longest-cycle proportion under the Ewens measure.

What would settle it

Numerical evaluation of the integral for a fixed θ, say θ=1, must reproduce the classical Golomb-Dickman value near 0.624 within Monte Carlo error from Ewens-sampled permutations; a clear mismatch would falsify the claimed representation.

Figures

Figures reproduced from arXiv: 2603.23175 by Jos\'e Ricardo G. Mendon\c{c}a, Luis Jehiel Negret.

Figure 1
Figure 1. Figure 1: displays the behavior of λθ × θ for 0 ≤ θ ≤ 5, while [PITH_FULL_IMAGE:figures/full_fig_p007_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: displays the time evolution of the color proportions in one simulation of 50 draws in a Hoppe urn with θ = 1. 0 10 20 30 40 50 n 0.0 0.2 0.4 0.6 0.8 1.0 Color proportion [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: , these statistical fluctuations are negligible and therefore omitted, as the numerical data are included for illustration only. 0 2 4 6 8 10 θ 0.0 0.2 0.4 0.6 0.8 1.0 λ θ λθ θ → 0 θ → ∞ Hoppe [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗
read the original abstract

We define a generalized Golomb--Dickman constant $\lambda_{\theta}$ as the limiting expected proportion of the longest cycle in random permutations under the Ewens measure with parameter $\theta > 0$. Exploiting the independence properties of Kingman's Poisson process construction of the Poisson--Dirichlet distribution, we obtain an explicit integral representation for $\lambda_{\theta}$ in terms of the exponential integral. The dependence of $\lambda_{\theta}$ on $\theta$ reflects the transition between regimes dominated by long cycles (small $\theta$) and those with many small cycles (large $\theta$). We also derive the asymptotic behavior of $\lambda_{\theta}$ for small and large $\theta$ and illustrate our results with numerical computations, Monte Carlo simulations of the Hoppe urn, and an application.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper defines a generalized Golomb-Dickman constant λ_θ as the limiting expected proportion of the longest cycle in random permutations under the Ewens measure with parameter θ > 0. Exploiting the independence properties of Kingman's Poisson process construction of the Poisson-Dirichlet distribution, the authors obtain an explicit integral representation for λ_θ in terms of the exponential integral. They also derive the asymptotic behavior of λ_θ for small and large θ, provide numerical computations and Monte Carlo simulations of the Hoppe urn, and discuss an application.

Significance. If the central derivation holds, the work supplies a clean, explicit integral formula for the expected longest-cycle proportion under Ewens sampling that generalizes the classical Golomb-Dickman constant. The use of the independent Poisson-point-process construction is a genuine strength, yielding a representation free of auxiliary fitting parameters. The accompanying asymptotics, reproducible Monte Carlo checks, and application further increase the result's utility for probabilistic combinatorics.

major comments (2)
  1. [§3] §3, the step deriving the intensity of the residual process after conditioning on a large component: the manuscript must explicitly record the modified intensity measure on the residual interval so that readers can verify that normalization does not re-introduce dependence that invalidates the subsequent integration.
  2. [§4, Eq. (15)] §4, Eq. (15) (or the displayed integral for λ_θ): the representation must recover the known numerical value of the Golomb-Dickman constant (≈0.62433) at θ=1; an explicit numerical or analytic check of this special case is required to confirm that the exponential-integral term arises correctly from the PPP construction.
minor comments (2)
  1. [§5] The Monte Carlo section should state the number of independent Hoppe-urn realizations and report standard errors or confidence intervals on the simulated values of λ_θ.
  2. [Introduction] A short paragraph recalling the definition of the Ewens sampling measure and its link to the Poisson-Dirichlet distribution would improve accessibility for readers outside the immediate subfield.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the positive evaluation and the constructive comments that improve the clarity of our presentation. We address each major comment in turn.

read point-by-point responses
  1. Referee: [§3] §3, the step deriving the intensity of the residual process after conditioning on a large component: the manuscript must explicitly record the modified intensity measure on the residual interval so that readers can verify that normalization does not re-introduce dependence that invalidates the subsequent integration.

    Authors: We agree that an explicit statement of the residual intensity will aid verification. In the revised manuscript we insert, immediately after the conditioning argument in §3, the following clarification: conditioning on the presence of a point of size x removes that atom; the residual process on (0,1-x] remains an inhomogeneous Poisson point process with the original intensity measure θ y^{-1} dy (now restricted to y<x), and the normalization factor is simply the total mass on the reduced interval. Because the original construction is a Poisson point process, the residual is independent of the conditioned atom and the subsequent integral representation continues to hold without re-introducing dependence. We thank the referee for highlighting this point. revision: yes

  2. Referee: [§4, Eq. (15)] §4, Eq. (15) (or the displayed integral for λ_θ): the representation must recover the known numerical value of the Golomb-Dickman constant (≈0.62433) at θ=1; an explicit numerical or analytic check of this special case is required to confirm that the exponential-integral term arises correctly from the PPP construction.

    Authors: We have performed the requested numerical check. Substituting θ=1 into the integral representation (15) and evaluating the resulting expression via quadrature yields approximately 0.624329, which agrees with the accepted value of the classical Golomb–Dickman constant to the reported precision. In the revised version we add a short remark in §4 (immediately after Eq. (15)) stating this numerical agreement and briefly describing the quadrature method used, thereby confirming that the exponential-integral form is consistent with the θ=1 case. revision: yes

Circularity Check

0 steps flagged

Derivation from independent Kingman PPP construction shows no circularity

full rationale

The paper derives the explicit integral representation for λ_θ by exploiting the independence properties of Kingman's Poisson process construction of the Poisson-Dirichlet distribution (intensity θ e^{-x}/x dx). This is a standard external result in probability theory, not a self-citation or author-specific ansatz. The survival function P(largest > u) follows from void probabilities, and the expectation is obtained by integrating the residual process after removing large points, with the exponential integral arising from the change of variables. No equation reduces to a fitted input, self-definition, or prior result by the same authors. The θ=1 consistency check is a validation against an independently known value, not a load-bearing input. The derivation is self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the standard construction of the Poisson-Dirichlet distribution and the independence properties of its Poisson-process representation; no new free parameters or invented entities are introduced.

axioms (1)
  • domain assumption Kingman's Poisson process construction yields the Poisson-Dirichlet distribution with the required independence properties for cycle lengths under Ewens sampling
    Invoked to obtain the integral representation for λ_θ

pith-pipeline@v0.9.0 · 5672 in / 1007 out tokens · 27028 ms · 2026-05-22T10:12:41.225560+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

19 extracted references · 19 canonical work pages

  1. [1]

    D. J. Aldous,Exchangeability and related topics, In: P. L. Hennequin (Ed.), ´Ecole d’Et´ e de Probabilit´ es de Saint-Flour XIII, 1983, Lecture Notes in Mathematics1117, Springer, Berlin, 1985, pp. 1–198. MR 0883646

  2. [2]

    Arratia, A

    R. Arratia, A. D. Barbour, and S. Tavar´ e,Logarithmic Combinatorial Structures: A Proba- bilistic Approach, European Mathematical Society, Z¨ urich, 2003. MR 2032426

  3. [3]

    Arratia and S

    R. Arratia and S. Tavar´ e,The cycle structure of random permutations, The Annals of Prob- ability20(1992), no. 3, 1567–1591. MR 1175278 10 J. R. G. MENDONC ¸ A AND L. J. NEGRET

  4. [4]

    Campisi,Lectures on the Mechanical Foundations of Thermodynamics, 2nd ed

    M. Campisi,Lectures on the Mechanical Foundations of Thermodynamics, 2nd ed. Springer, Cham, 2024

  5. [5]

    Crane,The ubiquitous Ewens sampling formula, Statistical Science31(2016), no

    H. Crane,The ubiquitous Ewens sampling formula, Statistical Science31(2016), no. 1, 1–19. MR 3458585

  6. [6]

    N. M. Ercolani and D. Ueltschi,Cycle structure of random permutations with cycle weights, Random Structures & Algorithms44(2014), no. 1, 109–133. MR 3143592

  7. [7]

    W. J. Ewens,The sampling theory of selectively neutral alleles, Theoretical Population Biol- ogy3(1972), 87–112. MR 0325177

  8. [8]

    Ford,Cycle type of random permutations: A toolkit, Discrete Analysis (2022), art

    K. Ford,Cycle type of random permutations: A toolkit, Discrete Analysis (2022), art. 9 (36pp). MR 4481406

  9. [9]

    Gonˇ carov,On the field of combinatory analysis, In: Twelve Papers on Number The- ory and Function Theory, American Mathematical Society Translations (2)19(1962), 1–46

    V. Gonˇ carov,On the field of combinatory analysis, In: Twelve Papers on Number The- ory and Function Theory, American Mathematical Society Translations (2)19(1962), 1–46. MR 0131369 [Originally published in Izvestiya Akademii Nauk SSSR, Seriya Matematich- eskaya8(1944), no. 1, 3–48, in Russian. MR 0010922]

  10. [10]

    L. Holst,The Poisson-Dirichlet distribution and its relatives revisited, preprint (2001), Royal Institute of Technology, Stockholm, Sweden, URL: https://www.math.kth.se/matstat/fofu/ reports/PoiDir.pdf

  11. [11]

    F. M. Hoppe,P´ olya-like urns and the Ewens’ sampling formula, Journal of Mathematical Biology20(1984), no. 1, 91–94

  12. [12]

    Jeffreys and B

    H. Jeffreys and B. S. Jeffreys,Methods of Mathematical Physics, 3rd ed., Cambridge Univer- sity Press, Cambridge, 1972. MR 1744997

  13. [13]

    J. F. C. Kingman,Random discrete distributions, Journal of the Royal Statistical Society. Series B (Methodological)37(1975), no. 1, 1–22. MR 0368264

  14. [14]

    Pitman,Combinatorial Stochastic Processes, In: J

    J. Pitman,Combinatorial Stochastic Processes, In: J. Picard (Ed.), ´Ecole d’Et´ e de Proba- bilit´ es de Saint-Flour XXXII, 2002, Lecture Notes in Mathematics1875, Springer, Heidel- berg, 2006. MR 2245368

  15. [15]

    L. A. Shepp and S. Lloyd,Ordered cycle lengths in a random permutation, Transactions of the American Mathematical Society121(1966), 340–357. MR 195117

  16. [16]

    Tao,The Poisson–Dirichlet process, and large prime factors of a random number, What’s New Daily Archive, URL: https://terrytao.wordpress.com/2013/09/21/

    T. Tao,The Poisson–Dirichlet process, and large prime factors of a random number, What’s New Daily Archive, URL: https://terrytao.wordpress.com/2013/09/21/

  17. [17]

    Tavar´ e,The magical Ewens sampling formula, Bulletin of the London Mathematical So- ciety53(2021), no

    S. Tavar´ e,The magical Ewens sampling formula, Bulletin of the London Mathematical So- ciety53(2021), no. 6, 1563–1582. MR 4368686

  18. [18]

    Touchard,Sur les cycles des substitutions, Acta Mathematica70(1939), no

    J. Touchard,Sur les cycles des substitutions, Acta Mathematica70(1939), no. 1, 243–297. MR 1555449

  19. [19]

    Escola de Artes, Ciˆencias e Humanidades, Universidade de S ˜ao Paulo, SP, Brazil

    Wolfram Research, Mathematica, Version 14.3.Wolfram Research, Inc., Champaign, IL, 2025. Escola de Artes, Ciˆencias e Humanidades, Universidade de S ˜ao Paulo, SP, Brazil. Email address:jricardo@usp.br Instituto de Matem´atica e Estat´ıstica, Universidade de S ˜ao Paulo, SP, Brazil Email address:ljnegrett@ime.usp.br