The impatient collector
Pith reviewed 2026-05-25 15:21 UTC · model grok-4.3
The pith
The coupon collector conditioned on completing in at most (1+ν)n trials follows a new limiting completion curve ζ(ν,·).
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For ν > 0 the asymptotic shape ζ(ν,·) of the completion curve under the condition T_n ≤ (1+ν)n exists and can be characterized. This shape describes the typical trajectory of the number of distinct coupons obtained after nt trials when the collection finishes in the rare fast regime.
What carries the argument
The conditioned coupon collector process, whose completion curve ζ(ν,·) tracks the fraction of distinct items collected by time nt under the fast-completion event.
If this is right
- The fraction of items collected by time nt, given T_n ≤ (1+ν)n, converges to ζ(ν,t) instead of the unconditional 1 - e^{-t}.
- The same conditioning supplies a new derivation of Koršunov's formula counting complete accessible automata.
- The result gives the typical path followed by the collector in the large-deviation regime of unusually rapid completion.
Where Pith is reading between the lines
- An explicit formula for ζ(ν,·) would allow direct computation of other conditioned statistics such as the distribution of duplicate counts along fast paths.
- The conditioning technique may transfer to other hitting-time problems for Markov chains on large state spaces.
- Finite-n simulations for moderate ν could be compared directly against the predicted curve to locate the onset of the asymptotic regime.
Load-bearing premise
An asymptotic completion curve ζ(ν,·) exists and can be characterized for the coupon collector process conditioned on T_n ≤ (1+ν)n.
What would settle it
Large-scale Monte Carlo sampling of coupon-collector trajectories conditioned on T_n ≤ (1+ν)n for large n, showing that the empirical average of the fraction collected by time nt does not converge to the claimed ζ(ν,t).
Figures
read the original abstract
In the coupon collector problem with $n$ items, the collector needs a random number of tries $T_n\simeq n\ln n$ to complete the collection. Also, after $nt$ tries, the collector has secured approximately a fraction $\zeta_\infty(t)=1-e^{-t}$ of the complete collection, so we call $\zeta_\infty$ the (asymptotic) \emph{completion curve}. In this paper, for $\nu>0$, we address the asymptotic shape $\zeta (\nu,.) $ of the completion curve under the condition $T_n\leq \left( 1+\nu \right) n$, i.e. assuming that the collection is \emph{completed unlikely fast}. As an application to the asymptotic study of complete accessible automata, we provide a new derivation of a formula due to Kor\v{s}unov.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript studies the coupon collector problem conditioned on the rare event T_n ≤ (1+ν)n for ν>0. It claims to establish the existence and characterize the asymptotic shape ζ(ν,.) of the completion curve (the fraction of the collection obtained after nt steps under this conditioning), and applies the result to give a new derivation of Koršunov's formula for the asymptotic enumeration of complete accessible automata.
Significance. If the central limit and characterization hold with the claimed rigor, the work would supply a concrete description of a conditioned coupon-collector trajectory and an independent derivation of a known automata-theoretic formula. The absence of free parameters or ad-hoc axioms in the ledger is a positive feature, but the abstract supplies no derivation steps, error bounds, or verification that the limit exists, so the significance cannot yet be assessed.
major comments (1)
- [Abstract] Abstract, paragraph 2: the claim that an asymptotic completion curve ζ(ν,.) 'exists and can be characterized' is stated without any indication of the large-deviation or conditioning argument used to obtain it; the reader's report correctly notes that no derivation, error analysis, or existence proof is visible, rendering the central claim unverifiable from the supplied text.
Simulated Author's Rebuttal
We thank the referee for their careful reading and for highlighting the need for greater clarity in the abstract regarding the derivation of the asymptotic completion curve. The full manuscript contains a rigorous large-deviation analysis establishing existence and characterization; the abstract is a high-level summary. We address the single major comment below.
read point-by-point responses
-
Referee: [Abstract] Abstract, paragraph 2: the claim that an asymptotic completion curve ζ(ν,.) 'exists and can be characterized' is stated without any indication of the large-deviation or conditioning argument used to obtain it; the reader's report correctly notes that no derivation, error analysis, or existence proof is visible, rendering the central claim unverifiable from the supplied text.
Authors: The abstract summarizes the main result without technical details, as is standard. The existence of the limit ζ(ν,·) and its explicit characterization are proved in Section 3 via a large-deviation principle for the empirical measure of the coupon-collector process conditioned on {T_n ≤ (1+ν)n}. The proof proceeds by establishing tightness of the conditioned trajectory, identifying the rate function, and solving the associated variational problem; error bounds appear in Theorem 3.2. The application to Koršunov’s formula is derived in Section 4 from the same limit. We agree that a brief indication of the large-deviation approach in the abstract would improve readability and will add one sentence to that effect. revision: partial
Circularity Check
No significant circularity; derivation presented as independent
full rationale
The supplied abstract and context contain no equations, fitted parameters, self-citations, or derivation steps that reduce to their own inputs by construction. The central claim is an asymptotic characterization of the conditioned completion curve ζ(ν,·) for the coupon collector process, presented as a new derivation of an external formula (Koršunov). No load-bearing step is visible that would qualify under any of the enumerated circularity patterns. The result is therefore treated as self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
51, Springer-Verlag, New York, 2003, Stochastic Modelling and Applied Probability
S\"oren Asmussen, Applied probability and queues, second ed., Applications of Mathematics (New York), vol. 51, Springer-Verlag, New York, 2003, Stochastic Modelling and Applied Probability. 1978607
work page 2003
-
[2]
1709, Springer, Berlin, 1999, pp
Michel Bena\" i m, Dynamics of stochastic approximation algorithms, S\' e minaire de P robabilit\' e s, XXXIII , Lecture Notes in Math., vol. 1709, Springer, Berlin, 1999, pp. 1--68. 1767993
work page 1999
-
[3]
S. D. Conte, Elementary numerical analysis: A n algorithmic approach , McGraw-Hill Book Co., New York-Toronto, Ont.-London, 1965. 0202267
work page 1965
-
[4]
34, Springer-Verlag, Berlin, 1997, Translated from the 1990 French original by Stephen S
Marie Duflo, Random iterative models, Applications of Mathematics (New York), vol. 34, Springer-Verlag, Berlin, 1997, Translated from the 1990 French original by Stephen S. Wilson and revised by the author. 1485774
work page 1997
-
[5]
Philippe Flajolet and Robert Sedgewick, Analytic combinatorics, Cambridge University Press, Cambridge, 2009. 2483235
work page 2009
-
[6]
I. J. Good, An asymptotic formula for the differences of the powers at zero, Ann. Math. Statist. 32 (1961), 249--256. 0120204
work page 1961
-
[7]
A. D. Kor s unov, Enumeration of finite automata, Problemy Kibernet. (1978), no. 34, 5--82, 272. 517814
work page 1978
-
[8]
, On the number of nonisomorphic strongly connected finite automata, Elektron. Informationsverarb. Kybernet. 22 (1986), no. 9, 459--462. 862029
work page 1986
-
[9]
V. F. Kolchin, B. A. Sevastyanov, and V. P. Chistyakov, Random allocations, John Wiley & Sons, 1978, Scripta Series in Mathematics. 0471015
work page 1978
-
[10]
Elcio Lebensztayn, On the asymptotic enumeration of accessible automata, Discrete Math. Theor. Comput. Sci. 12 (2010), no. 3, 75--79. 2786470
work page 2010
-
[11]
Lothaire, Applied combinatorics on words, Encyclopedia of Mathematics and its Applications, vol
M. Lothaire, Applied combinatorics on words, Encyclopedia of Mathematics and its Applications, vol. 105, Cambridge University Press, Cambridge, 2005. 2165687
work page 2005
-
[12]
A. Meir and J. W. Moon, The asymptotic behaviour of coefficients of powers of certain generating functions, European J. Combin. 11 (1990), no. 6, 581--587. 1078714
work page 1990
-
[13]
Cyril Nicaud, É tude du comportement en moyenne des automates finis et des langages rationnels , Ph.D. thesis, Paris VII, 2000
work page 2000
-
[14]
Herbert Robbins, A remark on the joint distribution of cumulative sums, Ann. Math. Statistics 25 (1954), 614--616. 0063592
work page 1954
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.