Recursive Record Filtering and Longest Decreasing Subsequences
Pith reviewed 2026-05-08 05:42 UTC · model grok-4.3
The pith
The number of passes in recursive record filtering on a sequence equals the length of its longest decreasing subsequence.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For any permutation p_n the recursive Disappear-Sort procedure produces layers that constitute an antichain decomposition of the associated poset, hence the number of passes equals L(p_n), the length of the longest decreasing subsequence. When p_n is drawn uniformly at random, the expectation of this quantity coincides with the expected first-column length of the Plancherel-random Young diagram; the Robinson-Schensted correspondence then yields an exact expression in terms of partitions and standard Young tableaux, from which the asymptotics E[D_n] ~ 2 sqrt(n) and Tracy-Widom fluctuations on the n^{1/6} scale follow directly.
What carries the argument
The recursive Disappear-Sort procedure, which retains the left-to-right records of the current sequence in each pass and recurses on the discarded entries.
If this is right
- For every permutation the number of Disappear-Sort passes equals its longest decreasing subsequence length.
- The expected passes for a uniform random permutation equals the expected first-column length of a Plancherel Young diagram.
- An exact formula for the expectation is obtained by summing over partitions weighted by the number of standard Young tableaux.
- The expectation grows asymptotically as 2 sqrt(n) with fluctuations of scale n^{1/6} governed by the Tracy-Widom law.
Where Pith is reading between the lines
- The O(n log n) implementation supplies an efficient algorithm for computing longest decreasing subsequence lengths.
- The equivalence may allow other asymptotic properties of Plancherel Young diagrams to be transferred to record-filtering statistics on random sequences.
- The same layer decomposition might apply to variants of the procedure on other partial orders.
Load-bearing premise
The layers produced by repeated record filtering form an antichain decomposition of the poset attached to the permutation.
What would settle it
For any explicit permutation, count the passes required by the recursive filtering procedure and compare the count directly with the length of its longest decreasing subsequence.
Figures
read the original abstract
We consider a recursive record-filtering procedure, which we informally call Disappear-Sort. Let $D_n$ denote the random variable giving the required number of passes in Disappear-Sort to eliminate a sequence of length $n$ sampled as i.i.d. copies of a continuous random variable $X$, where each pass retains the left-to-right records and discards all remaining entries. We show that this procedure admits two natural probabilistic interpretations. For the resampling variant we prove that $d_n=\mathbb{E}[D_n]$ satisfies an exact recurrence involving the unsigned Stirling numbers of the first kind. For the non-resampling variant, we associate to a permutation $p_n\in S_n$ a natural poset and prove that the recursive Disappear-Sort layers form an antichain decomposition of this poset. We deduce that the total number of passes equals $L(p_n)$, where $L(p_n)$ is the length of the longest decreasing subsequence of $p_n$. We then show that for a uniform random permutation of size $n$, the expectation $\mathbb{E}[D_n]$ of this second variant coincides with the expected first-column length of a Plancherel-random Young diagram. Using the Robinson--Schensted correspondence, we obtain an exact formula for this expectation in terms of partitions and standard Young tableaux, and classical Plancherel asymptotics then yield $\mathbb{E}[D_n]\sim 2\sqrt{n}$, with fluctuations on the $n^{1/6}$ scale governed by the Tracy--Widom law derived by Baik, Deift and Johansson. We conclude with an $O(n\log n)$ implementation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces the Disappear-Sort recursive record-filtering procedure and studies the number of passes D_n required to process a length-n sequence. For the resampling variant it derives an exact recurrence for the expectation d_n in terms of unsigned Stirling numbers of the first kind. For the non-resampling variant on a permutation p_n it defines a natural poset on the entries, proves that the successive record layers form an antichain decomposition of this poset, and concludes that the number of passes equals L(p_n), the length of the longest decreasing subsequence. It then shows that for uniform random p_n the expectation E[D_n] equals the expected length of the first column of a Plancherel-random Young diagram, supplies an exact formula via the Robinson-Schensted correspondence and standard Young tableaux, and recovers the asymptotic E[D_n] ∼ 2√n together with n^{1/6}-scale Tracy-Widom fluctuations from the Baik-Deift-Johansson theorem. An O(n log n) implementation is also presented.
Significance. If the central identifications hold, the paper supplies a new algorithmic and probabilistic model for the first-column length of Plancherel-random Young diagrams, directly linking a simple record-filtering process to the length of the longest decreasing subsequence and to the RSK shape. The use of Mirsky's theorem to equate passes with poset height, followed by the symmetry of Plancherel measure and the classical BDJ asymptotics, yields a clean derivation of both exact and asymptotic results. The exact formula in terms of partitions and SYT counts and the efficient implementation are additional strengths that could facilitate further study of permutation statistics.
major comments (1)
- [non-resampling variant and poset association] In the section establishing the antichain decomposition for the non-resampling variant, the argument that successive record layers partition the poset into antichains (and hence that the number of layers equals the height) should explicitly verify that each layer is an antichain under the chosen partial order; a short direct argument showing that no two elements retained in the same pass can form a decreasing pair would strengthen the application of Mirsky's theorem.
minor comments (3)
- [resampling variant] The recurrence for the resampling variant is stated in terms of unsigned Stirling numbers of the first kind; including the explicit initial conditions and the first few computed values of d_n would make the recurrence easier to verify.
- [implementation] The O(n log n) implementation is announced in the abstract and conclusion; the manuscript would benefit from a brief pseudocode listing or a short complexity analysis paragraph to confirm the claimed running time.
- [preliminaries] Notation for the poset associated to p_n is introduced only in the non-resampling section; moving a concise definition to the preliminaries would improve readability for readers unfamiliar with the construction.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive recommendation. The suggestion to strengthen the antichain verification is well taken, and we will incorporate a short direct argument in the revised manuscript.
read point-by-point responses
-
Referee: In the section establishing the antichain decomposition for the non-resampling variant, the argument that successive record layers partition the poset into antichains (and hence that the number of layers equals the height) should explicitly verify that each layer is an antichain under the chosen partial order; a short direct argument showing that no two elements retained in the same pass can form a decreasing pair would strengthen the application of Mirsky's theorem.
Authors: We agree that an explicit verification strengthens the exposition. In the revised version we will insert a short direct argument immediately after defining the poset: suppose two entries a and b are retained in the same pass. By construction of the left-to-right record filter, the later of the two cannot be smaller than the earlier one (otherwise the later entry would have been discarded as non-record). Hence a and b are incomparable in the poset, confirming that each layer is an antichain. This makes the appeal to Mirsky's theorem fully explicit while preserving the existing proof structure. revision: yes
Circularity Check
No significant circularity; derivation uses standard theorems and independent priors
full rationale
The paper proves that Disappear-Sort passes equal the longest decreasing subsequence length by constructing an antichain decomposition of the natural poset on the permutation and invoking Mirsky's theorem (height equals minimum antichain cover). This is a direct combinatorial argument with no definitional loop. The expectation is then identified with the expected first column length of a Plancherel-random Young diagram via the Robinson-Schensted correspondence, a standard bijection. The asymptotic E[D_n] ~ 2√n with Tracy-Widom fluctuations is taken from the independent Baik-Deift-Johansson result on Plancherel measure, which is externally established and not derived from the present construction. No fitted parameters are relabeled as predictions, no self-citations are load-bearing, and no ansatz is smuggled. The chain is self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (3)
- standard math The unsigned Stirling numbers of the first kind satisfy the recurrence for the expectation in the resampling variant.
- standard math Robinson-Schensted correspondence maps permutations to pairs of Young tableaux with Plancherel measure.
- domain assumption The poset associated to the permutation has the property that Disappear-Sort layers are antichains whose number equals the longest decreasing subsequence length.
Reference graph
Works this paper leans on
-
[1]
A leader-election procedure using records
Gerold Alsmeyer, Zakhar Kabluchko, and Alexander Marynych. A leader-election procedure using records. The Annals of Probability , 45(6B):4348–4388, November
-
[2]
doi:10.1214/16-AOP1167
-
[3]
On the distribution of the length of the longest increasing subsequence of random per muta- tions
Jinho Baik, Percy Deift, and Kurt Johansson. On the distribution of the length of the longest increasing subsequence of random per muta- tions. Journal of the American Mathematical Society , 12(4):1119–1178, 1999. doi:10.1090/S0894-0347-99-00307-0
-
[4]
David and H
Herbert A. David and H. N. Nagaraja. Order Statistics. Wiley-Interscience, 3 edition,
-
[5]
ISBN 978-0-471-38926-2. doi:10.1002/0471722162
-
[6]
J. S. Frame, G. de B. Robinson, and R. M. Thrall. The hook graphs of the symmetric group. Canadian Journal of Mathematics , 6:316–324, 1954. doi:10.4153/CJM-1954-030-1
-
[7]
B. F. Logan and L. A. Shepp. A variational problem for random Yo ung tableaux. Advances in Mathematics , 26(2):206–222, 1977. ISSN 0001-8708. doi:10.1016/0001-8708(77)90030-5
-
[8]
L. Mirsky. A dual of Dilworth’s decomposition theorem. The American Mathematical Monthly, 78(8):876–877, 1971. doi:10.2307/2316481
-
[9]
Th´ eorie des ´ el´ ements saillants d’une suite d’observations
Alfr´ ed R´ enyi. Th´ eorie des ´ el´ ements saillants d’une suite d’observations. Annales scientifiques de l’Universit´ e de Clermont. Math´ ematiques, 8(2):7–13, 1962. URL https://www.numdam.org/item/ASCFM_1962__8_2_7_0/
1962
-
[10]
G. de B. Robinson. On the representations of the symmetric gro up. American Journal of Mathematics , 60(3):745–760, 1938. doi:10.2307/2371609
-
[11]
C. Schensted. Longest increasing and decreasing subsequenc es. Canadian Journal of Mathematics, 13:179–191, 1961. doi:10.4153/CJM-1961-015-3
-
[12]
A. M. Vershik and S. V. Kerov. Asymptotics of the Plancherel m easure of the symmetric group and the limiting form of Young tableaux. Soviet Mathematics Doklady, 18:527–531, 1977. the electronic journal of combinatorics 30 (2023), #P00 15
1977
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.