The complexity of finite smooth words over binary alphabets
Pith reviewed 2026-05-15 13:42 UTC · model grok-4.3
The pith
Finite smooth words over any binary alphabet are exactly the factors of infinite smooth words.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper proves that the language of f-smooth words coincides with the set of all factors of smooth words. Over even alphabets the complexity function p(n) satisfies p(n) = Theta(n^alpha) for the explicit exponent alpha = log(a+b) / log((a+b)/2). For arbitrary binary alphabets the same lower bound is established, while the known upper bound is improved for odd alphabets, bringing both sides closer to the conjectured Theta(n^alpha) growth.
What carries the argument
The derivability map that replaces each run in a word by the integer equal to its length, iterated to define both infinite smooth words and their finite f-smooth approximations.
If this is right
- The language of all smooth words can now be studied entirely through the finite f-smooth words they contain.
- Over even alphabets the number of distinct blocks of length n is known to grow exactly like n raised to the explicit power log(a+b)/log((a+b)/2).
- A matching lower bound on complexity now holds uniformly for every binary alphabet.
- The gap between lower and upper bounds on complexity has been reduced for odd alphabets.
Where Pith is reading between the lines
- Enumeration algorithms for f-smooth words can now be used to generate candidate factors of smooth words without separately checking infinite extendability.
- The same factor-closed property may allow transfer of complexity results from finite to infinite smooth words in both directions.
- The improved bounds suggest that computational checks of the exact exponent for small odd alphabets are now within reach.
Load-bearing premise
The derivability process remains well-defined for every finite and infinite word over the alphabet without any hidden constraints that would break the exact correspondence between f-smooth words and factors of smooth words.
What would settle it
Discovery of a single finite word that is f-smooth yet fails to appear as a consecutive block inside any infinite smooth word, or an explicit computation of the complexity function for large n that falls outside the proven lower and improved upper bounds.
Figures
read the original abstract
Smooth words over an alphabet of non-negative integers $\{a,b\}$ are infinite words that are infinitely derivable, the most famous example being the Oldenburger-Kolakoski word over $\{1,2\}$. The main way to study their language is to consider a finite version of smooth words that we call f-smooth words. In this paper we prove that the f-smooth words are exactly the factors of smooth words, and we make progress towards the conjecture of Sing that the complexity of f-smooth words over $\{a,b\}$ grows like $\Theta\left(n^{\log(a+b)/\log((a+b)/2)}\right)$: we prove it over even alphabets, we prove the lower bound over any binary alphabet and we improve the known upper bound over odd alphabets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proves that f-smooth words (finite versions of infinitely derivable smooth words) over a binary alphabet are exactly the factors of (infinite) smooth words. It further establishes the conjectured complexity growth Θ(n^{log(a+b)/log((a+b)/2)}) for even alphabets, proves the matching lower bound for arbitrary binary alphabets, and improves the known upper bound for odd alphabets.
Significance. The characterization supplies a foundational equivalence for studying the language of smooth words such as the Kolakoski word. The complexity results constitute concrete progress on Sing's conjecture, with the exact asymptotic determination for even alphabets being a strong, parameter-free confirmation of the predicted exponent.
minor comments (2)
- [§1] §1: the definition of the derivability process for finite words could include an explicit statement that it terminates after finitely many steps without altering the factor relation.
- [§4] The proof of the even-alphabet case (presumably in §4) would benefit from a short remark confirming that the exponent is independent of the specific even pair {a,b}.
Simulated Author's Rebuttal
We thank the referee for their positive report, accurate summary of our results on the characterization of f-smooth words and progress toward Sing's conjecture, and the recommendation of minor revision. We appreciate the recognition of the foundational equivalence and the concrete advances on the complexity exponent.
Circularity Check
No significant circularity; derivation is self-contained
full rationale
The paper defines smooth words as infinitely derivable infinite words over non-negative integers and introduces f-smooth words as the finite analogue. The central result proves by direct combinatorial analysis that f-smooth words coincide exactly with the factors of smooth words; this equivalence is established from the definition of the derivability process without any reduction to fitted parameters, self-referential definitions, or load-bearing self-citations. The complexity bounds (exact growth for even alphabets, lower bound for any binary alphabet, improved upper bound for odd alphabets) are then derived from this equivalence using standard counting arguments on word factors and derivability depth. No step renames a known result, imports a uniqueness theorem from the authors' prior work, or treats a fitted quantity as a prediction. The argument remains independent of any external fitted data or circular self-reference.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Smooth words are infinite words that remain infinitely derivable under the block-replacement rule
Forward citations
Cited by 1 Pith paper
-
Frequency of patterns in smooth sequences over the alphabet {1, 3}
Asymptotic frequencies of any finite pattern in smooth sequences over {1,3} are well-defined and depend on the type sequence of derivatives, with all such subshifts uniquely ergodic.
Reference graph
Works this paper leans on
-
[1]
M. Baake and B. Sing. Kolakoski-(3,1) is a (deformed) model set.Canadian Mathematical Bulletin, 47(2):168–190, 2004.doi:10.4153/CMB-2004-018-6
-
[2]
S. Brlek, D. Jamet, and G. Paquin. Smooth words on 2-letter alphabets having same parity.Theoretical Computer Science, 393(1):166–181, 2008. doi:10.1016/j.tcs.2007.11.019
-
[3]
S. Brlek, G. Melançon, and G. Paquin. Properties of extremal infinite smooth words.Discrete Mathematics and Theoretical Computer Science, 9, 11 2007.doi:10.46298/dmtcs.412
-
[4]
A. Carpi. On repeated factors inC ∞-words.Information Processing Letters, 52(6):289–294, 1994.doi:10.1016/0020-0190(94)00162-6
-
[5]
J. Cassaigne. Complexité et facteurs spéciaux.Bulletin of the Belgian Mathe- matical Society - Simon Stevin, 4, 01 1997.doi:10.36045/bbms/1105730624
-
[6]
F. M. Dekking. Regularity and irregularity of sequences generated by automata.Seminaire de Théorie des Nombres de Bordeaux, 9:1–10, 1979. URL: http://eudml.org/doc/182065
work page 1979
- [7]
-
[8]
F. M. Dekking. What is the long range order in the Kolakoski sequence ? 08 2001.doi:10.1007/978-94-015-8784-6_5
- [9]
-
[10]
Y. B. Huang. The complexity of smooth words on 2-letter alphabets.The- oretical Computer Science, 412(45):6327–6339, 2011.doi:10.1016/j.tcs. 2011.07.002
-
[11]
Y. B. Huang. The powers of smooth words over arbitrary 2-letter alphabets. 2011.arXiv:0904.0562
work page internal anchor Pith review Pith/arXiv arXiv 2011
-
[12]
Y. B. Huang and W. D. Weakley. A note on the complexity ofC ∞- words.Theor. Comput. Sci., 411(40-42):3731–3735, 2010.doi:10.1016/ J.TCS.2010.06.024
work page 2010
-
[13]
The on-line encyclopedia of integer sequences
OEIS Foundation Inc. The on-line encyclopedia of integer sequences. 2026. URL:https://oeis.org
work page 2026
-
[14]
Frequency of patterns in smooth sequences over the alphabet {1, 3}
D. Jamet, I. Marcovici, T. de la Rue, and L. Poirier. Frequency of patterns in smooth sequences over the alphabet{1,3}. 2026.arXiv:2604.11387. 23
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[15]
M. S. Keane. Ergodic theory and subshifts of finite type. pages 35–70, 1991
work page 1991
-
[16]
W. Kolakoski. Self generating runs, problem 5304.The American Mathe- matical Monthly, 72:674, 1965.doi:10.2307/2313883
-
[17]
W. Kolakoski and N. Üçoluk. Solution of advanced problem 5304.The American Mathematical Monthly, 73:681–682, 1966
work page 1966
-
[18]
J. Nilsson. Letter frequencies in the Kolakoski sequence.Acta Physica Polonica A, 126:549–552, 2014.doi:10.12693/APhysPolA.126.549
-
[19]
R. Oldenburger. Exponent trajectories in symbolic dynamics.Transactions of the American Mathematical Society, 46(3):453–466, 1939.doi:10/2307/ 1989933
work page 1939
-
[20]
M. Rao. Trucs et bidules sur la séquence de Kolakoski. 2012. URL: https://www.arthy.org/kola/kola.php
work page 2012
-
[21]
B. Sing. Spektrale eigenschaften der Kolakoski-sequenzen.Diploma thesis, Universität Tübingen, 2002
work page 2002
-
[22]
B. Sing. More Kolakoski sequences.Integers, 11B:Paper No. A14, 17, 2011. arXiv:1009.4061
work page internal anchor Pith review Pith/arXiv arXiv 2011
-
[23]
W. D. Weakley. On the number ofC∞-words of each length.J. Comb. Theory A, 51(1):55–62, 1989.doi:10.1016/0097-3165(89)90076-9. 24
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.