Guessing Individual Sequences: Generating Randomized Guesses Using Finite-State Machines
Pith reviewed 2026-05-25 15:31 UTC · model grok-4.3
The pith
The finite-state guessing exponent of any sequence equals its Lempel-Ziv compressibility.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For any individual sequence x^n the finite-state guessing exponent equals the finite-state compressibility of x^n. This exponent is achieved by the decoder of a suitably modified LZ78 algorithm that is driven by a stream of purely random bits. The same statement holds when the machine is also given a side-information sequence y^n.
What carries the argument
Finite-state guessing exponent, defined as the asymptotic normalized logarithm of the minimum moment of the number of guesses until success, shown to equal the Lempel-Ziv finite-state compressibility.
If this is right
- Guessing performance for any individual sequence is governed by its Lempel-Ziv compressibility.
- The modified LZ78 decoder fed random bits attains the optimal guessing exponent for every sequence.
- The equality between guessing exponent and compressibility extends to the presence of a side-information sequence.
- Finite-state machines suffice to achieve the optimal individual-sequence guessing rate.
Where Pith is reading between the lines
- The result supplies a direct operational meaning to Lempel-Ziv compressibility in a guessing setting.
- It indicates that existing compression algorithms can be reused, with only minor modification, for randomized guessing tasks.
- The framework may be testable by comparing empirical guessing moments against computed compressibility rates on long periodic or Markovian sequences.
Load-bearing premise
Guessing is performed by a finite-state machine that produces randomized guesses sequentially from a stream of purely random bits.
What would settle it
A concrete individual sequence for which the guessing exponent realized by the modified LZ78 decoder differs from the sequence's Lempel-Ziv compressibility would refute the claimed equality.
Figures
read the original abstract
Motivated by earlier results on universal randomized guessing, we consider an individual-sequence approach to the guessing problem: in this setting, the goal is to guess a secret, individual (deterministic) vector $x^n=(x_1,\ldots,x_n)$, by using a finite-state machine that sequentially generates randomized guesses from a stream of purely random bits. We define the finite-state guessing exponent as the asymptotic normalized logarithm of the minimum achievable moment of the number of randomized guesses, generated by any finite-state machine, until $x^n$ is guessed successfully. We show that the finite-state guessing exponent of any sequence is intimately related to its finite-state compressibility (due to Lempel and Ziv), and it is asymptotically achieved by the decoder of (a certain modified version of) the 1978 Lempel-Ziv data compression algorithm (a.k.a. the LZ78 algorithm), fed by purely random bits. The results are also extended to the case where the guessing machine has access to a side information sequence, $y^n=(y_1,\ldots,y_n)$, which is also an individual sequence.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript formulates the guessing problem for an arbitrary individual sequence x^n using a finite-state machine that consumes a stream of i.i.d. fair random bits and produces successive guesses until x^n is hit. The finite-state guessing exponent is defined as the lim sup (1/n) log of the smallest achievable E[G^rho] (rho>0) over all finite-state guessing machines. The central claim is that this exponent equals the Lempel-Ziv finite-state compressibility rate of x^n for every sequence and is asymptotically attained by the decoder of a suitably modified LZ78 algorithm driven by random bits. The result is extended to the case with an individual side-information sequence y^n.
Significance. If the claimed equality holds, the work supplies a parameter-free, sequence-by-sequence link between the guessing exponent under finite-state randomization and the classical LZ compressibility measure. The constructive achievability result via a modified LZ78 decoder is a concrete strength: the same sequential parsing that achieves optimal compression also yields optimal guessing when the dictionary is driven by random bits. This extends earlier probabilistic guessing results to the individual-sequence setting without introducing fitted parameters or additional assumptions on state cardinality.
minor comments (3)
- [Abstract] The abstract states that the guessing exponent 'is intimately related' to LZ compressibility; the introduction or Section II should state the precise equality (guessing exponent = LZ rate) at the first opportunity.
- [Introduction] Notation for the moment order rho, the normalization factor 1/n, and the precise definition of the finite-state machine (input alphabet, output mapping, state transition) should be fixed before the main theorem statements.
- The description of the 'modified version' of LZ78 should include an explicit statement of which component (dictionary construction, parsing rule, or output mapping) is altered to accept random bits.
Simulated Author's Rebuttal
We thank the referee for the positive review, the insightful summary of our contributions, and the recommendation to accept the manuscript.
Circularity Check
No significant circularity identified
full rationale
The paper defines the finite-state guessing exponent for individual sequences and relates it to the established Lempel-Ziv finite-state compressibility via standard relations between sequential parsing, dictionary growth, and moment-generating functions on the number of trials. Both the converse (FS machines cannot beat the LZ rate) and direct part (achievability via modified LZ78 decoder) follow from these external, pre-existing arguments without any reduction to fitted parameters, self-definitional loops, or load-bearing self-citations. The side-information case uses the same joint parsing approach. The derivation is self-contained against the external LZ benchmark and does not rename or smuggle in results via author citations.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Standard definitions and asymptotic properties of finite-state machines and normalized logarithmic exponents from information theory and automata theory
Reference graph
Works this paper leans on
-
[1]
An inequality on guessing and its applicatio n to sequential decoding,
E. Arikan, “An inequality on guessing and its applicatio n to sequential decoding,” IEEE Trans. Inform. Theory , vol. IT–42, no. 1, pp. 99–105, January 1996
work page 1996
-
[2]
T. M. Cover and J. A. Thomas, Elements of Information Theory , Second Edition, John Wiley & Sons, Hoboken, New Jersey, U.S.A., 2006
work page 2006
-
[3]
Gambling using a finite–state machine,
M. Feder, “Gambling using a finite–state machine,” IEEE Trans. on Inform. Theory , vol. 37, no. 5, pp. 1459–1465, September 1991
work page 1991
-
[4]
Universal predictio n of individual sequences,
M. Feder, N. Merhav, and M. Gutman, “Universal predictio n of individual sequences,” IEEE Trans. Inform. Theory , vol. 38, no. 4, pp. 1258–1270, July 1992
work page 1992
-
[5]
On the complexity of finite sequence s,
A. Lempel and J. Ziv, “On the complexity of finite sequence s,” IEEE Trans. Inform. Theory , vol. IT–22, no. 1, pp. 75–81, January 1976
work page 1976
-
[6]
J. L. Massey, “Guessing and entropy,” Proc. IEEE International Symposium on Information Theory (ISIT ’94) , p. 204, 1994
work page 1994
-
[7]
Universal coding with minimum probability o f code word length overflow,
N. Merhav, “Universal coding with minimum probability o f code word length overflow,” IEEE Trans. Inform. Theory , vol. 37, no. 3, pp. 556–563, May 1991
work page 1991
-
[8]
Perfectly secure encryption of individual s equences,
N. Merhav, “Perfectly secure encryption of individual s equences,” IEEE Trans. Inform. The- ory, vol. 59, no. 3, pp. 1302–1310, March 2013
work page 2013
-
[9]
Universal detection of messages via finite–s tate channels,
N. Merhav, “Universal detection of messages via finite–s tate channels,” IEEE Trans. Inform. Theory, vol. 46, no. 6, pp. 2242–2246, September 2000
work page 2000
-
[10]
Universal randomized guessing with application to asynchronous decentralized brute–force attacks,
N. Merhav and A. Cohen, “Universal randomized guessing with application to asynchronous decentralized brute–force attacks,” to appear in IEEE Trans. Inform. Theory , 2019
work page 2019
-
[11]
Why botnets work: distributed brute–force attacks need no synchronization,
S. Salamatian, W. Huleihel, A. Beirami, A. Cohen, and M. Médard, “Why botnets work: distributed brute–force attacks need no synchronization, ” to appear in IEEE Trans. Inform. Forensics and Security , September 2019. 22
work page 2019
-
[12]
T. Uyematsu and S. Kuzuoka, “Conditional Lempel-Ziv co mplexity and its application to source coding theorem with side information,” IEICE Trans. on Fundamentals of Electronics, Communications and Computer Sciences , vol. 86, no. 10, pp. 2615–2617, October 2003
work page 2003
-
[13]
Universal decoding for finite–state channels,
J. Ziv, “Universal decoding for finite–state channels, ” IEEE Trans. Inform. Theory , vol. IT– 31, no. 4, pp. 453–460, July 1985
work page 1985
-
[14]
Compression of individual sequen ces via variable-rate coding,
J. Ziv and A. Lempel, “Compression of individual sequen ces via variable-rate coding,” IEEE Trans. Inform. Theory , vol. IT–24, no. 5, pp. 530–536, September 1978. 23
work page 1978
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.