How to Store a Random Walk
Pith reviewed 2026-05-24 16:25 UTC · model grok-4.3
The pith
A random walk on any graph can be stored using log of the number of walks plus O(log n) bits with constant-time decoding of any position.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
There is a succinct data structure storing a random walk using lg₂ κ(G,n) + O(lg n) bits of space such that any vertex along the walk can be decoded in O(1) time on a word-RAM. For the harder task of matching the point-wise optimal space of the walk (the empirical entropy ∑_{i=1}^{n-1} lg(deg(v_i))), a data structure with O(1) extra bits exists at the price of O(lg n) decoding time; any improvement on this would lead to an improved solution on the Dictionary problem. All structures support the online version with constant update and query time.
What carries the argument
Succinct encoding of the path that compresses to the walk count κ(G,n) or empirical entropy while supporting constant-time position queries.
If this is right
- Any length-n walk on G fits in lg κ(G,n) + O(lg n) bits with O(1) access to any vertex.
- Matching the per-step empirical entropy costs only O(1) extra bits but requires O(lg n) decoding time.
- Further reduction below O(1) extra bits for the entropy version would improve the classic Dictionary problem.
- The same structures work in the online model with constant-time updates and queries.
Where Pith is reading between the lines
- The approach may extend to other sequentially correlated objects such as paths in geometric graphs or trajectories in sensor networks.
- The explicit link to the Dictionary problem indicates that lower-bound techniques from that area could be adapted to prove tightness for the entropy version.
- Practical implementations on graphs with small diameter or regular degree might achieve even smaller hidden constants than the O(lg n) term.
- The online variant suggests possible use in streaming or dynamic settings where the walk is revealed incrementally.
Load-bearing premise
The graph G is known to the encoder in advance so its structure can be used during encoding to reach the stated space bound.
What would settle it
An explicit graph G and walk length n where every data structure that decodes any vertex in O(1) time must use more than lg κ(G,n) + c lg n bits for some constant c.
read the original abstract
Motivated by storage applications, we study the following data structure problem: An encoder wishes to store a collection of jointly-distributed files $\overline{X}:=(X_1,X_2,\ldots, X_n) \sim \mu$ which are \emph{correlated} ($H_\mu(\overline{X}) \ll \sum_i H_\mu(X_i)$), using as little (expected) memory as possible, such that each individual file $X_i$ can be recovered quickly with few (ideally constant) memory accesses. In the case of independent random files, a dramatic result by \Pat (FOCS'08) and subsequently by Dodis, \Pat and Thorup (STOC'10) shows that it is possible to store $\overline{X}$ using just a \emph{constant} number of extra bits beyond the information-theoretic minimum space, while at the same time decoding each $X_i$ in constant time. However, in the (realistic) case where the files are correlated, much weaker results are known, requiring at least $\Omega(n/poly\lg n)$ extra bits for constant decoding time, even for "simple" joint distributions $\mu$. We focus on the natural case of compressing\emph{Markov chains}, i.e., storing a length-$n$ random walk on any (possibly directed) graph $G$. Denoting by $\kappa(G,n)$ the number of length-$n$ walks on $G$, we show that there is a succinct data structure storing a random walk using $\lg_2 \kappa(G,n) + O(\lg n)$ bits of space, such that any vertex along the walk can be decoded in $O(1)$ time on a word-RAM. For the harder task of matching the \emph{point-wise} optimal space of the walk, i.e., the empirical entropy $\sum_{i=1}^{n-1} \lg (deg(v_i))$, we present a data structure with $O(1)$ extra bits at the price of $O(\lg n)$ decoding time, and show that any improvement on this would lead to an improved solution on the long-standing Dictionary problem. All of our data structures support the \emph{online} version of the problem with constant update and query time.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper studies succinct data structures for storing a length-n random walk on an arbitrary (possibly directed) graph G. It claims a structure using lg2 κ(G,n) + O(lg n) bits of space with O(1) word-RAM decoding time for any vertex along the walk, plus a pointwise version matching the empirical entropy sum lg(deg(vi)) with only O(1) extra bits at the cost of O(lg n) decoding time; any improvement on the latter would improve the Dictionary problem. The structures also support the online setting with constant-time updates and queries.
Significance. If the claims hold, the results substantially improve the state of the art for compressing correlated Markovian data with fast access, reducing extra space from the prior Ω(n/poly lg n) to O(lg n) or O(1) while retaining fast decoding; the explicit reduction linking the pointwise bound to the Dictionary problem is a notable strength, as is the online support. The results apply to arbitrary G, which increases their scope.
major comments (2)
- [Abstract] Abstract: the headline claim of lg2 κ(G,n) + O(lg n) bits with O(1) decoding for arbitrary G is load-bearing on the encoder having efficient access to G (degrees and adjacency information) during encoding; the manuscript must explicitly define the input model for G (preprocessed, oracle, or explicit adjacency list) because the abstract states the result without this detail and the construction implicitly relies on it.
- [Abstract] The reduction transferring Dictionary lower bounds to the pointwise empirical-entropy case (mentioned in the abstract) is central to the optimality claim; the paper must specify the exact parameter mapping and confirm that no extra logarithmic factors or word-RAM model mismatches are introduced, as any such gap would weaken the 'any improvement would lead to an improved Dictionary solution' statement.
minor comments (2)
- [Abstract] Notation: lg2 is used once in the abstract while lg appears elsewhere; standardize the base-2 logarithm notation throughout.
- The online version is asserted to have constant update and query time, but the high-level description would benefit from a brief pointer to the section containing the update mechanism.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive suggestions on clarifying the abstract. We address each major comment below.
read point-by-point responses
-
Referee: [Abstract] Abstract: the headline claim of lg2 κ(G,n) + O(lg n) bits with O(1) decoding for arbitrary G is load-bearing on the encoder having efficient access to G (degrees and adjacency information) during encoding; the manuscript must explicitly define the input model for G (preprocessed, oracle, or explicit adjacency list) because the abstract states the result without this detail and the construction implicitly relies on it.
Authors: We agree that the input model for G must be stated explicitly. Our construction assumes G is provided explicitly (as an adjacency list) or via an efficient oracle allowing constant-time degree and neighbor queries during encoding; this is the standard model for graph data structures and is implicit in the full construction but omitted from the abstract. We will revise the abstract and introduction to define this model clearly. revision: yes
-
Referee: [Abstract] The reduction transferring Dictionary lower bounds to the pointwise empirical-entropy case (mentioned in the abstract) is central to the optimality claim; the paper must specify the exact parameter mapping and confirm that no extra logarithmic factors or word-RAM model mismatches are introduced, as any such gap would weaken the 'any improvement would lead to an improved Dictionary solution' statement.
Authors: The reduction, including the exact parameter mapping (universe size n maps to walk length, dictionary space to the O(1) overhead) and confirmation of word-RAM compatibility with no extra logarithmic factors, is fully detailed in the body of the paper. The abstract statement is therefore accurate, but to improve clarity we will add a brief parenthetical reference to the mapping in the abstract. revision: yes
Circularity Check
No circularity: bounds stated directly in external information measures; Dictionary reduction is one-way implication
full rationale
The claimed space lg2 κ(G,n) + O(lg n) and pointwise empirical entropy + O(1) are defined externally to the construction; no equation reduces the output to a fitted parameter or self-citation. The statement that improvement would improve the Dictionary problem is a reduction from the data-structure question to a known open problem, not a self-referential loop. All cited prior work (Pat FOCS'08, Dodis-Pat-Thorup STOC'10) is by disjoint authors and concerns the independent case. The derivation is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Word-RAM model with word size Theta(log n) bits
- domain assumption Graph G is fixed and its local structure (degrees, adjacency) is accessible during encoding
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.