How to Store a Random Walk

Emanuele Viola; Huacheng Yu; Omri Weinstein

arxiv: 1907.10874 · v1 · pith:TATRQEAMnew · submitted 2019-07-25 · 💻 cs.DS · cs.IT· math.IT

How to Store a Random Walk

Emanuele Viola , Omri Weinstein , Huacheng Yu This is my paper

Pith reviewed 2026-05-24 16:25 UTC · model grok-4.3

classification 💻 cs.DS cs.ITmath.IT

keywords random walksuccinct data structureMarkov chaindata compressionword-RAMempirical entropydictionary problemonline data structure

0 comments

The pith

A random walk on any graph can be stored using log of the number of walks plus O(log n) bits with constant-time decoding of any position.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The work studies how to store a length-n random walk on an arbitrary graph G so that the entire sequence uses little space but any individual vertex can be recovered quickly. When the files are independent, constant extra bits suffice for constant-time access, yet correlations force weaker bounds in general. For walks the authors give a structure using lg κ(G,n) plus O(lg n) bits that still decodes in constant time on the word-RAM. A second structure matches the tighter pointwise entropy sum lg(deg(v_i)) up to O(1) extra bits, at the cost of O(lg n) decoding time, and any further improvement is shown to yield a better dictionary data structure. Both constructions also handle the online setting with constant updates and queries.

Core claim

There is a succinct data structure storing a random walk using lg₂ κ(G,n) + O(lg n) bits of space such that any vertex along the walk can be decoded in O(1) time on a word-RAM. For the harder task of matching the point-wise optimal space of the walk (the empirical entropy ∑_{i=1}^{n-1} lg(deg(v_i))), a data structure with O(1) extra bits exists at the price of O(lg n) decoding time; any improvement on this would lead to an improved solution on the Dictionary problem. All structures support the online version with constant update and query time.

What carries the argument

Succinct encoding of the path that compresses to the walk count κ(G,n) or empirical entropy while supporting constant-time position queries.

If this is right

Any length-n walk on G fits in lg κ(G,n) + O(lg n) bits with O(1) access to any vertex.
Matching the per-step empirical entropy costs only O(1) extra bits but requires O(lg n) decoding time.
Further reduction below O(1) extra bits for the entropy version would improve the classic Dictionary problem.
The same structures work in the online model with constant-time updates and queries.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach may extend to other sequentially correlated objects such as paths in geometric graphs or trajectories in sensor networks.
The explicit link to the Dictionary problem indicates that lower-bound techniques from that area could be adapted to prove tightness for the entropy version.
Practical implementations on graphs with small diameter or regular degree might achieve even smaller hidden constants than the O(lg n) term.
The online variant suggests possible use in streaming or dynamic settings where the walk is revealed incrementally.

Load-bearing premise

The graph G is known to the encoder in advance so its structure can be used during encoding to reach the stated space bound.

What would settle it

An explicit graph G and walk length n where every data structure that decodes any vertex in O(1) time must use more than lg κ(G,n) + c lg n bits for some constant c.

read the original abstract

Motivated by storage applications, we study the following data structure problem: An encoder wishes to store a collection of jointly-distributed files $\overline{X}:=(X_1,X_2,\ldots, X_n) \sim \mu$ which are \emph{correlated} ($H_\mu(\overline{X}) \ll \sum_i H_\mu(X_i)$), using as little (expected) memory as possible, such that each individual file $X_i$ can be recovered quickly with few (ideally constant) memory accesses. In the case of independent random files, a dramatic result by \Pat (FOCS'08) and subsequently by Dodis, \Pat and Thorup (STOC'10) shows that it is possible to store $\overline{X}$ using just a \emph{constant} number of extra bits beyond the information-theoretic minimum space, while at the same time decoding each $X_i$ in constant time. However, in the (realistic) case where the files are correlated, much weaker results are known, requiring at least $\Omega(n/poly\lg n)$ extra bits for constant decoding time, even for "simple" joint distributions $\mu$. We focus on the natural case of compressing\emph{Markov chains}, i.e., storing a length-$n$ random walk on any (possibly directed) graph $G$. Denoting by $\kappa(G,n)$ the number of length-$n$ walks on $G$, we show that there is a succinct data structure storing a random walk using $\lg_2 \kappa(G,n) + O(\lg n)$ bits of space, such that any vertex along the walk can be decoded in $O(1)$ time on a word-RAM. For the harder task of matching the \emph{point-wise} optimal space of the walk, i.e., the empirical entropy $\sum_{i=1}^{n-1} \lg (deg(v_i))$, we present a data structure with $O(1)$ extra bits at the price of $O(\lg n)$ decoding time, and show that any improvement on this would lead to an improved solution on the long-standing Dictionary problem. All of our data structures support the \emph{online} version of the problem with constant update and query time.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives O(log n) extra bits for constant-time access to random walks on arbitrary graphs plus a Dictionary-linked pointwise version, but the G-access model during encoding needs checking.

read the letter

The main result is a data structure that stores a length-n random walk on any graph using log2 of the number of walks plus O(log n) bits, with O(1) word-RAM time to recover any vertex. They also give a pointwise version using the empirical entropy plus O(1) bits at the cost of O(log n) decoding time, and show that shaving those extra bits would improve the Dictionary problem. Both support online updates and queries in constant time. This is new relative to the independent-files constant-extra-bits results and the general correlated lower bounds of Omega(n/polylog n). The Markov-chain focus lets them exploit the graph structure for these concrete tradeoffs, which prior general-distribution work did not achieve. The online feature is a practical plus. The soft spot is the model for G during encoding. The abstract states results for arbitrary G, yet achieving the stated space requires the encoder to access degrees and adjacency information efficiently; it is not clear from the high-level description whether G is preprocessed, given as an oracle, or assumed to have some succinct representation already. The Dictionary reduction also needs verification that no extra logarithmic factors appear in the word-RAM setting. The claims are internally consistent and the parameters are defined externally with no free fitting, but full proofs would be needed to confirm the reduction is tight. This paper is for people working on succinct data structures and compression with fast access. A reader focused on space-time tradeoffs for correlated sources would find the specific bounds and the Dictionary connection useful. It deserves a serious referee because it targets a stated gap with plausible new upper bounds and a conditional hardness link, even if the model details require clarification.

Referee Report

2 major / 2 minor

Summary. The paper studies succinct data structures for storing a length-n random walk on an arbitrary (possibly directed) graph G. It claims a structure using lg2 κ(G,n) + O(lg n) bits of space with O(1) word-RAM decoding time for any vertex along the walk, plus a pointwise version matching the empirical entropy sum lg(deg(vi)) with only O(1) extra bits at the cost of O(lg n) decoding time; any improvement on the latter would improve the Dictionary problem. The structures also support the online setting with constant-time updates and queries.

Significance. If the claims hold, the results substantially improve the state of the art for compressing correlated Markovian data with fast access, reducing extra space from the prior Ω(n/poly lg n) to O(lg n) or O(1) while retaining fast decoding; the explicit reduction linking the pointwise bound to the Dictionary problem is a notable strength, as is the online support. The results apply to arbitrary G, which increases their scope.

major comments (2)

[Abstract] Abstract: the headline claim of lg2 κ(G,n) + O(lg n) bits with O(1) decoding for arbitrary G is load-bearing on the encoder having efficient access to G (degrees and adjacency information) during encoding; the manuscript must explicitly define the input model for G (preprocessed, oracle, or explicit adjacency list) because the abstract states the result without this detail and the construction implicitly relies on it.
[Abstract] The reduction transferring Dictionary lower bounds to the pointwise empirical-entropy case (mentioned in the abstract) is central to the optimality claim; the paper must specify the exact parameter mapping and confirm that no extra logarithmic factors or word-RAM model mismatches are introduced, as any such gap would weaken the 'any improvement would lead to an improved Dictionary solution' statement.

minor comments (2)

[Abstract] Notation: lg2 is used once in the abstract while lg appears elsewhere; standardize the base-2 logarithm notation throughout.
The online version is asserted to have constant update and query time, but the high-level description would benefit from a brief pointer to the section containing the update mechanism.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive suggestions on clarifying the abstract. We address each major comment below.

read point-by-point responses

Referee: [Abstract] Abstract: the headline claim of lg2 κ(G,n) + O(lg n) bits with O(1) decoding for arbitrary G is load-bearing on the encoder having efficient access to G (degrees and adjacency information) during encoding; the manuscript must explicitly define the input model for G (preprocessed, oracle, or explicit adjacency list) because the abstract states the result without this detail and the construction implicitly relies on it.

Authors: We agree that the input model for G must be stated explicitly. Our construction assumes G is provided explicitly (as an adjacency list) or via an efficient oracle allowing constant-time degree and neighbor queries during encoding; this is the standard model for graph data structures and is implicit in the full construction but omitted from the abstract. We will revise the abstract and introduction to define this model clearly. revision: yes
Referee: [Abstract] The reduction transferring Dictionary lower bounds to the pointwise empirical-entropy case (mentioned in the abstract) is central to the optimality claim; the paper must specify the exact parameter mapping and confirm that no extra logarithmic factors or word-RAM model mismatches are introduced, as any such gap would weaken the 'any improvement would lead to an improved Dictionary solution' statement.

Authors: The reduction, including the exact parameter mapping (universe size n maps to walk length, dictionary space to the O(1) overhead) and confirmation of word-RAM compatibility with no extra logarithmic factors, is fully detailed in the body of the paper. The abstract statement is therefore accurate, but to improve clarity we will add a brief parenthetical reference to the mapping in the abstract. revision: yes

Circularity Check

0 steps flagged

No circularity: bounds stated directly in external information measures; Dictionary reduction is one-way implication

full rationale

The claimed space lg2 κ(G,n) + O(lg n) and pointwise empirical entropy + O(1) are defined externally to the construction; no equation reduces the output to a fitted parameter or self-citation. The statement that improvement would improve the Dictionary problem is a reduction from the data-structure question to a known open problem, not a self-referential loop. All cited prior work (Pat FOCS'08, Dodis-Pat-Thorup STOC'10) is by disjoint authors and concerns the independent case. The derivation is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The results rest on the standard word-RAM model and the definition of random walks; no free parameters, new entities, or ad-hoc axioms are introduced in the stated claims.

axioms (2)

domain assumption Word-RAM model with word size Theta(log n) bits
Used to define O(1) and O(log n) time bounds.
domain assumption Graph G is fixed and its local structure (degrees, adjacency) is accessible during encoding
Required to achieve the space bound relative to κ(G,n).

pith-pipeline@v0.9.0 · 5966 in / 1378 out tokens · 33471 ms · 2026-05-24T16:25:00.869200+00:00 · methodology

How to Store a Random Walk

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)