Tight simulation of a distribution using conditional samples
Pith reviewed 2026-05-19 08:08 UTC · model grok-4.3
The pith
An algorithm simulates any distribution from prefix conditional samples in O(log^{2} N / ε^{2}) queries while achieving O(ε^{2}) closeness in Kullback-Leibler divergence.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We present an algorithm for simulating a distribution using prefix conditional samples with sample complexity O(log^{2} N / ε^{2}) per query. Our simulating distribution is O(ε^{2})-close to the input distribution with respect to the Kullback-Leibler divergence. We show that our algorithm is tight because any algorithm estimating the mass of individual elements within (1 ± ε)-multiplicative error must make Ω(log^{2} N / ε^{2}) prefix conditional samples per element.
What carries the argument
Prefix conditional sampling oracle that reveals probability masses on prefixes, intervals, or subcubes to support recursive sampling and simulation.
Load-bearing premise
The input distribution admits efficient access via prefix conditional samples or prefix-compatible models such as intervals or subcubes without additional overhead.
What would settle it
An algorithm that simulates the distribution to the stated KL closeness using o(log^{2} N / ε^{2}) prefix conditional samples per query on some family of distributions over [N], or an explicit distribution where individual masses can be estimated to (1 ± ε) error with fewer than Ω(log^{2} N / ε^{2}) samples per element.
read the original abstract
We present an algorithm for simulating a distribution using prefix conditional samples (Adar, Fischer and Levi, 2024), as well as ``prefix-compatible'' conditional models such as the interval model (Cannone, Ron and Servedio, 2015) and the subcube model (CRS15, Bhattacharyya and Chakraborty, 2018). The sample complexity is $O(\log^2 N / \varepsilon^2)$ prefix conditional samples per query, which improves on the previously known $\tilde{O}(\log^3 N / \varepsilon^2)$ (Kumar, Meel and Pote, 2025). Moreover, our simulating distribution is $O(\varepsilon^2)$-close to the input distribution with respect to the Kullback-Leibler divergence, which is stricter than the usual guarantee of being $O(\varepsilon)$-close with respect to the total-variation distance. We show that our algorithm is tight with respect to the highly-related task of estimation: every algorithm that is able to estimate the mass of individual elements within $(1 \pm \varepsilon)$-multiplicative error must make $\Omega(\log^2 N / \varepsilon^2)$ prefix conditional samples per element.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents an algorithm for simulating an unknown distribution over [N] using prefix conditional samples (and prefix-compatible models such as interval and subcube oracles). The algorithm uses O(log² N / ε²) samples per query and produces an output distribution that is O(ε²)-close to the input in Kullback-Leibler divergence, improving the prior Õ(log³ N / ε²) bound. The paper also proves an Ω(log² N / ε²) lower bound on the number of prefix conditional samples needed to estimate individual element masses to multiplicative (1 ± ε) accuracy.
Significance. If the proofs hold, the result is a meaningful improvement in the conditional-sampling literature: a quadratic rather than cubic logarithmic dependence together with a stronger KL guarantee than the usual total-variation bound. The explicit connection to the related multiplicative estimation task and the matching lower bound supply evidence of tightness, which is a positive feature of the work.
major comments (1)
- [Abstract and §4] Abstract and §4 (lower-bound section): the manuscript establishes the Ω(log² N / ε²) bound only for the estimation task and states that the simulation algorithm is 'tight with respect to' that task. Because the central claim concerns simulation, the paper should either supply a direct simulation lower bound or explicitly argue why the estimation lower bound implies optimality for simulation (e.g., via a reduction that preserves the sample complexity up to constants).
minor comments (2)
- [Introduction] Introduction: the prefix conditional sample model and the precise definition of 'prefix-compatible' models should be restated formally (with notation) before the algorithm is described, rather than relying solely on the citations to Adar et al. (2024) and Cannone et al. (2015).
- [Theorem 1] Theorem statements: verify that the O(ε²) KL bound is stated with explicit dependence on N and ε (or confirm it is independent of N) so that readers can compare it directly with the total-variation guarantees in prior work.
Simulated Author's Rebuttal
We thank the referee for the careful reading, positive assessment of the significance, and recommendation for minor revision. We address the major comment below.
read point-by-point responses
-
Referee: [Abstract and §4] Abstract and §4 (lower-bound section): the manuscript establishes the Ω(log² N / ε²) bound only for the estimation task and states that the simulation algorithm is 'tight with respect to' that task. Because the central claim concerns simulation, the paper should either supply a direct simulation lower bound or explicitly argue why the estimation lower bound implies optimality for simulation (e.g., via a reduction that preserves the sample complexity up to constants).
Authors: We agree that the tightness claim for simulation should be supported by an explicit connection rather than left implicit. In the revised manuscript we will add a short subsection (or paragraph in §4) providing a reduction: any algorithm that simulates the unknown distribution to O(ε²) KL divergence using prefix conditional samples can be turned into an algorithm that estimates the mass of any fixed element to multiplicative (1 ± ε) accuracy. The reduction works by drawing O(1/ε²) independent samples from the simulator and using their empirical frequencies; the KL guarantee ensures that the estimated mass is multiplicatively close to the true mass (up to constants that can be absorbed into the O-notation). Consequently the Ω(log² N / ε²) lower bound already proved for estimation immediately yields the same lower bound for simulation, establishing that our O(log² N / ε²) simulation algorithm is optimal for the central task as well. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper presents an independent algorithmic construction achieving O(log² N / ε²) prefix conditional samples for simulation with an O(ε²) KL-divergence guarantee, improving on a prior Õ(log³ N / ε²) bound. The Ω(log² N / ε²) lower bound applies only to the separate multiplicative mass estimation task rather than simulation itself. The access model is referenced via citation but does not reduce the new algorithm or its analysis to a self-definition, fitted input renamed as prediction, or load-bearing self-citation chain; the derivation remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The domain admits a prefix structure allowing conditional samples on initial segments.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We present an algorithm for simulating a distribution using prefix conditional samples ... O(log² N / ε²) ... O(ε²)-close ... KL divergence
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanembed_add unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
repeated chain rule for DKL ... DKL(μ ∥ τ) = Σ ... marginals
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.