Empirical Measures and Strong Laws of Large Numbers in Categorical Probability
Pith reviewed 2026-05-22 22:24 UTC · model grok-4.3
The pith
Morphisms satisfying permutation invariance and empirical adequacy enable abstract proofs of de Finetti, Glivenko-Cantelli, and strong law theorems in categorical probability.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Given an empirical sampling morphism and a few other properties, we prove representability as well as abstract versions of the de Finetti theorem, the Glivenko--Cantelli theorem and the strong law of large numbers. We provide several concrete constructions of empirical sampling morphisms as partially defined Markov kernels on standard Borel spaces. Instantiating our abstract results then recovers the standard Glivenko--Cantelli theorem and the strong law of large numbers for random variables with finite first moment. Our work thus provides a joint proof of these two theorems in conjunction with the de Finetti theorem from first principles.
What carries the argument
The empirical sampling morphism, a partially defined map of type X to the power of natural numbers to X in a quasi-Markov category that obeys permutation invariance and empirical adequacy.
If this is right
- Abstract de Finetti theorem holds under the stated conditions on the morphism.
- Abstract Glivenko-Cantelli theorem follows directly from the axioms.
- Abstract strong law of large numbers holds for the same morphisms.
- The three theorems receive a joint proof from the same set of first principles.
- Concrete constructions on Borel spaces match the classical statements for finite first moment.
Where Pith is reading between the lines
- The same axioms might organize proofs of other limit theorems once suitable morphisms are identified in the same categories.
- Quasi-Markov categories could let the framework handle empirical measures that are only partially defined on non-standard spaces.
- The representability results may connect empirical sampling to other notions of conditional independence already studied in categorical probability.
Load-bearing premise
A morphism of type X to the power of natural numbers to X must satisfy permutation invariance and empirical adequacy in order to count as taking an infinite sequence as input and producing a sample from its empirical measure.
What would settle it
A morphism on standard Borel spaces that meets both permutation invariance and empirical adequacy yet fails to recover the classical Glivenko-Cantelli or strong law statements when the abstract theorems are applied.
read the original abstract
The Glivenko--Cantelli theorem is a uniform version of the strong law of large numbers. It states that for every IID sequence of random variables, the empirical measure converges to the underlying distribution (in the sense of uniform convergence of the CDF). In this work, we provide tools to study such limits of empirical measures in categorical probability. We propose two axioms, namely permutation invariance and empirical adequacy, that a morphism of type $X^{\mathbb{N}} \to X$ should satisfy to be interpretable as taking an infinite sequence as input and producing a sample from its empirical measure as output. Since not all sequences have a well-defined empirical measure, such \emph{empirical sampling morphisms} live in quasi-Markov categories, which, unlike Markov categories, allow for partial morphisms. Given an empirical sampling morphism and a few other properties, we prove representability as well as abstract versions of the de Finetti theorem, the Glivenko--Cantelli theorem and the strong law of large numbers. We provide several concrete constructions of empirical sampling morphisms as partially defined Markov kernels on standard Borel spaces. Instantiating our abstract results then recovers the standard Glivenko--Cantelli theorem and the strong law of large numbers for random variables with finite first moment. Our work thus provides a joint proof of these two theorems in conjunction with the de Finetti theorem from first principles.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper develops a framework in quasi-Markov categories for empirical measures by introducing empirical sampling morphisms of type X^ℕ → X that satisfy permutation invariance and empirical adequacy. It proves representability of such morphisms and derives abstract versions of de Finetti's theorem, the Glivenko-Cantelli theorem, and the strong law of large numbers. Concrete partially defined Markov kernels on standard Borel spaces are constructed to instantiate the abstract results and recover the classical theorems, yielding a joint proof from first principles.
Significance. If the derivations hold, the work supplies a unified categorical treatment of these limit theorems that handles partiality naturally. The explicit constructions of empirical sampling morphisms as kernels and their direct instantiation to recover the standard Glivenko-Cantelli and SLLN statements (for finite first moment) constitute a clear strength, as does the joint derivation with de Finetti from the two axioms. This advances categorical probability by providing falsifiable abstract statements that specialize correctly to classical probability.
minor comments (3)
- The introduction would benefit from a short paragraph situating the two new axioms against prior categorical treatments of exchangeability or laws of large numbers (e.g., references to existing work on Markov categories and de Finetti).
- [§2] Notation for partial morphisms and the precise domain of the empirical sampling morphism (sequences without well-defined empirical measures) could be illustrated with a small diagram or example in §2 to improve readability.
- [abstract SLLN statement] In the statement of the abstract SLLN, clarify whether the finite-first-moment hypothesis is inherited from the concrete kernels or is an additional assumption on the category; this affects how directly the classical result is recovered.
Simulated Author's Rebuttal
We thank the referee for their careful reading and positive evaluation of the manuscript. We are pleased that the significance of the unified categorical framework, the explicit constructions, and the joint derivation of de Finetti, Glivenko-Cantelli, and the SLLN were recognized. The recommendation for minor revision is noted; since no specific major comments were raised, we will address any minor editorial or presentational points in the revised version.
Circularity Check
No significant circularity; derivation self-contained from axioms
full rationale
The paper defines empirical sampling morphisms via two new axioms (permutation invariance and empirical adequacy) in quasi-Markov categories, then derives representability plus abstract de Finetti, Glivenko-Cantelli and SLLN statements directly from those axioms plus a short list of additional properties. Concrete kernels on Borel spaces are exhibited separately to instantiate the abstract results. No step reduces a claimed prediction or theorem to a fitted parameter, self-referential definition, or unverified self-citation chain; the axioms are chosen precisely to capture the target behavior and the proofs proceed from them without circular reduction.
Axiom & Free-Parameter Ledger
axioms (2)
- ad hoc to paper permutation invariance
- ad hoc to paper empirical adequacy
invented entities (1)
-
empirical sampling morphism
no independent evidence
Forward citations
Cited by 1 Pith paper
-
Approaching the Continuous from the Discrete: an Infinite Tensor Product Construction
A universal construction adjoins infinite tensor products to FinStoch to produce a category of locally constant Markov kernels on finite sets union the Cantor space, enabling algebraic reasoning about continuous proba...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.