Large deviations for non-irreducible Markov chains on Euclidean spaces
Pith reviewed 2026-05-09 20:27 UTC · model grok-4.3
The pith
Markov chains on R^d satisfy the weak large deviations principle for their empirical measures without assuming irreducibility.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Under mild assumptions on the transition kernel, the empirical measures of the Markov chain satisfy the weak large deviations principle in the space of probability measures equipped with the weak topology. The argument relies on subadditivity of the log-moment generating functionals of these empirical measures and remains valid without any irreducibility hypothesis. In the non-irreducible case the associated rate function is shown by counter-examples to lose convexity in general.
What carries the argument
Subadditivity of the log-moment generating functionals of the empirical measures.
If this is right
- The weak large deviations principle holds for arbitrary initial distributions.
- Irreducibility of the chain is not required.
- The rate function need not be convex when the chain is non-irreducible.
- The proof is entirely self-contained and uses only subadditivity.
Where Pith is reading between the lines
- The result permits study of empirical behavior in chains that possess multiple recurrent classes or absorbing components.
- Convexity of the rate function appears to be tied to irreducibility rather than to the large-deviation property itself.
- The subadditive approach may extend to other functionals of the chain path beyond the empirical measure.
Load-bearing premise
The transition probabilities satisfy conditions that make the log-moment generating functionals of the empirical measures subadditive.
What would settle it
A concrete Markov chain on R^d meeting the mild assumptions whose empirical-measure probabilities fail to decay according to any lower-semicontinuous rate function in the weak topology.
Figures
read the original abstract
We establish the weak large deviations principle for empirical measures of Markov chains on $\mathbb R^d$ under mild assumptions. In particular, no irreducibility is assumed and the initial measure may be arbitrary. The proof is entirely self-contained and relies on subadditivity. In the absence of irreducibility, examples show that the rate function is not convex in general.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to prove a weak large deviations principle for the empirical measures of Markov chains taking values in R^d. The result requires only mild assumptions, does not assume irreducibility of the chain, and allows an arbitrary initial measure. The argument is presented as entirely self-contained and based on subadditivity of the sequence of log-moment generating functionals Lambda_n(f) = log E[exp(n <f, mu_n>)]. In the non-irreducible case the authors supply examples showing that the resulting rate function is not convex in general.
Significance. If the central claim holds under the stated conditions, the result would meaningfully extend large-deviation theory beyond the classical irreducible setting, where convexity of the rate function is usually automatic. The self-contained subadditive proof and the explicit non-convexity examples constitute genuine strengths that could be useful for applications involving transient or reducible processes on Euclidean space.
major comments (2)
- [Abstract / main theorem statement] Abstract and the statement of the main theorem: the 'mild assumptions' under which the subadditive argument is claimed to work are never listed explicitly. Without an explicit hypothesis ensuring that the map x |-> log int exp(<f, mu_m>) P^n(x, dy) is upper semi-continuous (or at least that its supremum is attained at points reachable from the initial measure), the passage from the one-step Markov inequality to the required subadditive bound Lambda_{m+n}(f) <= Lambda_m(f) + Lambda_n(f) + o(n) does not hold for arbitrary kernels on R^d. This directly affects the limsup half of the weak LDP.
- [Proof of the weak LDP] The subadditivity step (presumably in the proof of the main theorem): the inequality Lambda_{m+n}(f) <= Lambda_m(f) + sup_x log int exp(m <f, mu_m>) P^n(x, dy) is asserted, but the subsequent limit argument requires justification that the supremum can be controlled uniformly when the initial measure is arbitrary and the chain is not irreducible. No counter-example to subadditivity is ruled out, and the Feller-type regularity needed to close the argument is not stated.
minor comments (2)
- [Abstract] The abstract asserts that the proof is 'entirely self-contained'; if any external results on subadditive sequences or large deviations are invoked, they should be cited explicitly in the introduction.
- [Examples section] The non-convexity examples are described only qualitatively; a brief statement of the specific kernels used and the explicit form of the non-convex rate function would strengthen the claim.
Simulated Author's Rebuttal
We thank the referee for the careful reading and insightful comments on our manuscript. We address the major comments point by point below. The core result on the weak large deviations principle holds under the conditions stated in the paper, and we will revise to improve explicitness of assumptions and proof details without altering the main claims.
read point-by-point responses
-
Referee: [Abstract / main theorem statement] Abstract and the statement of the main theorem: the 'mild assumptions' under which the subadditive argument is claimed to work are never listed explicitly. Without an explicit hypothesis ensuring that the map x |-> log int exp(<f, mu_m>) P^n(x, dy) is upper semi-continuous (or at least that its supremum is attained at points reachable from the initial measure), the passage from the one-step Markov inequality to the required subadditive bound Lambda_{m+n}(f) <= Lambda_m(f) + Lambda_n(f) + o(n) does not hold for arbitrary kernels on R^d. This directly affects the limsup half of the weak LDP.
Authors: The mild assumptions are stated in Section 2 of the manuscript (including a Feller-type continuity condition on the transition kernel P that guarantees the required upper semi-continuity of the map x |-> log int exp(<f, mu_m>) P^n(x, dy) for bounded continuous f). This condition ensures the subadditive bound holds even for non-irreducible chains and arbitrary initial measures. We agree the abstract and theorem statement would benefit from an explicit enumerated list of these assumptions and will add one in the revision. revision: yes
-
Referee: [Proof of the weak LDP] The subadditivity step (presumably in the proof of the main theorem): the inequality Lambda_{m+n}(f) <= Lambda_m(f) + sup_x log int exp(m <f, mu_m>) P^n(x, dy) is asserted, but the subsequent limit argument requires justification that the supremum can be controlled uniformly when the initial measure is arbitrary and the chain is not irreducible. No counter-example to subadditivity is ruled out, and the Feller-type regularity needed to close the argument is not stated.
Authors: The subadditivity inequality follows directly from the Markov property by conditioning on the position after m steps and taking the essential supremum over the support of the m-step measure; the Feller-type regularity (already in Section 2) ensures the supremum is attained and can be passed to the limit uniformly for the fixed initial measure. Non-irreducibility is handled precisely because we only claim a weak LDP (not a full LDP with convex rate function). We will insert an additional paragraph in the proof clarifying the uniform control and explicitly reference the Feller condition. revision: partial
Circularity Check
Derivation self-contained via subadditivity; no circular reductions
full rationale
The paper explicitly states that its proof of the weak large deviations principle is entirely self-contained and relies on subadditivity applied to the log-moment generating functionals of the empirical measures. No self-citations, fitted parameters renamed as predictions, or ansatzes imported from prior work are invoked for the central result. The derivation proceeds directly from the Markov property and subadditivity under the paper's mild assumptions, without any step that reduces by construction to its own inputs or to a self-referential definition. This is the standard case of an independent, self-contained argument.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Markov chains on R^d satisfy the mild assumptions that permit application of subadditivity to the relevant functionals of empirical measures.
Reference graph
Works this paper leans on
-
[1]
M. D. Donsker and S. R. S. Varadhan. Asymptotic evaluation of certain Markov process expectations for large time. I.Comm. Pure Appl. Math., 28:1–47, 1975
1975
-
[2]
M. D. Donsker and S. R. S. Varadhan. Asymptotic evaluation of certain Markov process expectations for large time. II.Comm. Pure Appl. Math., 28:279–301, 1975
1975
-
[3]
M. D. Donsker and S. R. S. Varadhan. Asymptotic evaluation of certain Markov process expectations for large time. III.Comm. Pure Appl. Math., 29(4):389–461, 1976
1976
-
[4]
Springer-Verlag, Berlin, 2010
Amir Dembo and Ofer Zeitouni.Large deviations techniques and applications, volume 38 ofStochastic Modelling and Applied Probability. Springer-Verlag, Berlin, 2010. Corrected reprint of the second (1998) edition
2010
-
[5]
de Acosta.Large deviations for Markov chains, volume 229 ofCambridge Tracts in Mathematics
Alejandro D. de Acosta.Large deviations for Markov chains, volume 229 ofCambridge Tracts in Mathematics. Cambridge University Press, Cambridge, 2022
2022
-
[6]
D. W. Stroock.An introduction to the theory of large deviations. Universitext. Springer-Verlag, New York, 1984. 31
1984
-
[7]
Richard S. Ellis. Large deviations for the empirical measure of a Markov chain with an application to the multivariate empirical measure.Ann. Probab., 16(4):1496–1508, 1988
1988
-
[8]
J. R. Baxter, N. C. Jain, and S. R. S. Varadhan. Some familiar examples for which the large deviation principle does not hold.Comm. Pure Appl. Math., 44(8-9):911–923, 1991
1991
-
[9]
Ney and E
P. Ney and E. Nummelin. Markov additive processes II. Large deviations.Ann. Probab., 15(2):593–609, 1987
1987
-
[10]
de Acosta
A. de Acosta. Large deviations for vector-valued functionals of a Markov chain: lower bounds. Ann. Probab., 16(3):925–960, 1988
1988
-
[11]
I. H. Dinwoodie. Identifying a large deviation rate function.Ann. Probab., 21(1):216–231, 1993
1993
-
[12]
Large deviations for possibly reducible Markov chains on discrete state spaces,
Léo Daures. Large deviations for possibly reducible Markov chains on discrete state spaces,
-
[13]
arXiv: 2507.11166 [math.PR]
work page internal anchor Pith review arXiv
-
[14]
Large deviations for empirical measures of not necessarily irreducible countable Markov chains with arbitrary initial measures.Acta Math
Yi Wen Jiang and Li Ming Wu. Large deviations for empirical measures of not necessarily irreducible countable Markov chains with arbitrary initial measures.Acta Math. Sin. (Engl. Ser.), 21(6):1377–1390, 2005
2005
-
[15]
American Mathematical Society, Providence, RI, 2015
Firas Rassoul-Agha and Timo Seppäläinen.A course on large deviations with an introduction to Gibbs measures, volume 162 ofGraduate Studies in Mathematics. American Mathematical Society, Providence, RI, 2015
2015
-
[16]
Azencott
R. Azencott. Grandes déviations et applications. InEighth Saint Flour Probability Summer School—1978 (Saint Flour, 1978), volume 774 ofLecture Notes in Math., pages 1–176. Springer, Berlin, 1980
1978
-
[17]
R. R. Bahadur and S. L. Zabell. Large deviations of the sample mean in general vector spaces.Ann. Probab., 7(4):587–621, 1979
1979
-
[18]
D. Ruelle. Correlation functionals.J. Mathematical Phys., 6:201–220, 1965
1965
-
[19]
A variational formulation of equilibrium statistical mechanics and the gibbs phase rule.Communications in Mathematical Physics, 5:324–329, 1967
David Ruelle. A variational formulation of equilibrium statistical mechanics and the gibbs phase rule.Communications in Mathematical Physics, 5:324–329, 1967
1967
-
[20]
Oscar E. Lanford. Entropy and equilibrium states in classical statistical mechanics. pages 1–113, 1973
1973
-
[21]
Lewis and Charles-Edouard Pfister
J.T. Lewis and Charles-Edouard Pfister. Thermodynamic probability theory: Some aspects of large deviations.Russian Mathematical Surveys, 50, 1995
1995
-
[22]
V. I. Bogachev.Measure theory. Vol. I, II. Springer-Verlag, Berlin, 2007
2007
-
[23]
Springer, Cham, third edition, 2021
Olav Kallenberg.Foundations of modern probability, volume 99 ofProbability Theory and Stochastic Modelling. Springer, Cham, third edition, 2021. 32
2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.