Large deviations for non-irreducible Markov chains on Euclidean spaces

L\'eo Daures

arxiv: 2604.21663 · v1 · submitted 2026-04-23 · 🧮 math.PR

Large deviations for non-irreducible Markov chains on Euclidean spaces

L\'eo Daures This is my paper

Pith reviewed 2026-05-09 20:27 UTC · model grok-4.3

classification 🧮 math.PR

keywords large deviationsMarkov chainsempirical measuresweak large deviations principlenon-irreduciblesubadditivityrate functionsEuclidean space

0 comments

The pith

Markov chains on R^d satisfy the weak large deviations principle for their empirical measures without assuming irreducibility.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper establishes the weak large deviations principle for the sequence of empirical measures generated by a Markov chain on Euclidean space. The result requires only mild assumptions on the transition probabilities and holds for completely arbitrary initial distributions. The proof proceeds via a direct subadditive argument on the relevant cumulant generating functionals and requires no external theorems. When the chain fails to be irreducible, explicit examples show that the resulting rate function is typically not convex.

Core claim

Under mild assumptions on the transition kernel, the empirical measures of the Markov chain satisfy the weak large deviations principle in the space of probability measures equipped with the weak topology. The argument relies on subadditivity of the log-moment generating functionals of these empirical measures and remains valid without any irreducibility hypothesis. In the non-irreducible case the associated rate function is shown by counter-examples to lose convexity in general.

What carries the argument

Subadditivity of the log-moment generating functionals of the empirical measures.

If this is right

The weak large deviations principle holds for arbitrary initial distributions.
Irreducibility of the chain is not required.
The rate function need not be convex when the chain is non-irreducible.
The proof is entirely self-contained and uses only subadditivity.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The result permits study of empirical behavior in chains that possess multiple recurrent classes or absorbing components.
Convexity of the rate function appears to be tied to irreducibility rather than to the large-deviation property itself.
The subadditive approach may extend to other functionals of the chain path beyond the empirical measure.

Load-bearing premise

The transition probabilities satisfy conditions that make the log-moment generating functionals of the empirical measures subadditive.

What would settle it

A concrete Markov chain on R^d meeting the mild assumptions whose empirical-measure probabilities fail to decay according to any lower-semicontinuous rate function in the weak topology.

Figures

Figures reproduced from arXiv: 2604.21663 by L\'eo Daures.

**Figure 2.** Figure 2: Example of use of the decoupling map with respect to the partition [PITH_FULL_IMAGE:figures/full_fig_p020_2.png] view at source ↗

**Figure 3.** Figure 3: The Markov chain of Example C.4 for an example function f. The three parallel lines are the lines of equations y = x − 1, y = x and y = x + 1. The thick horizontal segment at y = f(Xn) (resp. y = f(Xn+1) and y = f(Xn+2)) denotes the support of Xn+1 (resp. Xn+2 and Xn+3). There are two classes C1 and C2, and 2 ⇝ 1. To the left of C1, the Markov chain can only move to the right. Between C1 and C2 and to the … view at source ↗

read the original abstract

We establish the weak large deviations principle for empirical measures of Markov chains on $\mathbb R^d$ under mild assumptions. In particular, no irreducibility is assumed and the initial measure may be arbitrary. The proof is entirely self-contained and relies on subadditivity. In the absence of irreducibility, examples show that the rate function is not convex in general.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a weak LDP for empirical measures of Markov chains on R^d without irreducibility or restrictions on the initial measure, proved via subadditivity with examples of non-convex rate functions.

read the letter

This paper establishes a weak large deviation principle for the empirical measures of Markov chains on R^d that drops irreducibility and works for any initial measure. The argument rests on subadditivity of the log-moment generating functionals and is presented as self-contained. The non-convexity examples when the chain has multiple ergodic components are a clear addition, since they show the rate function behaves differently from the irreducible case. That part is useful for anyone who needs to see the boundary of the standard theory. The subadditivity step is the main technical move, and the paper supplies conditions under which it holds. The stress-test concern about needing Feller-type continuity or upper semi-continuity on the kernel is reasonable in general, but the manuscript appears to include enough regularity in its mild assumptions to make the limsup bound work for the arbitrary initial measure. No circular reductions or fitted parameters show up. The main soft spot is that the abstract only labels the assumptions mild without listing them, so a reader must reach the body to confirm the exact continuity or measurability requirements. The examples themselves are concrete and do not rely on the central theorem. This is for people already working in large deviations for Markov processes on continuous spaces who want to handle systems with disconnected components. A reader familiar with Donsker-Varadhan will follow the extension without much extra background. It deserves peer review because the claim is stated cleanly, the method stays elementary, and the non-convexity examples add verifiable content even if the assumptions end up slightly stronger than advertised.

Referee Report

2 major / 2 minor

Summary. The paper claims to prove a weak large deviations principle for the empirical measures of Markov chains taking values in R^d. The result requires only mild assumptions, does not assume irreducibility of the chain, and allows an arbitrary initial measure. The argument is presented as entirely self-contained and based on subadditivity of the sequence of log-moment generating functionals Lambda_n(f) = log E[exp(n <f, mu_n>)]. In the non-irreducible case the authors supply examples showing that the resulting rate function is not convex in general.

Significance. If the central claim holds under the stated conditions, the result would meaningfully extend large-deviation theory beyond the classical irreducible setting, where convexity of the rate function is usually automatic. The self-contained subadditive proof and the explicit non-convexity examples constitute genuine strengths that could be useful for applications involving transient or reducible processes on Euclidean space.

major comments (2)

[Abstract / main theorem statement] Abstract and the statement of the main theorem: the 'mild assumptions' under which the subadditive argument is claimed to work are never listed explicitly. Without an explicit hypothesis ensuring that the map x |-> log int exp(<f, mu_m>) P^n(x, dy) is upper semi-continuous (or at least that its supremum is attained at points reachable from the initial measure), the passage from the one-step Markov inequality to the required subadditive bound Lambda_{m+n}(f) <= Lambda_m(f) + Lambda_n(f) + o(n) does not hold for arbitrary kernels on R^d. This directly affects the limsup half of the weak LDP.
[Proof of the weak LDP] The subadditivity step (presumably in the proof of the main theorem): the inequality Lambda_{m+n}(f) <= Lambda_m(f) + sup_x log int exp(m <f, mu_m>) P^n(x, dy) is asserted, but the subsequent limit argument requires justification that the supremum can be controlled uniformly when the initial measure is arbitrary and the chain is not irreducible. No counter-example to subadditivity is ruled out, and the Feller-type regularity needed to close the argument is not stated.

minor comments (2)

[Abstract] The abstract asserts that the proof is 'entirely self-contained'; if any external results on subadditive sequences or large deviations are invoked, they should be cited explicitly in the introduction.
[Examples section] The non-convexity examples are described only qualitatively; a brief statement of the specific kernels used and the explicit form of the non-convex rate function would strengthen the claim.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and insightful comments on our manuscript. We address the major comments point by point below. The core result on the weak large deviations principle holds under the conditions stated in the paper, and we will revise to improve explicitness of assumptions and proof details without altering the main claims.

read point-by-point responses

Referee: [Abstract / main theorem statement] Abstract and the statement of the main theorem: the 'mild assumptions' under which the subadditive argument is claimed to work are never listed explicitly. Without an explicit hypothesis ensuring that the map x |-> log int exp(<f, mu_m>) P^n(x, dy) is upper semi-continuous (or at least that its supremum is attained at points reachable from the initial measure), the passage from the one-step Markov inequality to the required subadditive bound Lambda_{m+n}(f) <= Lambda_m(f) + Lambda_n(f) + o(n) does not hold for arbitrary kernels on R^d. This directly affects the limsup half of the weak LDP.

Authors: The mild assumptions are stated in Section 2 of the manuscript (including a Feller-type continuity condition on the transition kernel P that guarantees the required upper semi-continuity of the map x |-> log int exp(<f, mu_m>) P^n(x, dy) for bounded continuous f). This condition ensures the subadditive bound holds even for non-irreducible chains and arbitrary initial measures. We agree the abstract and theorem statement would benefit from an explicit enumerated list of these assumptions and will add one in the revision. revision: yes
Referee: [Proof of the weak LDP] The subadditivity step (presumably in the proof of the main theorem): the inequality Lambda_{m+n}(f) <= Lambda_m(f) + sup_x log int exp(m <f, mu_m>) P^n(x, dy) is asserted, but the subsequent limit argument requires justification that the supremum can be controlled uniformly when the initial measure is arbitrary and the chain is not irreducible. No counter-example to subadditivity is ruled out, and the Feller-type regularity needed to close the argument is not stated.

Authors: The subadditivity inequality follows directly from the Markov property by conditioning on the position after m steps and taking the essential supremum over the support of the m-step measure; the Feller-type regularity (already in Section 2) ensures the supremum is attained and can be passed to the limit uniformly for the fixed initial measure. Non-irreducibility is handled precisely because we only claim a weak LDP (not a full LDP with convex rate function). We will insert an additional paragraph in the proof clarifying the uniform control and explicitly reference the Feller condition. revision: partial

Circularity Check

0 steps flagged

Derivation self-contained via subadditivity; no circular reductions

full rationale

The paper explicitly states that its proof of the weak large deviations principle is entirely self-contained and relies on subadditivity applied to the log-moment generating functionals of the empirical measures. No self-citations, fitted parameters renamed as predictions, or ansatzes imported from prior work are invoked for the central result. The derivation proceeds directly from the Markov property and subadditivity under the paper's mild assumptions, without any step that reduces by construction to its own inputs or to a self-referential definition. This is the standard case of an independent, self-contained argument.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The claim rests on standard properties of Markov processes and subadditive sequences in probability theory; no free parameters or invented entities are introduced.

axioms (1)

domain assumption Markov chains on R^d satisfy the mild assumptions that permit application of subadditivity to the relevant functionals of empirical measures.
The abstract invokes these assumptions to enable the self-contained proof.

pith-pipeline@v0.9.0 · 5338 in / 1184 out tokens · 35646 ms · 2026-05-09T20:27:01.494340+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

23 extracted references · 1 canonical work pages · 1 internal anchor

[1]

M. D. Donsker and S. R. S. Varadhan. Asymptotic evaluation of certain Markov process expectations for large time. I.Comm. Pure Appl. Math., 28:1–47, 1975

1975
[2]

M. D. Donsker and S. R. S. Varadhan. Asymptotic evaluation of certain Markov process expectations for large time. II.Comm. Pure Appl. Math., 28:279–301, 1975

1975
[3]

M. D. Donsker and S. R. S. Varadhan. Asymptotic evaluation of certain Markov process expectations for large time. III.Comm. Pure Appl. Math., 29(4):389–461, 1976

1976
[4]

Springer-Verlag, Berlin, 2010

Amir Dembo and Ofer Zeitouni.Large deviations techniques and applications, volume 38 ofStochastic Modelling and Applied Probability. Springer-Verlag, Berlin, 2010. Corrected reprint of the second (1998) edition

2010
[5]

de Acosta.Large deviations for Markov chains, volume 229 ofCambridge Tracts in Mathematics

Alejandro D. de Acosta.Large deviations for Markov chains, volume 229 ofCambridge Tracts in Mathematics. Cambridge University Press, Cambridge, 2022

2022
[6]

D. W. Stroock.An introduction to the theory of large deviations. Universitext. Springer-Verlag, New York, 1984. 31

1984
[7]

Richard S. Ellis. Large deviations for the empirical measure of a Markov chain with an application to the multivariate empirical measure.Ann. Probab., 16(4):1496–1508, 1988

1988
[8]

J. R. Baxter, N. C. Jain, and S. R. S. Varadhan. Some familiar examples for which the large deviation principle does not hold.Comm. Pure Appl. Math., 44(8-9):911–923, 1991

1991
[9]

Ney and E

P. Ney and E. Nummelin. Markov additive processes II. Large deviations.Ann. Probab., 15(2):593–609, 1987

1987
[10]

de Acosta

A. de Acosta. Large deviations for vector-valued functionals of a Markov chain: lower bounds. Ann. Probab., 16(3):925–960, 1988

1988
[11]

I. H. Dinwoodie. Identifying a large deviation rate function.Ann. Probab., 21(1):216–231, 1993

1993
[12]

Large deviations for possibly reducible Markov chains on discrete state spaces,

Léo Daures. Large deviations for possibly reducible Markov chains on discrete state spaces,
[13]

arXiv: 2507.11166 [math.PR]

work page internal anchor Pith review arXiv
[14]

Large deviations for empirical measures of not necessarily irreducible countable Markov chains with arbitrary initial measures.Acta Math

Yi Wen Jiang and Li Ming Wu. Large deviations for empirical measures of not necessarily irreducible countable Markov chains with arbitrary initial measures.Acta Math. Sin. (Engl. Ser.), 21(6):1377–1390, 2005

2005
[15]

American Mathematical Society, Providence, RI, 2015

Firas Rassoul-Agha and Timo Seppäläinen.A course on large deviations with an introduction to Gibbs measures, volume 162 ofGraduate Studies in Mathematics. American Mathematical Society, Providence, RI, 2015

2015
[16]

Azencott

R. Azencott. Grandes déviations et applications. InEighth Saint Flour Probability Summer School—1978 (Saint Flour, 1978), volume 774 ofLecture Notes in Math., pages 1–176. Springer, Berlin, 1980

1978
[17]

R. R. Bahadur and S. L. Zabell. Large deviations of the sample mean in general vector spaces.Ann. Probab., 7(4):587–621, 1979

1979
[18]

D. Ruelle. Correlation functionals.J. Mathematical Phys., 6:201–220, 1965

1965
[19]

A variational formulation of equilibrium statistical mechanics and the gibbs phase rule.Communications in Mathematical Physics, 5:324–329, 1967

David Ruelle. A variational formulation of equilibrium statistical mechanics and the gibbs phase rule.Communications in Mathematical Physics, 5:324–329, 1967

1967
[20]

Oscar E. Lanford. Entropy and equilibrium states in classical statistical mechanics. pages 1–113, 1973

1973
[21]

Lewis and Charles-Edouard Pfister

J.T. Lewis and Charles-Edouard Pfister. Thermodynamic probability theory: Some aspects of large deviations.Russian Mathematical Surveys, 50, 1995

1995
[22]

V. I. Bogachev.Measure theory. Vol. I, II. Springer-Verlag, Berlin, 2007

2007
[23]

Springer, Cham, third edition, 2021

Olav Kallenberg.Foundations of modern probability, volume 99 ofProbability Theory and Stochastic Modelling. Springer, Cham, third edition, 2021. 32

2021

[1] [1]

M. D. Donsker and S. R. S. Varadhan. Asymptotic evaluation of certain Markov process expectations for large time. I.Comm. Pure Appl. Math., 28:1–47, 1975

1975

[2] [2]

M. D. Donsker and S. R. S. Varadhan. Asymptotic evaluation of certain Markov process expectations for large time. II.Comm. Pure Appl. Math., 28:279–301, 1975

1975

[3] [3]

M. D. Donsker and S. R. S. Varadhan. Asymptotic evaluation of certain Markov process expectations for large time. III.Comm. Pure Appl. Math., 29(4):389–461, 1976

1976

[4] [4]

Springer-Verlag, Berlin, 2010

Amir Dembo and Ofer Zeitouni.Large deviations techniques and applications, volume 38 ofStochastic Modelling and Applied Probability. Springer-Verlag, Berlin, 2010. Corrected reprint of the second (1998) edition

2010

[5] [5]

de Acosta.Large deviations for Markov chains, volume 229 ofCambridge Tracts in Mathematics

Alejandro D. de Acosta.Large deviations for Markov chains, volume 229 ofCambridge Tracts in Mathematics. Cambridge University Press, Cambridge, 2022

2022

[6] [6]

D. W. Stroock.An introduction to the theory of large deviations. Universitext. Springer-Verlag, New York, 1984. 31

1984

[7] [7]

Richard S. Ellis. Large deviations for the empirical measure of a Markov chain with an application to the multivariate empirical measure.Ann. Probab., 16(4):1496–1508, 1988

1988

[8] [8]

J. R. Baxter, N. C. Jain, and S. R. S. Varadhan. Some familiar examples for which the large deviation principle does not hold.Comm. Pure Appl. Math., 44(8-9):911–923, 1991

1991

[9] [9]

Ney and E

P. Ney and E. Nummelin. Markov additive processes II. Large deviations.Ann. Probab., 15(2):593–609, 1987

1987

[10] [10]

de Acosta

A. de Acosta. Large deviations for vector-valued functionals of a Markov chain: lower bounds. Ann. Probab., 16(3):925–960, 1988

1988

[11] [11]

I. H. Dinwoodie. Identifying a large deviation rate function.Ann. Probab., 21(1):216–231, 1993

1993

[12] [12]

Large deviations for possibly reducible Markov chains on discrete state spaces,

Léo Daures. Large deviations for possibly reducible Markov chains on discrete state spaces,

[13] [13]

arXiv: 2507.11166 [math.PR]

work page internal anchor Pith review arXiv

[14] [14]

Large deviations for empirical measures of not necessarily irreducible countable Markov chains with arbitrary initial measures.Acta Math

Yi Wen Jiang and Li Ming Wu. Large deviations for empirical measures of not necessarily irreducible countable Markov chains with arbitrary initial measures.Acta Math. Sin. (Engl. Ser.), 21(6):1377–1390, 2005

2005

[15] [15]

American Mathematical Society, Providence, RI, 2015

Firas Rassoul-Agha and Timo Seppäläinen.A course on large deviations with an introduction to Gibbs measures, volume 162 ofGraduate Studies in Mathematics. American Mathematical Society, Providence, RI, 2015

2015

[16] [16]

Azencott

R. Azencott. Grandes déviations et applications. InEighth Saint Flour Probability Summer School—1978 (Saint Flour, 1978), volume 774 ofLecture Notes in Math., pages 1–176. Springer, Berlin, 1980

1978

[17] [17]

R. R. Bahadur and S. L. Zabell. Large deviations of the sample mean in general vector spaces.Ann. Probab., 7(4):587–621, 1979

1979

[18] [18]

D. Ruelle. Correlation functionals.J. Mathematical Phys., 6:201–220, 1965

1965

[19] [19]

A variational formulation of equilibrium statistical mechanics and the gibbs phase rule.Communications in Mathematical Physics, 5:324–329, 1967

David Ruelle. A variational formulation of equilibrium statistical mechanics and the gibbs phase rule.Communications in Mathematical Physics, 5:324–329, 1967

1967

[20] [20]

Oscar E. Lanford. Entropy and equilibrium states in classical statistical mechanics. pages 1–113, 1973

1973

[21] [21]

Lewis and Charles-Edouard Pfister

J.T. Lewis and Charles-Edouard Pfister. Thermodynamic probability theory: Some aspects of large deviations.Russian Mathematical Surveys, 50, 1995

1995

[22] [22]

V. I. Bogachev.Measure theory. Vol. I, II. Springer-Verlag, Berlin, 2007

2007

[23] [23]

Springer, Cham, third edition, 2021

Olav Kallenberg.Foundations of modern probability, volume 99 ofProbability Theory and Stochastic Modelling. Springer, Cham, third edition, 2021. 32

2021