pith. sign in

arxiv: 2404.12589 · v7 · pith:GMJYPVVCnew · submitted 2024-04-19 · 🧮 math.PR · cs.IT· math.IT· math.OC· stat.CO

Geometry and factorization of multivariate Markov chains with applications to MCMC acceleration and approximate inference

classification 🧮 math.PR cs.ITmath.ITmath.OCstat.CO
keywords algorithmchainsmarkovmcmcswappingapplicationsfilteringmixing
0
0 comments X
read the original abstract

This paper analyzes the factorizability and geometry of transition matrices of multivariate Markov chains. Specifically, we demonstrate that the induced chains on factors of a product space can be regarded as information projections with respect to the Kullback-Leibler divergence. This perspective yields Han-Shearer type inequalities and submodularity of the entropy rate of Markov chains, as well as applications in the context of large deviations and mixing time comparison. As concrete algorithmic applications in Markov chain Monte Carlo (MCMC) and approximate inference, we provide three illustrations based on lifted MCMC, swapping algorithm and factored filtering to demonstrate projection samplers improve mixing over the original samplers. The projection sampler based on the swapping algorithm resamples the highest-temperature coordinate at stationarity at each step, and we prove that such practice accelerates the mixing time by multiplicative factors related to the number of temperatures and the dimension of the underlying state space when compared with the original swapping algorithm. Through simple numerical experiments on a bimodal target distribution, we show that the projection samplers mix effectively, in contrast to lifted MCMC and the swapping algorithm, which mix less well. In filtering, our proposed factored filtering scheme is able to scale to high dimensions with linear-in-dimension computational cost per step at the price of an approximation error that can be tracked using the distance to independence, compared with the exponential-in-dimension cost per step of the exact filter.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Information-theoretic coordinate subset and partition selection of multivariate Markov chains via submodular optimization

    math.PR 2025-03 unverdicted novelty 5.0

    Formulates coordinate subset and partition selection for ergodic multivariate Markov chains as submodular problems and supplies greedy algorithms with approximation guarantees plus a generalized distorted greedy variant.