Kinetic Interacting Particle Langevin Monte Carlo
Pith reviewed 2026-05-23 23:05 UTC · model grok-4.3
The pith
A diffusion evolving jointly over parameters and latent variables has stationary distribution concentrating around the maximum marginal likelihood estimate.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We propose a diffusion process that evolves jointly in the space of parameters and latent variables and show that the stationary distribution of this diffusion concentrates around the maximum marginal likelihood estimate of the parameters. We obtain nonasymptotic rates of convergence in Wasserstein-2 distance for the case where the joint log-likelihood is strongly concave with respect to latent variables and parameters. We achieve accelerated convergence rates clearly demonstrating improvement in dimension dependence.
What carries the argument
The Kinetic Interacting Particle Langevin Monte Carlo (KIPLMC) diffusion: an underdamped Langevin process on the joint space of parameters and latent variables whose stationary measure concentrates at the maximizer of the marginal likelihood.
If this is right
- The stationary distribution of the joint diffusion concentrates around the maximum marginal likelihood estimate.
- Nonasymptotic Wasserstein-2 convergence rates hold for both discretizations under strong concavity of the joint log-likelihood.
- The rates exhibit acceleration and improved dimension dependence relative to non-interacting or overdamped baselines.
- The two explicit discretizations serve as practical algorithms for parameter estimation in latent variable models.
- Numerical experiments confirm effectiveness for statistical inference tasks including unsupervised learning and inverse problems.
Where Pith is reading between the lines
- The interacting-particle structure may allow variance reduction across particles that further improves mixing beyond the stated dimension gains.
- The joint diffusion construction could be adapted to target other marginal functionals such as posterior means rather than point estimates.
- Relaxing strong concavity via local analysis or adaptive step-size rules would widen the range of models to which the rates apply.
- The same joint-evolution idea might transfer to continuous-time formulations of other interacting-particle samplers used in Bayesian computation.
Load-bearing premise
The joint log-likelihood must be strongly concave with respect to both the latent variables and the parameters.
What would settle it
Simulate the discretized KIPLMC algorithm on a model satisfying strong concavity and check whether the observed Wasserstein-2 distance to the target measure fails to decay at the accelerated rate claimed or whether the sampled parameters fail to concentrate near the marginal maximum-likelihood point.
Figures
read the original abstract
This paper introduces and analyses interacting underdamped Langevin algorithms, termed Kinetic Interacting Particle Langevin Monte Carlo (KIPLMC) methods, for statistical inference in latent variable models. We propose a diffusion process that evolves jointly in the space of parameters and latent variables and show that the stationary distribution of this diffusion concentrates around the maximum marginal likelihood estimate of the parameters. We then provide two explicit discretisations of this diffusion as practical algorithms to estimate parameters of statistical models. For each algorithm, we obtain nonasymptotic rates of convergence in Wasserstein-2 distance for the case where the joint log-likelihood is strongly concave with respect to latent variables and parameters. We achieve accelerated convergence rates clearly demonstrating improvement in dimension dependence. To demonstrate the utility of the introduced methodology, we provide numerical experiments that illustrate the effectiveness of the proposed diffusion for statistical inference. Our setting covers a broad number of applications, including unsupervised learning, statistical inference, and inverse problems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces Kinetic Interacting Particle Langevin Monte Carlo (KIPLMC) methods for inference in latent variable models. It defines a joint diffusion process over parameters and latent variables whose stationary distribution is shown to concentrate around the maximum marginal likelihood estimator of the parameters. Two explicit discretizations are derived as practical algorithms, and nonasymptotic Wasserstein-2 convergence rates are established under the assumption that the joint log-likelihood is strongly concave in both latent variables and parameters; these rates exhibit improved dimension dependence relative to prior methods. Numerical experiments on synthetic and real data illustrate practical performance.
Significance. If the nonasymptotic rates hold, the work supplies a theoretically grounded joint diffusion sampler for latent-variable models that achieves accelerated convergence with better scaling in high dimensions. The explicit conditioning on strong concavity of the joint log-likelihood makes the guarantees conditional but falsifiable, and the approach covers applications in unsupervised learning, statistical inference, and inverse problems. The provision of two discretizations and accompanying experiments strengthens the practical contribution.
minor comments (3)
- [§3.2] §3.2, Algorithm 1: the discretization step-size h is introduced without an explicit dependence on the strong-concavity constants; a short remark clarifying how h is chosen in the numerical experiments would improve reproducibility.
- [Figure 2] Figure 2 caption: the legend labels 'KIPLMC-1' and 'KIPLMC-2' but the text refers to 'Algorithm 1' and 'Algorithm 2'; consistent naming would avoid minor confusion.
- [Theorem 4.3] Theorem 4.3: the statement of the W2 bound contains an implicit dependence on the latent dimension d_z that is not highlighted in the main text discussion of dimension improvement; adding one sentence contrasting the d_z scaling with standard Langevin would clarify the claimed acceleration.
Simulated Author's Rebuttal
We thank the referee for their positive summary of the manuscript, recognition of its significance, and recommendation for minor revision. The report does not list any specific major comments.
Circularity Check
No significant circularity identified
full rationale
The paper proposes a novel diffusion process evolving jointly over parameters and latent variables, establishes its stationary distribution concentrates around the maximum marginal likelihood estimator, and derives nonasymptotic Wasserstein-2 convergence rates for two explicit discretizations under the explicit assumption that the joint log-likelihood is strongly concave in both variables. These steps are presented as direct mathematical analysis of the constructed process; the rates are conditional on the stated concavity assumption rather than derived by fitting or redefinition. No equations reduce target quantities to inputs by construction, no load-bearing self-citations are invoked for uniqueness or ansatzes, and the central claims do not rename known empirical patterns. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The joint log-likelihood is strongly concave with respect to latent variables and parameters
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We propose a diffusion process that evolves jointly in the space of parameters and latent variables and show that the stationary distribution of this diffusion concentrates around the maximum marginal likelihood estimate
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
nonasymptotic rates of convergence in Wasserstein-2 distance for the case where the joint log-likelihood is strongly concave
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Deniz Akyildiz and Sotirios Sabanis
Ö. Deniz Akyildiz and Sotirios Sabanis. Nonasymptotic analysis of stochastic gradient hamiltonian monte carlo under local conditions for nonconvex optimization.Journal of Machine Learning Research, 25(113):1–34, 2024. Cited on pages 5, 10, 11, 17, and 19
work page 2024
-
[2]
Deniz Akyildiz, Dan Crisan, and Joaquín Míguez
Ö. Deniz Akyildiz, Dan Crisan, and Joaquín Míguez. Parallel sequential Monte Carlo for stochastic gradient-free nonconvex optimization.Statistics and Computing, 30(6):1645–1663, 2020. Cited on page 2
work page 2020
-
[3]
Ö Deniz Akyildiz, Francesca Romana Crucinio, Mark Girolami, Tim Johnston, and Sotirios Sabanis. Interacting particle Langevin algorithm for maximum marginal likelihood estimation.arXiv preprint arXiv:2303.13429, 2023. Cited on pages 2, 3, 6, 7, 13, 14, 16, 17, 18, 19, and 29. 19
-
[4]
Deniz Akyildiz, Michela Ottobre, and Iain Souttar
Ö. Deniz Akyildiz, Michela Ottobre, and Iain Souttar. A multiscale perspective on maximum marginal likelihood estimation. arXiv preprint arXiv:2406.04187, 2024. Cited on page 2
-
[5]
Statistical finite elements via langevin dynamics
Ömer Deniz Akyildiz, Connor Duffin, Sotirios Sabanis, and Mark Girolami. Statistical finite elements via langevin dynamics. SIAM/ASA Journal on Uncertainty Quantification, 10(4):1560–1585, 2022. Cited on page 19
work page 2022
-
[6]
Atchadé, Gersende Fort, and Eric Moulines
Yves F. Atchadé, Gersende Fort, and Eric Moulines. On perturbed proximal gradient algorithms.Journal of Machine Learning Research, 18(10):1–33, 2017. URLhttp://jmlr.org/papers/v18/15-038.html. Cited on pages 2 and 6
work page 2017
-
[7]
José M Bernardo and Adrian FM Smith.Bayesian theory, volume 405. John Wiley & Sons, 2009. Cited on page 2
work page 2009
-
[8]
Billingsley.Probability and Measure
P. Billingsley.Probability and Measure. Wiley Series in Probability and Statistics. Wiley, 1995. ISBN 9780471007104. URL https://books.google.co.uk/books?id=z39jQgAACAAJ. Cited on page 10
work page 1995
-
[9]
Latent Dirichlet allocation.Journal of Machine Learning Research, 3(Jan):993–1022, 2003
David M Blei, Andrew Y Ng, and Michael I Jordan. Latent Dirichlet allocation.Journal of Machine Learning Research, 3(Jan):993–1022, 2003. Cited on page 1
work page 2003
-
[10]
James G Booth and James P Hobert. Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 61(1):265–285, 1999. Cited on page 2
work page 1999
-
[11]
Optimizing interacting Langevin dynamics using spectral gaps
Anastasia Borovykh, Nikolas Kantas, Panos Parpas, and Greg Pavliotis. Optimizing interacting Langevin dynamics using spectral gaps. InProceedings of the 38th International Conference on Machine Learning (ICML 2021), 2021. Cited on page 2
work page 2021
-
[12]
The tamed unadjusted Langevin algorithm
Nicolas Brosse, Alain Durmus, Éric Moulines, and Sotirios Sabanis. The tamed unadjusted Langevin algorithm. Stochastic Processes and their Applications, 129(10):3638–3663, 2019. Cited on page 4
work page 2019
-
[13]
Olivier Cappé, Arnaud Doucet, Marc Lavielle, and Eric Moulines. Simulation-based methods for blind maximum-likelihood filter identification.Signal Processing, 73(1-2):3–25, 1999. Cited on page 2
work page 1999
- [14]
-
[15]
Gilles Celeux. The SEM algorithm: a probabilistic teacher algorithm derived from the em algorithm for the mixture problem.Computational Statistics Quarterly, 2:73–82, 1985. Cited on page 2
work page 1985
-
[16]
A stochastic approximation type EM algorithm for the mixture problem
Gilles Celeux and Jean Diebolt. A stochastic approximation type EM algorithm for the mixture problem. Stochastics: An International Journal of Probability and Stochastic Processes, 41(1-2):119–134, 1992. Cited on page 2
work page 1992
-
[17]
Monte Carlo EM estimation for time series models involving counts
KS Chan and Johannes Ledolter. Monte Carlo EM estimation for time series models involving counts. Journal of the American Statistical Association, 90(429):242–252, 1995. Cited on page 2
work page 1995
-
[18]
Stochastic gradient hamiltonian monte carlo for non-convex learning
Huy N Chau and Miklós Rásonyi. Stochastic gradient hamiltonian monte carlo for non-convex learning. Stochastic Processes and their Applications, 149:341–368, 2022. Cited on page 5
work page 2022
-
[19]
Chatterji, Yasin Abbasi-Yadkori, Peter L
Xiang Cheng, Niladri S Chatterji, Yasin Abbasi-Yadkori, Peter L Bartlett, and Michael I Jordan. Sharp convergence rates for Langevin dynamics in the nonconvex setting.arXiv preprint arXiv:1805.01648,
-
[20]
Underdamped Langevin MCMC: A non-asymptotic analysis
Xiang Cheng, Niladri S Chatterji, Peter L Bartlett, and Michael I Jordan. Underdamped Langevin MCMC: A non-asymptotic analysis. InConference On Learning Theory, pages 300–323, 2018. Cited on pages 3, 5, 7, 8, and 18. 20
work page 2018
-
[21]
Analysis of Langevin Monte Carlo from Poincare to Log-Sobolev
Sinho Chewi, Murat A Erdogdu, Mufan Li, Ruoqi Shen, and Shunshi Zhang. Analysis of Langevin Monte Carlo from Poincare to Log-Sobolev. InConference on Learning Theory, pages 1–2. PMLR, 2022. Cited on page 4
work page 2022
-
[22]
Arnak Dalalyan. Further and stronger analogy between sampling and optimization: Langevin monte carlo and gradient descent. InConference on Learning Theory, pages 678–689, 2017. Cited on pages 2 and 4
work page 2017
-
[23]
Theoretical guarantees for approximate sampling from smooth and log-concave densities
Arnak S Dalalyan. Theoretical guarantees for approximate sampling from smooth and log-concave densities. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 79(3):651–676,
-
[24]
Cited on pages 2, 6, and 10
-
[25]
User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient
Arnak S Dalalyan and Avetik Karagulyan. User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient. Stochastic Processes and their Applications, 129(12):5278–5311, 2019. Cited on pages 4 and 10
work page 2019
-
[26]
Arnak S Dalalyan and Lionel Riou-Durand. On sampling from a log-concave density using kinetic langevin diffusions.Bernoulli, 26(3):1956–1988, 2020. Cited on pages 2, 3, 4, 5, 7, 8, 9, 12, 14, 17, 18, 26, and 29
work page 1956
-
[27]
Valentin De Bortoli, Alain Durmus, Marcelo Pereyra, and Ana F Vidal. Efficient stochastic optimisation by unadjusted langevin monte carlo.Statistics and Computing, 31(3):1–18, 2021. Cited on pages 2 and 6
work page 2021
-
[28]
Arthur P Dempster, Nan M Laird, and Donald B Rubin. Maximum likelihood from incomplete data via the em algorithm.Journal of the Royal Statistical Society: Series B (Methodological), 39(1):1–22, 1977. Cited on pages 1 and 2
work page 1977
-
[29]
A stochastic EM algorithm for approximating the maximum likelihood estimate
J Diebolt and E HS Ip. A stochastic EM algorithm for approximating the maximum likelihood estimate. In W. R. Gilks, S. T. Richardson, and D. J. Spiegelhalter, editors,Markov Chain Monte Carlo in Practice. CRC Publishers, 1996. Cited on page 2
work page 1996
-
[30]
Randal Douc, Eric Moulines, and David Stoffer.Nonlinear time series: Theory, methods and applications with R examples. CRC press, 2014. Cited on page 6
work page 2014
-
[31]
Jin-Chuan Duan, Andras Fulop, and Yu-Wei Hsieh. Maximum likelihood estimation of latent variable models by SMC with marginalization and data cloning.USC-INET Research Paper, (17-27), 2017. Cited on page 2
work page 2017
-
[32]
Nonasymptotic convergence analysis for the unadjusted Langevin algorithm
Alain Durmus and Eric Moulines. Nonasymptotic convergence analysis for the unadjusted Langevin algorithm. The Annals of Applied Probability, 27(3):1551–1587, 2017. Cited on pages 2, 4, 6, and 10
work page 2017
-
[33]
High-dimensional Bayesian inference via the unadjusted Langevin algorithm
Alain Durmus and Eric Moulines. High-dimensional Bayesian inference via the unadjusted Langevin algorithm. Bernoulli, 25(4A):2854–2882, 2019. Cited on pages 2, 4, 6, and 7
work page 2019
-
[34]
Analysis of Langevin Monte Carlo via convex optimization
Alain Durmus, Szymon Majewski, and Błażej Miasojedow. Analysis of Langevin Monte Carlo via convex optimization. Journal of Machine Learning Research, 20(1):2666–2711, 2019. Cited on pages 2 and 4
work page 2019
-
[35]
Couplings and quantitative contraction rates for Langevin dynamics
Andreas Eberle, Arnaud Guillin, and Raphael Zimmer. Couplings and quantitative contraction rates for Langevin dynamics. The Annals of Probability, 47(4):1982–2010, 2019. Cited on page 28
work page 1982
-
[36]
Paula Cordero Encinar, Francesca R Crucinio, and O. Deniz Akyildiz. Proximal Interacting Particle Langevin Algorithms. arXiv preprint arXiv:2406.14292, 2024. Cited on page 2
-
[37]
Zhou Fan, Leying Guan, Yandi Shen, and Yihong Wu. Gradient flows for empirical bayes in high- dimensional linear models.arXiv preprint arXiv:2312.12708, 2023. Cited on page 19
-
[38]
A multiple-imputation Metropolis version of the EM algorithm
Carlo Gaetan and Jian-Feng Yao. A multiple-imputation Metropolis version of the EM algorithm. Biometrika, 90(3):643–654, 2003. Cited on page 2. 21
work page 2003
-
[39]
Xuefeng Gao, Mert Gürbüzbalaban, and Lingjiong Zhu. Global convergence of stochastic gradient hamiltonian monte carlo for nonconvex stochastic optimization: Nonasymptotic performance bounds and momentum-based acceleration.Operations Research, 70(5):2931–2947, 2022. Cited on pages 5 and 17
work page 2022
-
[40]
Sara Grassi and Lorenzo Pareschi. From particle swarm optimization to consensus based optimization: stochastic modeling and mean-field limit.Mathematical Models and Methods in Applied Sciences, 31(08): 1625–1657, 2021. Cited on page 2
work page 2021
-
[41]
Latent space approaches to social network analysis
Peter D Hoff, Adrian E Raftery, and Mark S Handcock. Latent space approaches to social network analysis. Journal of the American Statistical association, 97(460):1090–1098, 2002. Cited on page 1
work page 2002
-
[42]
Chii-Ruey Hwang. Laplace’s method revisited: weak convergence of probability measures.The Annals of Probability, pages 1177–1182, 1980. Cited on pages 6 and 11
work page 1980
-
[43]
MCMC maximum likelihood for latent state models
Eric Jacquier, Michael Johannes, and Nicholas Polson. MCMC maximum likelihood for latent state models. Journal of Econometrics, 137(2):615–640, 2007. Cited on page 2
work page 2007
-
[44]
Adam M Johansen, Arnaud Doucet, and Manuel Davy. Particle methods for maximum likelihood estimation in latent variable models.Statistics and Computing, 18(1):47–57, 2008. Cited on page 2
work page 2008
-
[45]
Tim Johnston, Iosif Lytras, and Sotirios Sabanis. Kinetic langevin mcmc sampling without gradient lipschitz continuity-the strongly convex case.Journal of Complexity, 2024. Cited on pages 19 and 29
work page 2024
-
[46]
Tim Johnston, Nikolaos Makras, and Sotirios Sabanis. Taming the interacting particle langevin algorithm– the superlinear case.arXiv preprint arXiv:2403.19587, 2024. Cited on pages 2 and 10
-
[47]
James Kennedy and Russell Eberhart. Particle swarm optimization. In Proceedings of ICNN’95- international conference on neural networks, volume 4, pages 1942–1948. IEEE, 1995. Cited on page 2
work page 1942
-
[48]
Particle algorithms for maximum likelihood training of latent variable models
Juan Kuntz, Jen Ning Lim, and Adam M Johansen. Particle algorithms for maximum likelihood training of latent variable models. InInternational Conference on Artificial Intelligence and Statistics, pages 5134–5180. PMLR, 2023. Cited on pages 2, 3, 6, 13, 14, 16, 17, 18, 30, and 31
work page 2023
-
[49]
Kenneth Lange. A gradient algorithm locally equivalent to the em algorithm.Journal of the Royal Statistical Society: Series B (Methodological), 57(2):425–437, 1995. Cited on page 2
work page 1995
-
[50]
Momentum particle maximum likelihood
Jen Ning Lim, Juan Kuntz, Samuel Power, and Adam M Johansen. Momentum particle maximum likelihood. In Proceedings of 41st International Conference on Machine Learning (ICML), volume 235,
-
[51]
Cited on pages 2, 3, 6, 7, 14, 17, 18, 19, and 30
-
[52]
Chuanhai Liu and Donald B Rubin. The ecme algorithm: a simple extension of em and ecm with faster monotone convergence.Biometrika, 81(4):633–648, 1994. Cited on page 2
work page 1994
-
[53]
Chatterji, Xiang Cheng, Nicolas Flammarion, Peter L
Yi-An Ma, Niladri S. Chatterji, Xiang Cheng, Nicolas Flammarion, Peter L. Bartlett, and Michael I. Jordan. Is there an analog of Nesterov acceleration for gradient-based MCMC?Bernoulli, 27(3):1942 – 1992, 2021. doi: 10.3150/20-BEJ1297. URLhttps://doi.org/10.3150/20-BEJ1297. Cited on pages 2, 5, 7, and 18
-
[54]
Maximum likelihood estimation via the ecm algorithm: A general framework
Xiao-Li Meng and Donald B Rubin. Maximum likelihood estimation via the ecm algorithm: A general framework. Biometrika, 80(2):267–278, 1993. Cited on page 2
work page 1993
-
[55]
High-dimensional MCMC with a standard splitting scheme for the underdamped Langevin diffusion
Pierre Monmarché. High-dimensional MCMC with a standard splitting scheme for the underdamped Langevin diffusion. Electronic Journal of Statistics, 15(2):4117–4166, 2021. Cited on pages 3, 7, 8, 9, 13, 14, 17, 18, and 30
work page 2021
-
[56]
Dynamical Theories of Brownian Motion
Edward Nelson. Dynamical Theories of Brownian Motion. Princeton University Press, 1967. ISBN 9780691079509. URL http://www.jstor.org/stable/j.ctv15r57jg.1. Cited on page 5. 22
work page 1967
-
[57]
Grigorios A. Pavliotis.Stochastic processes and applications: Diffusion Processes, the Fokker-Planck and langevin equations. Springer, 2014. Cited on pages 4, 5, 10, and 27
work page 2014
-
[58]
René Pinnau, Claudia Totzeck, Oliver Tse, and Stephan Martin. A consensus-based model for global optimization and its mean-field limit.Mathematical Models and Methods in Applied Sciences, 27(01): 183–204, 2017. Cited on page 2
work page 2017
-
[59]
Non-convex learning via Stochastic Gradient Langevin Dynamics: a nonasymptotic analysis
Maxim Raginsky, Alexander Rakhlin, and Matus Telgarsky. Non-convex learning via Stochastic Gradient Langevin Dynamics: a nonasymptotic analysis. InConference on Learning Theory, pages 1674–1703,
-
[60]
Christian P Robert and George Casella.Monte Carlo statistical methods. John Wiley & Sons, 2004. Cited on page 4
work page 2004
-
[61]
Gareth O Roberts and Osnat Stramer. Langevin diffusions and metropolis-hastings algorithms.Method- ology and computing in applied probability, 4(4):337–357, 2002. Cited on page 4
work page 2002
-
[62]
Gareth O Roberts, Richard L Tweedie, et al. Exponential convergence of Langevin distributions and their discrete approximations.Bernoulli, 2(4):341–363, 1996. Cited on page 4
work page 1996
-
[63]
Optimal transport for applied mathematicians.Birkäuser, NY, 55(58-63):94,
Filippo Santambrogio. Optimal transport for applied mathematicians.Birkäuser, NY, 55(58-63):94,
-
[64]
Log-concavity and strong log-concavity: a review.Statistics surveys, 8:45, 2014
Adrien Saumard and Jon A Wellner. Log-concavity and strong log-concavity: a review.Statistics surveys, 8:45, 2014. Cited on page 29
work page 2014
-
[65]
Tuning-free maximum likelihood training of latent variable models via coin betting
Louis Sharrock, Daniel Dodd, and Christopher Nemeth. Tuning-free maximum likelihood training of latent variable models via coin betting. InInternational Conference on Artificial Intelligence and Statistics, pages 1810–1818. PMLR, 2024. Cited on page 19
work page 2024
-
[66]
Robert P Sherman, Yu-Yun K Ho, and Siddhartha R Dalal. Conditions for convergence of Monte Carlo EM sequences with an application to product diffusion modeling.The Econometrics Journal, 2(2): 248–267, 1999. Cited on page 2
work page 1999
-
[67]
A probabilistic latent variable model for acoustic modeling
Paris Smaragdis, Bhiksha Raj, and Madhusudana Shashanka. A probabilistic latent variable model for acoustic modeling. Advances in Models for Acoustic Processing Workshop, NIPS, 148:8–1, 2006. Cited on page 1
work page 2006
-
[68]
Deniz Akyildiz, Ieva Kazlauskaite, Mark Girolami, and Fehmi Cirak
Arnaud Vadeboncoeur, Ö. Deniz Akyildiz, Ieva Kazlauskaite, Mark Girolami, and Fehmi Cirak. Fully probabilistic deep models for forward and inverse problems in parametric pdes.Journal of Computational Physics, 491:112369, 2023. ISSN 0021-9991. doi: https://doi.org/10.1016/j.jcp.2023.112369. URL https://www.sciencedirect.com/science/article/pii/S00219991230...
-
[69]
Santosh Vempala and Andre Wibisono. Rapid convergence of the unadjusted Langevin algorithm: Isoperimetry suffices.Advances in Neural Information Processing Systems, 32, 2019. Cited on page 4
work page 2019
-
[70]
Greg CG Wei and Martin A Tanner. A Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithms.Journal of the American Statistical Association, 85(411):699–704,
-
[71]
Statistical exploration of the manifold hypothesis
Nick Whiteley, Annie Gray, and Patrick Rubin-Delanchy. Statistical exploration of the manifold hypothesis. arXiv preprint arXiv:2208.11665, 2022. Cited on page 1
-
[72]
Tatiana Xifara, Chris Sherlock, Samuel Livingstone, Simon Byrne, and Mark Girolami. Langevin diffusions and the metropolis-adjusted langevin algorithm.Statistics & Probability Letters, 91:14–19,
-
[73]
Deniz Akyildiz, Theodoros Damoulas, and Sotirios Sabanis
Ying Zhang, Ö. Deniz Akyildiz, Theodoros Damoulas, and Sotirios Sabanis. Nonasymptotic estimates for stochastic gradient langevin dynamics under local conditions in nonconvex optimization.Applied Mathematics & Optimization, 87(2):25, 2023. Cited on pages 10, 11, and 19. 24 Appendix A Preliminary results Lemma A.1 (KIPLD as an underdamped Langevin Diffusio...
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.