pith. sign in

arxiv: 2605.13186 · v3 · pith:YI3YOOBDnew · submitted 2026-05-13 · 🧮 math.AP · math.OC· math.PR

Nesterov acceleration for the Wasserstein minimization of displacement-convex free energies

Pith reviewed 2026-05-21 08:34 UTC · model grok-4.3

classification 🧮 math.AP math.OCmath.PR
keywords Nesterov accelerationWasserstein gradient flowdisplacement convexitymean-field Langevin dynamicsVlasov-Fokker-Planck equationPolyak-Łojasiewicz constantconvergence ratesfree energy minimization
0
0 comments X

The pith

The mean-field underdamped Langevin process achieves Nesterov acceleration for Wasserstein minimization of displacement-convex free energies.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that the mean-field underdamped Langevin process, governed by the nonlinear Vlasov-Fokker-Planck equation, converges faster than ordinary Wasserstein gradient flows when minimizing displacement-convex free energies. The improvement takes the form of a rate proportional to the square root of the Polyak-Łojasiewicz constant of the energy, which is known to be optimal for the gradient flow itself. This extends a recent linear-case result on diffusive-to-ballistic entropy improvement to the nonlinear mean-field setting.

Core claim

The mean-field underdamped Langevin process associated to the non-linear Vlasov-Fokker-Planck equation achieves a Nesterov acceleration with respect to the Wasserstein gradient flow of a displacement-convex free energy, in the sense that it converges at a rate of order given by the square-root of the Polyak-Łojasiewicz constant of the free energy, which is the optimal convergence rate for the corresponding gradient flow. This result has been made possible by the recent breakthrough that establishes such a diffusive-to-ballistic improvement in term of entropy in the linear case.

What carries the argument

The mean-field underdamped Langevin process tied to the nonlinear Vlasov-Fokker-Planck equation, which supplies the diffusive-to-ballistic entropy decay improvement.

If this is right

  • Convergence to equilibrium occurs at the square-root rate of the Polyak-Łojasiewicz constant instead of the linear rate.
  • The acceleration applies to any free energy satisfying the displacement-convexity condition.
  • The result directly transfers the linear diffusive-to-ballistic gain to the nonlinear mean-field regime.
  • Standard Wasserstein gradient flow methods are outperformed in long-time behavior by the underdamped process.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same momentum mechanism could be tested for other kinetic equations beyond the Vlasov-Fokker-Planck setting.
  • The acceleration may suggest practical improvements to sampling algorithms that rely on Langevin-type dynamics for high-dimensional distributions.
  • Further work could check whether analogous square-root rates appear for non-convex energies or different interaction potentials.
  • The approach links ideas from optimization theory directly to long-time analysis of nonlinear kinetic equations.

Load-bearing premise

The diffusive-to-ballistic improvement established for the linear case extends to the nonlinear mean-field setting governed by the Vlasov-Fokker-Planck equation under the displacement-convexity assumption.

What would settle it

A concrete counter-example or numerical simulation in which the nonlinear Vlasov-Fokker-Planck dynamics fails to improve the convergence rate to the square-root level for some displacement-convex free energy.

read the original abstract

We show that the mean-field underdamped Langevin process (associated to the non-linear Vlasov-Fokker-Planck equation) achieves a Nesterov acceleration with respect to the Wasserstein gradient flow of a displacement-convex free energy, in the sense that it converges at a rate of order given by the square-root of the Polyak-{\L}ojasiewicz constant of the free energy (which is the optimal convergence rate for the corresponding gradient flow). This result has been made possible by the recent breakthrough [42] by Jianfeng Lu, which establishes such a \emph{diffusive-to-ballistic} improvement in term of entropy in the linear case.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript claims that the mean-field underdamped Langevin process associated to the nonlinear Vlasov-Fokker-Planck equation achieves Nesterov acceleration relative to the Wasserstein gradient flow of a displacement-convex free energy, converging at a rate of order the square root of the Polyak-Łojasiewicz constant (the optimal rate for the corresponding gradient flow). The result extends the linear Fokker-Planck case established in reference [42].

Significance. If the extension from the linear setting is rigorously justified, the result would constitute a meaningful advance at the interface of optimization and mean-field PDEs, furnishing accelerated convergence rates for interacting particle systems under displacement convexity. It appropriately credits the recent linear breakthrough in [42] and could inform sampling algorithms and variational problems, though the load-bearing step is the control of nonlocal interaction terms.

major comments (2)
  1. [Abstract and §1 (Introduction)] The central claim requires that the diffusive-to-ballistic improvement from the linear case in [42] carries over to the nonlinear Vlasov-Fokker-Planck equation. The abstract and introduction do not state additional structural assumptions (e.g., uniform Lipschitz bound on the interaction kernel) needed to control the convolution term in the velocity evolution and preserve the required monotonicity or cancellation in the Lyapunov functional.
  2. [Proof of main theorem (likely §3)] In the entropy-dissipation or Lyapunov analysis (presumably §3 or §4), the cross terms generated by the nonlocal interaction potential must be estimated explicitly; without such bounds the sqrt(PL) rate may fail to close, and the manuscript should supply these estimates or identify where they follow from displacement convexity alone.
minor comments (2)
  1. [Notation and preliminaries] Clarify the precise definition of the Polyak-Łojasiewicz constant in Wasserstein space and its relation to displacement convexity at the beginning of the paper.
  2. [References] Ensure all references, including [42], are fully cited and that the summary of the linear result is accurate.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive suggestions. We address the two major comments point by point below and will revise the manuscript to improve clarity on assumptions and estimates.

read point-by-point responses
  1. Referee: [Abstract and §1 (Introduction)] The central claim requires that the diffusive-to-ballistic improvement from the linear case in [42] carries over to the nonlinear Vlasov-Fokker-Planck equation. The abstract and introduction do not state additional structural assumptions (e.g., uniform Lipschitz bound on the interaction kernel) needed to control the convolution term in the velocity evolution and preserve the required monotonicity or cancellation in the Lyapunov functional.

    Authors: We agree that the structural assumptions should be stated explicitly in the abstract and introduction. The manuscript assumes the interaction kernel is uniformly Lipschitz continuous (a standard condition in the mean-field literature to control the convolution in the velocity equation). In the revised version we will add this hypothesis to the abstract and to the opening of Section 1, together with a brief remark that it guarantees the cross terms remain compatible with the Lyapunov analysis of the linear case [42]. revision: yes

  2. Referee: [Proof of main theorem (likely §3)] In the entropy-dissipation or Lyapunov analysis (presumably §3 or §4), the cross terms generated by the nonlocal interaction potential must be estimated explicitly; without such bounds the sqrt(PL) rate may fail to close, and the manuscript should supply these estimates or identify where they follow from displacement convexity alone.

    Authors: We thank the referee for highlighting the need for explicit control of the nonlocal cross terms. In the current proof (Section 3) these terms are bounded by combining the uniform Lipschitz assumption on the kernel with the displacement-convexity of the free energy; the resulting contributions are absorbed into the dissipation via the PL inequality. To make this transparent we will insert a short auxiliary lemma (new Lemma 3.3) that isolates and estimates the interaction contribution to the time derivative of the Lyapunov functional, showing precisely how the Lipschitz bound closes the estimate without requiring stronger convexity assumptions. revision: yes

Circularity Check

0 steps flagged

No circularity: central result extends external linear-case breakthrough from independent reference

full rationale

The paper attributes the diffusive-to-ballistic improvement explicitly to the external reference [42] by Jianfeng Lu for the linear Fokker-Planck case and frames its own contribution as the extension of that result to the nonlinear mean-field Vlasov-Fokker-Planck equation under displacement-convexity of the free energy. No self-citation appears in the load-bearing steps, no parameter is fitted and then renamed as a prediction, and no definitional equivalence or ansatz smuggling is present. The derivation chain therefore remains self-contained once the cited external result is accepted, satisfying the criteria for an independent extension rather than a reduction to the paper's own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The claim depends on displacement-convexity of the free energy to obtain the Polyak-Łojasiewicz inequality and on the validity of extending the linear-case improvement to the nonlinear Vlasov-Fokker-Planck dynamics.

axioms (1)
  • domain assumption The free energy functional is displacement-convex.
    Displacement-convexity supplies the Polyak-Łojasiewicz inequality needed for the stated convergence rate.

pith-pipeline@v0.9.0 · 5638 in / 1329 out tokens · 66748 ms · 2026-05-21T08:34:19.670027+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

72 extracted references · 72 canonical work pages · 3 internal anchors

  1. [1]

    Vari- ational methods for the kinetic fokker–planck equation.Analysis & PDE, 17(6):1953– 2010, 2024

    Dallas Albritton, Scott Armstrong, Jean-Christophe Mourrat, and Matthew Novack. Vari- ational methods for the kinetic fokker–planck equation.Analysis & PDE, 17(6):1953– 2010, 2024

  2. [2]

    Shifted composition iv: toward ballistic acceleration for log-concave sampling.arXiv preprint arXiv:2506.23062, 2025

    Jason M Altschuler, Sinho Chewi, and Matthew S Zhang. Shifted composition iv: toward ballistic acceleration for log-concave sampling.arXiv preprint arXiv:2506.23062, 2025

  3. [3]

    Springer Science & Business Media, 2005

    Luigi Ambrosio, Nicola Gigli, and Giuseppe Savar´ e.Gradient flows: in metric spaces and in the space of probability measures. Springer Science & Business Media, 2005

  4. [4]

    Hypocoercivity of piecewise deterministic markov process-monte carlo.The Annals of Applied Probability, 31(5):2478–2517, 2021

    Christophe Andrieu, Alain Durmus, Nikolas N¨ usken, and Julien Roussel. Hypocoercivity of piecewise deterministic markov process-monte carlo.The Annals of Applied Probability, 31(5):2478–2517, 2021. 18

  5. [5]

    Maximum mean discrep- ancy gradient flow.Advances in neural information processing systems, 32, 2019

    Michael Arbel, Anna Korba, Adil Salim, and Arthur Gretton. Maximum mean discrep- ancy gradient flow.Advances in neural information processing systems, 32, 2019

  6. [6]

    Springer, Cham, 2014

    Dominique Bakry, Ivan Gentil, and Michel Ledoux.Analysis and geometry of Markov dif- fusion operators, volume 348 ofGrundlehren der Mathematischen Wissenschaften [Fun- damental Principles of Mathematical Sciences]. Springer, Cham, 2014

  7. [7]

    Gradient flow approach to local mean-field spin systems

    Kaveh Bashiri and Anton Bovier. Gradient flow approach to local mean-field spin systems. Stochastic Processes and their Applications, 130(3):1461–1514, 2020

  8. [8]

    Bakry–´ emery meet villani.Journal of functional analysis, 273(7):2275– 2291, 2017

    Fabrice Baudoin. Bakry–´ emery meet villani.Journal of functional analysis, 273(7):2275– 2291, 2017

  9. [9]

    A criterion on the free energy for log-Sobolev inequalities in mean-field particle systems.arXiv e-prints, page arXiv:2503.24372, March 2025

    Roland Bauerschmidt, Thierry Bodineau, and Benoˆ ıt Dagallier. A criterion on the free energy for log-Sobolev inequalities in mean-field particle systems.arXiv e-prints, page arXiv:2503.24372, March 2025

  10. [10]

    A piecewise deterministic scaling limit of lifted Metropolis-Hastings in the Curie-Weiss model.Ann

    Joris Bierkens and Gareth Roberts. A piecewise deterministic scaling limit of lifted Metropolis-Hastings in the Curie-Weiss model.Ann. Appl. Probab., 27(2):846–882, 2017

  11. [11]

    Quantitative stability of constant equilibria in a non-linear alignment model of self-propelled particles

    ´Emeric Bouin and Amic Frouvelle. Quantitative stability of constant equilibria in a non- linear alignment model of self-propelled particles.arXiv preprint arXiv:2604.05927, 2026

  12. [12]

    Hypocoercivity meets lifts.Kinetic and Related Models, 20(0):34–55, 2026

    Giovanni Brigati, Francis L¨ orler, and Lihan Wang. Hypocoercivity meets lifts.Kinetic and Related Models, 20(0):34–55, 2026

  13. [13]

    Proximal optimal transport modeling of population dynamics

    Charlotte Bunne, Laetitia Papaxanthos, Andreas Krause, and Marco Cuturi. Proximal optimal transport modeling of population dynamics. InInternational Conference on Artificial Intelligence and Statistics, pages 6511–6528. PMLR, 2022

  14. [14]

    On explicit l 2-convergence rate estimate for un- derdamped langevin dynamics.Archive for Rational Mechanics and Analysis, 247(5):90, 2023

    Yu Cao, Jianfeng Lu, and Lihan Wang. On explicit l 2-convergence rate estimate for un- derdamped langevin dynamics.Archive for Rational Mechanics and Analysis, 247(5):90, 2023

  15. [15]

    Springer, 2018

    Ren´ e Carmona, Fran¸ cois Delarue, et al.Probabilistic theory of mean field games with applications I-II, volume 3. Springer, 2018

  16. [16]

    Convergence to equilibrium in wasser- stein distance for damped euler equations with interaction forces.Communications in Mathematical Physics, 365(1):329–361, 2019

    Jos´ e A Carrillo, Young-Pil Choi, and Oliver Tse. Convergence to equilibrium in wasser- stein distance for damped euler equations with interaction forces.Communications in Mathematical Physics, 365(1):329–361, 2019

  17. [17]

    Carrillo, Robert J

    Jos´ e A. Carrillo, Robert J. McCann, and C´ edric Villani. Kinetic equilibration rates for granular media and related equations: entropy dissipation and mass transportation estimates.Revista Matem´ atica Iberoamericana, 19(3):971 – 1018, 2003

  18. [18]

    Entropic multipliers method for langevin diffusion and weighted log sobolev inequalities.Journal of Functional Analysis, 277(11):108288, 2019

    Patrick Cattiaux, Arnaud Guillin, Pierre Monmarch´ e, and Chaoen Zhang. Entropic multipliers method for langevin diffusion and weighted log sobolev inequalities.Journal of Functional Analysis, 277(11):108288, 2019

  19. [19]

    Uniform-in-time propagation of chaos for kinetic mean field Langevin dynamics.Electronic Journal of Probability, 29(none):1 – 43, 2024

    Fan Chen, Yiqing Lin, Zhenjie Ren, and Songbo Wang. Uniform-in-time propagation of chaos for kinetic mean field Langevin dynamics.Electronic Journal of Probability, 29(none):1 – 43, 2024

  20. [20]

    Uniform-in-time propagation of chaos for mean field langevin dynamics.arXiv preprint arXiv:2212.03050, 2022

    Fan Chen, Zhenjie Ren, and Songbo Wang. Uniform-in-time propagation of chaos for mean field langevin dynamics.arXiv preprint arXiv:2212.03050, 2022. 19

  21. [21]

    Shi Chen, Qin Li, Oliver Tse, and Stephen J. Wright. Accelerating optimization over the space of probability measures.Journal of Machine Learning Research, 26(31):1–40, 2025

  22. [22]

    Mean-field langevin dynamics : Exponential convergence and annealing

    L´ ena¨ ıc Chizat. Mean-field langevin dynamics : Exponential convergence and annealing. Transactions on Machine Learning Research, 2022

  23. [23]

    Dalalyan and Lionel Riou-Durand

    Arnak S. Dalalyan and Lionel Riou-Durand. On sampling from a log-concave density using kinetic Langevin diffusions.Bernoulli, 26(3):1956 – 1988, 2020

  24. [24]

    Mat´ ıas G Delgadino, Rishabh S Gvalani, Grigorios A Pavliotis, and Scott A Smith. Phase transitions, logarithmic sobolev inequalities, and uniform-in-time propagation of chaos for weakly interacting diffusions.Communications in Mathematical Physics, pages 1–49, 2023

  25. [25]

    Randomized Hamiltonian Monte Carlo as scaling limit of the bouncy particle sampler and dimension-free convergence rates.The Annals of Applied Probability, 31(6):2612 – 2662, 2021

    George Deligiannidis, Daniel Paulin, Alexandre Bouchard-Cˆ ot´ e, and Arnaud Doucet. Randomized Hamiltonian Monte Carlo as scaling limit of the bouncy particle sampler and dimension-free convergence rates.The Annals of Applied Probability, 31(6):2612 – 2662, 2021

  26. [26]

    Analysis of a nonreversible markov chain sampler.Annals of Applied Probability, pages 726–752, 2000

    Persi Diaconis, Susan Holmes, and Radford M Neal. Analysis of a nonreversible markov chain sampler.Annals of Applied Probability, pages 726–752, 2000

  27. [27]

    Hypocoercivity for linear kinetic equations conserving mass.Transactions of the American Mathematical Society, 367(6):3807–3828, 2015

    Jean Dolbeault, Cl´ ement Mouhot, and Christian Schmeiser. Hypocoercivity for linear kinetic equations conserving mass.Transactions of the American Mathematical Society, 367(6):3807–3828, 2015

  28. [28]

    Asymptotic bias of inexact markov chain monte carlo methods in high dimension.The Annals of Applied Probability, 34(4):3435–3468, 2024

    Alain Durmus and Andreas Eberle. Asymptotic bias of inexact markov chain monte carlo methods in high dimension.The Annals of Applied Probability, 34(4):3435–3468, 2024

  29. [29]

    Non-reversible lifts of reversible diffusion processes and relaxation times

    L¨ orler Francis Eberle, Andreas. Non-reversible lifts of reversible diffusion processes and relaxation times

  30. [30]

    Continuized accelerations of deterministic and stochastic gradient descents, and of gossip algorithms.Advances in Neural Information Processing Systems, 34:28054–28066, 2021

    Mathieu Even, Rapha¨ el Berthier, Francis Bach, Nicolas Flammarion, Hadrien Hendrikx, Pierre Gaillard, Laurent Massouli´ e, and Adrien Taylor. Continuized accelerations of deterministic and stochastic gradient descents, and of gossip algorithms.Advances in Neural Information Processing Systems, 34:28054–28066, 2021

  31. [31]

    Sharp hypocoercive convergence estimates for underdamped Langevin dynamics via the modified $L^2$ method

    Zexi Fan, Bowen Li, and Jianfeng Lu. Sharp hypocoercive convergence estimates for underdamped Langevin dynamics via the modifiedL 2 method.arXiv preprint arXiv:2604.10068, 2026

  32. [32]

    On the rate of convergence in wasserstein distance of the empirical measure.Probability theory and related fields, 162(3-4):707–738, 2015

    Nicolas Fournier and Arnaud Guillin. On the rate of convergence in wasserstein distance of the empirical measure.Probability theory and related fields, 162(3-4):707–738, 2015

  33. [33]

    Spectral decompositions andL 2-operator norms of toy hypocoercive semi-groups.Kinetic and related models, 6(2):317–372, 2013

    S´ ebastien Gadat and Laurent Miclo. Spectral decompositions andL 2-operator norms of toy hypocoercive semi-groups.Kinetic and related models, 6(2):317–372, 2013

  34. [34]

    A math- ematical perspective on transformers.Bulletin of the American Mathematical Society, 62(3):427–479, 2025

    Borjan Geshkovski, Cyril Letrouit, Yury Polyanskiy, and Philippe Rigollet. A math- ematical perspective on transformers.Bulletin of the American Mathematical Society, 62(3):427–479, 2025

  35. [35]

    HMC and Underdamped Langevin United in the Unadjusted Convex Smooth Case.SIAM/ASA Journal on Uncertainty Quantification, 13(1):278–303, 2025

    Nicolas Gouraud, Pierre Le Bris, Adrien Majka, and Pierre Monmarch´ e. HMC and Underdamped Langevin United in the Unadjusted Convex Smooth Case.SIAM/ASA Journal on Uncertainty Quantification, 13(1):278–303, 2025. 20

  36. [36]

    Uniform Poincar´ e and log- arithmic Sobolev inequalities for mean field particle systems.The Annals of Applied Probability, 32(3):1590 – 1614, 2022

    Arnaud Guillin, Wei Liu, Liming Wu, and Chaoen Zhang. Uniform Poincar´ e and log- arithmic Sobolev inequalities for mean field particle systems.The Annals of Applied Probability, 32(3):1590 – 1614, 2022

  37. [37]

    Uniform long-time and propagation of chaos estimates for mean field kinetic particles in non-convex landscapes.Journal of Statistical Physics, 185:1–20, 2021

    Arnaud Guillin and Pierre Monmarch´ e. Uniform long-time and propagation of chaos estimates for mean field kinetic particles in non-convex landscapes.Journal of Statistical Physics, 185:1–20, 2021

  38. [38]

    Short and long time behavior of the fokker–planck equation in a confining potential and applications.Journal of Functional Analysis, 244(1):95–118, 2007

    Fr´ ed´ eric H´ erau. Short and long time behavior of the fokker–planck equation in a confining potential and applications.Journal of Functional Analysis, 244(1):95–118, 2007

  39. [39]

    Isotropic hypoellipticity and trend to equilibrium for the fokker-planck equation with a high-degree potential.Archive for Rational Mechanics and Analysis, 171(2):151–218, 2004

    Fr´ ed´ eric H´ erau and Francis Nier. Isotropic hypoellipticity and trend to equilibrium for the fokker-planck equation with a high-degree potential.Archive for Rational Mechanics and Analysis, 171(2):151–218, 2004

  40. [40]

    Mean-field Langevin dy- namics and energy landscape of neural networks.Annales de l’Institut Henri Poincar´ e, Probabilit´ es et Statistiques, 57(4):2043 – 2065, 2021

    Kaitong Hu, Zhenjie Ren, David ˇSiˇ ska, and Lukasz Szpruch. Mean-field Langevin dy- namics and energy landscape of neural networks.Annales de l’Institut Henri Poincar´ e, Probabilit´ es et Statistiques, 57(4):2043 – 2065, 2021

  41. [41]

    Linear convergence of gradient and proximal-gradient methods under the polyak- lojasiewicz condition

    Hamed Karimi, Julie Nutini, and Mark Schmidt. Linear convergence of gradient and proximal-gradient methods under the polyak- lojasiewicz condition. InJoint European conference on machine learning and knowledge discovery in databases, pages 795–811. Springer, 2016

  42. [42]

    Variational inference via wasserstein gradient flows.Advances in Neural Information Processing Systems, 35:14434–14447, 2022

    Marc Lambert, Sinho Chewi, Francis Bach, Silv` ere Bonnabel, and Philippe Rigollet. Variational inference via wasserstein gradient flows.Advances in Neural Information Processing Systems, 35:14434–14447, 2022

  43. [43]

    Convergence rates for an adap- tive biasing potential scheme from a wasserstein optimization perspective.Nonlinearity, 39(4):045016, 2026

    Tony Leli` evre, Xuyang Lin, and Pierre Monmarch´ e. Convergence rates for an adap- tive biasing potential scheme from a wasserstein optimization perspective.Nonlinearity, 39(4):045016, 2026

  44. [44]

    Partial differential equations and stochastic methods in moleculardynamics.Acta Numerica, 25:681–880, 2016

    Tony Lelievre and Gabriel Stoltz. Partial differential equations and stochastic methods in moleculardynamics.Acta Numerica, 25:681–880, 2016

  45. [45]

    Ensembles semi-analytiques.IHES notes, page 220, 1965

    Stanislaw Lojasiewicz. Ensembles semi-analytiques.IHES notes, page 220, 1965

  46. [46]

    A sharp hypocoercive entropy decay estimate for underdamped Langevin dynamics

    Jianfeng Lu. A sharp hypocoercive entropy decay estimate for underdamped Langevin dynamics.arXiv e-prints, page arXiv:2605.01933, May 2026

  47. [47]

    On explicit l 2-convergence rate estimate for piecewise deterministic markov processes in mcmc algorithms.The Annals of Applied Probability, 32(2):1333–1361, 2022

    Jianfeng Lu and Lihan Wang. On explicit l 2-convergence rate estimate for piecewise deterministic markov processes in mcmc algorithms.The Annals of Applied Probability, 32(2):1333–1361, 2022

  48. [48]

    A mean field view of the land- scape of two-layer neural networks.Proceedings of the National Academy of Sciences, 115(33):E7665–E7671, 2018

    Song Mei, Andrea Montanari, and Phan-Minh Nguyen. A mean field view of the land- scape of two-layer neural networks.Proceedings of the National Academy of Sciences, 115(33):E7665–E7671, 2018

  49. [49]

    On the implicit regularization of langevin dynamics with projected noise.arXiv preprint arXiv:2602.12257, 2026

    Govind Menon, Austin J Stromme, and Adrien Vacher. On the implicit regularization of langevin dynamics with projected noise.arXiv preprint arXiv:2602.12257, 2026

  50. [50]

    ´Etude spectrale minutieuse de processus moins ind´ ecis que les autres

    Laurent Miclo and Pierre Monmarch´ e. ´Etude spectrale minutieuse de processus moins ind´ ecis que les autres. InS´ eminaire de Probabilit´ es XLV, volume 2078 ofLecture Notes in Math., pages 459–481. Springer, Cham, 2013. 21

  51. [51]

    Piecewise deterministic simulated annealing.ALEA Lat

    Pierre Monmarch´ e. Piecewise deterministic simulated annealing.ALEA Lat. Am. J. Probab. Math. Stat., 13(1):357–398, 2016

  52. [52]

    Generalized Γ calculus and application to interacting particles on a graph.Potential Analysis, 50:439–466, 2019

    Pierre Monmarch´ e. Generalized Γ calculus and application to interacting particles on a graph.Potential Analysis, 50:439–466, 2019

  53. [53]

    An entropic approach for Hamiltonian Monte Carlo: The idealized case.The Annals of Applied Probability, 34(2):2243 – 2293, 2024

    Pierre Monmarch´ e. An entropic approach for Hamiltonian Monte Carlo: The idealized case.The Annals of Applied Probability, 34(2):2243 – 2293, 2024

  54. [54]

    Free energy Wasserstein gradient flow and their particle counter- parts: toy model, (degenerate) PL inequalities and exit times.arXiv e-prints, page arXiv:2510.16506, October 2025

    Pierre Monmarch´ e. Free energy Wasserstein gradient flow and their particle counter- parts: toy model, (degenerate) PL inequalities and exit times.arXiv e-prints, page arXiv:2510.16506, October 2025

  55. [55]

    Uniform log-sobolev inequalities for mean field particles beyond flat- convexity.Stochastic Processes and their Applications, 2025

    Pierre Monmarch´ e. Uniform log-sobolev inequalities for mean field particles beyond flat- convexity.Stochastic Processes and their Applications, 2025

  56. [56]

    Local convergence rates for wasserstein gradient flows and mckean-vlasov equations with multiple stationary solutions.Probability Theory and Related Fields, pages 1–59, 2025

    Pierre Monmarch´ e and Julien Reygner. Local convergence rates for wasserstein gradient flows and mckean-vlasov equations with multiple stationary solutions.Probability Theory and Related Fields, pages 1–59, 2025

  57. [57]

    Exact targeting of gibbs distributions using velocity-jump processes.Stochastics and Partial Differential Equa- tions: Analysis and Computations, pages 1–40, 2022

    Pierre Monmarch´ e, Matthias Rousset, and Pierre-Andr´ e Zitt. Exact targeting of gibbs distributions using velocity-jump processes.Stochastics and Partial Differential Equa- tions: Analysis and Computations, pages 1–40, 2022

  58. [58]

    Long-time behaviour and propagation of chaos for mean field kinetic particles.Stochastic Processes and their Applications, 127(6):1721–1737, 2017

    Pierre Monmarch´ e. Long-time behaviour and propagation of chaos for mean field kinetic particles.Stochastic Processes and their Applications, 127(6):1721–1737, 2017

  59. [59]

    A method for solving the convex programming problem with convergence rate o (1/k2)

    Yurii Nesterov. A method for solving the convex programming problem with convergence rate o (1/k2). InDokl akad nauk Sssr, volume 269, page 543, 1983

  60. [60]

    The geometry of dissipative evolution equations: the porous medium equation

    Felix Otto. The geometry of dissipative evolution equations: the porous medium equation. 2001

  61. [61]

    E. A. J. F. Peters and G. de With. Rejection-free monte carlo sampling for general potentials.Phys. Rev. E 85, 026703, 2012

  62. [62]

    Entropic approximation of wasserstein gradient flows.SIAM Journal on Imaging Sciences, 8(4):2323–2351, 2015

    Gabriel Peyr´ e. Entropic approximation of wasserstein gradient flows.SIAM Journal on Imaging Sciences, 8(4):2323–2351, 2015

  63. [63]

    Some methods of speeding up the convergence of iteration methods.Ussr computational mathematics and mathematical physics, 4(5):1–17, 1964

    Boris T Polyak. Some methods of speeding up the convergence of iteration methods.Ussr computational mathematics and mathematical physics, 4(5):1–17, 1964

  64. [64]

    Etienne Sandier and Sylvia Serfaty. Gamma-convergence of gradient flows with applica- tions to ginzburg-landau.Communications on Pure and Applied Mathematics: A Journal Issued by the Courant Institute of Mathematical Sciences, 57(12):1627–1672, 2004

  65. [65]

    Understanding the accelera- tion phenomenon via high-resolution differential equations.Mathematical Programming, 195(1):79–148, 2022

    Bin Shi, Simon S Du, Michael I Jordan, and Weijie J Su. Understanding the accelera- tion phenomenon via high-resolution differential equations.Mathematical Programming, 195(1):79–148, 2022

  66. [66]

    A differential equation for model- ing nesterov’s accelerated gradient method: Theory and insights.Journal of Machine Learning Research, 17(153):1–43, 2016

    Weijie Su, Stephen Boyd, and Emmanuel J Candes. A differential equation for model- ing nesterov’s accelerated gradient method: Theory and insights.Journal of Machine Learning Research, 17(153):1–43, 2016. 22

  67. [67]

    Quantitative propagation of chaos of mckean-vlasov equations via the master equation

    Alvin Tsz Ho Tse. Quantitative propagation of chaos of mckean-vlasov equations via the master equation. 2019

  68. [68]

    Hypocoercivity.Mem

    C´ edric Villani. Hypocoercivity.Mem. Amer. Math. Soc., 202(950):iv+141, 2009

  69. [69]

    Uniform log-Sobolev inequalities for mean field particles with flat-convex energy.arXiv e-prints, page arXiv:2408.03283, August 2024

    Songbo Wang. Uniform log-Sobolev inequalities for mean field particles with flat-convex energy.arXiv e-prints, page arXiv:2408.03283, August 2024

  70. [70]

    Large-scale concentration and relaxation for mean-field langevin particle systems.arXiv preprint arXiv:2508.16428, 2025

    Songbo Wang. Large-scale concentration and relaxation for mean-field langevin particle systems.arXiv preprint arXiv:2508.16428, 2025

  71. [71]

    Accelerated information gradient flow.Journal of Scientific Computing, 90(1):11, 2022

    Yifei Wang and Wuchen Li. Accelerated information gradient flow.Journal of Scientific Computing, 90(1):11, 2022

  72. [72]

    A lyapunov analysis of accelerated methods in optimization.Journal of Machine Learning Research, 22(113):1–34, 2021

    Ashia C Wilson, Ben Recht, and Michael I Jordan. A lyapunov analysis of accelerated methods in optimization.Journal of Machine Learning Research, 22(113):1–34, 2021. 23