Wasserstein Mirror Gradient Flow as the limit of the Sinkhorn Algorithm

Geoffrey Schiebinger; Nabarun Deb; Soumik Pal; Young-Heon Kim

arxiv: 2307.16421 · v2 · submitted 2023-07-31 · 🧮 math.PR · math.AP· stat.ML

Wasserstein Mirror Gradient Flow as the limit of the Sinkhorn Algorithm

Nabarun Deb , Young-Heon Kim , Soumik Pal , Geoffrey Schiebinger This is my paper

Pith reviewed 2026-05-24 07:54 UTC · model grok-4.3

classification 🧮 math.PR math.APstat.ML

keywords Sinkhorn algorithmWasserstein spacemirror gradient flowoptimal transportMonge-Ampère equationMcKean-Vlasov diffusioniterative proportional fittingrelative entropy

0 comments

The pith

The Sinkhorn algorithm converges in a scaled limit to a Wasserstein mirror gradient flow on the space of probability measures.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proves that iterating the Sinkhorn algorithm on joint densities, with regularization parameter epsilon and number of iterations scaled as 1/epsilon, produces marginal distributions that converge to an absolutely continuous curve in the 2-Wasserstein space. This limiting curve is identified as a mirror gradient flow in Wasserstein space, with the mirror map given by half the squared Wasserstein distance to one marginal and the gradient taken from the relative entropy with respect to the other. A reader would care because this bridges a widely used computational method in optimal transport to a continuous-time dynamical system, opening the door to PDE-based analysis of its behavior and long-term properties. The work also links the flow to the parabolic Monge-Ampère equation and constructs an associated McKean-Vlasov process.

Core claim

The sequence of marginals from the Sinkhorn iterations converges to an absolutely continuous curve on the 2-Wasserstein space as ε → 0 with iterations scaled as 1/ε. This Sinkhorn flow is a Wasserstein mirror gradient flow, where the gradient is that of the relative entropy functional with respect to one marginal and the mirror is half of the squared Wasserstein distance functional from the other marginal. The norm of its velocity field can be interpreted as the metric derivative with respect to the linearized optimal transport distance. An equivalent description is the parabolic Monge-Ampère PDE. Conditions for exponential convergence are derived, and a McKean-Vlasov diffusion isconstructed

What carries the argument

The Sinkhorn flow, the limit of scaled Sinkhorn iterations, acting as a Wasserstein mirror gradient flow with relative entropy gradient and Wasserstein distance mirror.

If this is right

The limiting flow satisfies the parabolic Monge-Ampère PDE.
Exponential convergence of the flow to equilibrium holds under the derived conditions.
Marginals of a constructed McKean-Vlasov diffusion coincide with the Sinkhorn flow.
The speed of the flow equals the metric derivative with respect to the linearized optimal transport distance.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Sinkhorn iterations could serve as a practical numerical scheme to approximate solutions of the parabolic Monge-Ampère equation.
The mirror flow structure may extend to other regularized optimal transport algorithms beyond the classical Sinkhorn procedure.
The McKean-Vlasov construction opens a route to particle-based simulations of the flow for sampling or mean-field models.

Load-bearing premise

The number of Sinkhorn iterations must scale precisely as one over the regularization parameter epsilon, together with suitable technical conditions on the joint densities.

What would settle it

Numerical computation showing that marginals from Sinkhorn iterations with N exactly 1/epsilon fail to approach the solution of the parabolic Monge-Ampère PDE in the 2-Wasserstein metric as epsilon goes to zero, for some initial joint densities meeting the paper's assumptions.

Figures

Figures reproduced from arXiv: 2307.16421 by Geoffrey Schiebinger, Nabarun Deb, Soumik Pal, Young-Heon Kim.

**Figure 1.** Figure 1: Convergence of the Sinkhorn iterates (ρ ε k , k ∈ N) to an absolutely continuous curve (the Sinkhorn flow) in the 2- Wasserstein space, with k = O(1/ε), as ε → 0. Here, for a given ε > 0, we view the points on the corresponding curves as a discretization of an absolutely continuous curve with O(1/ε) points. denotes the standard Kullback-Leibler (KL) divergence and ε > 0 is called the regularization parame… view at source ↗

**Figure 2.** Figure 2: Evolution of the Sinkhorn flow (ρt, t ≥ 0). On the left, ∇ut is the Brenier map transporting ρt to e −g . (ut, t ≥ 0) satisfies the parabolic Monge-Amp`ere PDE: ∂ ∂t∇ut = ∇WKL(ρt ∥ e −f ). See Theorem 1.3. On the right, wt = u ∗ t is the corresponding family of convex conjugates. ∇wt = (∇ut) −1 transports e −g to ρt. For s < t, t − s ≈ 0, the difference ∇wt − ∇ws is approximately vs ◦ ∇ws, where vs is the … view at source ↗

read the original abstract

We prove that the sequence of marginals obtained from the iterations of the Sinkhorn algorithm or the iterative proportional fitting procedure (IPFP) on joint densities, converges to an absolutely continuous curve on the $2$-Wasserstein space, as the regularization parameter $\varepsilon$ goes to zero and the number of iterations is scaled as $1/\varepsilon$ (and other technical assumptions). This limit, which we call the Sinkhorn flow, is an example of a Wasserstein mirror gradient flow, a concept we introduce here inspired by the well-known Euclidean mirror gradient flows. In the case of Sinkhorn, the gradient is that of the relative entropy functional with respect to one of the marginals and the mirror is half of the squared Wasserstein distance functional from the other marginal. Interestingly, the norm of the velocity field of this flow can be interpreted as the metric derivative with respect to the linearized optimal transport (LOT) distance. An equivalent description of this flow is provided by the parabolic Monge-Amp\`{e}re PDE whose connection to the Sinkhorn algorithm was noticed by Berman (2020). We derive conditions for exponential convergence for this limiting flow. We also construct a Mckean-Vlasov diffusion whose marginal distributions follow the Sinkhorn flow.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Sinkhorn iterations converge to a Wasserstein mirror gradient flow under 1/ε scaling, with a new mirror interpretation and PDE link.

read the letter

The core result here is a convergence theorem: under the scaling where iterations grow like 1/ε as ε→0, plus technical assumptions on the joint densities, the marginals from Sinkhorn/IPFP converge in 2-Wasserstein space to an absolutely continuous curve they name the Sinkhorn flow. They then identify this limit as a mirror gradient flow, with the relative entropy as the gradient functional and half the squared Wasserstein distance as the mirror, plus an interpretation of the velocity via the linearized OT metric. An equivalent parabolic Monge-Ampère PDE description is given, building on Berman's earlier observation, and they add exponential convergence conditions plus a McKean-Vlasov diffusion whose marginals follow the flow. This is the genuinely new piece relative to prior work on the PDE connection. The argument structure looks internally consistent from the abstract, with no obvious circularity or hidden fitting. The assumptions are stated up front, which is honest. The main soft spot is that the convergence relies on a precise scaling and density conditions that could turn out restrictive in practice; without seeing the full derivation it is hard to judge how delicate the limit passage is. The paper is aimed at researchers in optimal transport and probability who care about continuous limits of discrete algorithms. It is the kind of targeted theoretical contribution that belongs in a specialized journal and deserves a serious referee to check the technical steps.

Referee Report

0 major / 3 minor

Summary. The manuscript proves a convergence result for the Sinkhorn algorithm (or IPFP) applied to joint densities: as the regularization parameter ε tends to zero and the number of iterations is scaled proportionally to 1/ε (under additional technical assumptions on the densities), the sequence of marginals converges to an absolutely continuous curve in the 2-Wasserstein space. This limiting curve, termed the Sinkhorn flow, is identified as a Wasserstein mirror gradient flow, where the gradient is that of the relative entropy with respect to one marginal and the mirror is half the squared Wasserstein distance to the other marginal. The flow is also shown to satisfy a parabolic Monge-Ampère PDE, conditions for exponential convergence are derived, and a McKean-Vlasov diffusion is constructed whose marginals follow the flow.

Significance. If the main convergence theorem holds, the paper makes a significant contribution by bridging discrete iterative algorithms in optimal transport with continuous gradient flow dynamics in Wasserstein space through the new notion of Wasserstein mirror gradient flows. The explicit connection to the parabolic Monge-Ampère equation and the construction of the associated diffusion process provide concrete analytical tools. The result is self-contained with stated assumptions and offers potential for further developments in mean-field limits and numerical analysis of OT methods.

minor comments (3)

[Abstract] Abstract: the scaling 'number of iterations is scaled as 1/ε' should be stated more precisely (e.g., whether N_ε ∼ C/ε for a fixed C or an exact floor function) since this scaling is load-bearing for the convergence statement.
[References] The citation to Berman (2020) appears in the abstract and introduction but lacks a corresponding entry in the bibliography; this should be added for completeness.
[§2] §2: the notation distinguishing the regularized joint densities from their marginals is introduced inline but would benefit from a short dedicated notation subsection to improve readability for readers unfamiliar with IPFP variants.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of our manuscript, the accurate summary of the main results, and the recommendation for minor revision. The report correctly identifies the core contribution as the convergence of Sinkhorn iterations (with appropriate scaling) to a Wasserstein mirror gradient flow, along with the connections to the parabolic Monge-Ampère equation and the associated McKean-Vlasov diffusion. We are pleased that the significance of introducing this new notion of mirror gradient flows in Wasserstein space is recognized.

Circularity Check

0 steps flagged

No significant circularity; self-contained limit theorem

full rationale

The paper establishes a convergence result: under the scaling of iterations ~1/ε and technical assumptions on joint densities, the Sinkhorn/IPFP marginals converge in 2-Wasserstein space to an absolutely continuous curve (the Sinkhorn flow). This limit is then identified as a Wasserstein mirror gradient flow (with relative entropy gradient and W2 mirror) whose velocity norm matches the LOT metric derivative, and which satisfies a parabolic Monge-Ampère PDE previously linked to Sinkhorn by Berman (2020). The central derivation is a limit theorem with assumptions stated upfront; the new concept is introduced by direct analogy to Euclidean mirror flows rather than by self-referential definition or fitted parameters. No load-bearing self-citations, ansatz smuggling, or reductions by construction appear in the argument structure.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on unspecified technical assumptions for the densities and scaling; no free parameters, invented entities, or explicit axioms are visible in the abstract.

axioms (1)

domain assumption technical assumptions on joint densities and iteration scaling as ε → 0
Abstract states these are required for the sequence to converge to an absolutely continuous curve.

pith-pipeline@v0.9.0 · 5765 in / 1185 out tokens · 37308 ms · 2026-05-24T07:54:22.320492+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean; Foundation/BranchSelection.lean; Foundation/AbsoluteFloorClosure.lean washburn_uniqueness_aczel; reality_from_one_distinction; branch_selection unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

the sequence of marginals ... converges to an absolutely continuous curve on the 2-Wasserstein space ... Sinkhorn flow ... Wasserstein mirror gradient flow ... parabolic Monge-Ampère PDE

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation
math.ST 2024-12 unverdicted novelty 7.0

Provides asymptotic distributions for entropic OT plans and potentials under vanishing regularization and links self-transport barycentric projections to score functions.

Reference graph

Works this paper leans on

20 extracted references · 20 canonical work pages · cited by 1 Pith paper · 1 internal anchor

[1]

[ACB17] Martin Arjovsky, Soumith Chintala, and L´ eon Bottou, Wasserstein generative adversarial networks, International conference on machine learning, PMLR, 2017, pp

[AC21] Kwangjun Ahn and Sinho Chewi, Efficient constrained sampling via the mirror-Langevin algorithm, Advances in Neural Information Pro- cessing Systems 34 (2021), 28405–28418. [ACB17] Martin Arjovsky, Soumith Chintala, and L´ eon Bottou, Wasserstein generative adversarial networks, International conference on machine learning, PMLR, 2017, pp. 214–223. ...

work page 2021
[2]

1-2, 43–100

[AMTU01] Anton Arnold, Peter Markowich, Giuseppe Toscani, and Andreas Un- terreiter, On convex Sobolev inequalities and the rate of convergence to equilibrium for Fokker-Planck type equations , Communications in Partial Differential Equations 26 (2001), no. 1-2, 43–100. [AS07] Luigi Ambrosio and Giuseppe Savar´ e, Gradient flows of probability measures, H...

work page 2001
[3]

11, 116019

[CCCC20] Tianji Cai, Junyi Cheng, Nathaniel Craig, and Katy Craig, Linearized optimal transport for collider events , Physical Review D 102 (2020), no. 11, 116019. [CCGT22] Alberto Chiarini, Giovanni Conforti, Giacomo Greco, and Luca Tamanini, Gradient estimates for the Schr¨ odinger potentials: conver- gence to the Brenier map and quantitative stability ...

work page arXiv 2020
[4]

, Kinetic and Related Models 15 (2022), no

MR 4316725 [CD22] Louis-Pierre Chaintron and Antoine Diez, Propagation of chaos: A review of models, methods and applications. , Kinetic and Related Models 15 (2022), no. 6, 1017–1173. [CGP16] Yongxin Chen, Tryphon Georgiou, and Michele Pavon, Entropic and displacement interpolation: a computational approach using the Hilbert metric , SIAM J. Appl. Math. ...

work page arXiv 2022
[5]

MR 4232667 [Cut13] Marco Cuturi, Sinkhorn distances: Lightspeed computation of opti- mal transport, Advances in neural information processing systems 26 46 NABARUN DEB, YOUNG-HEON KIM, SOUMIK PAL, AND GEOFFREY SCHIEBINGER (2013). [CZHS22] L´ ena¨ ıc Chizat, Stephen Zhang, Matthieu Heitz, and Geoffrey Schiebinger, Trajectory inference via mean-field Langev...

work page arXiv 2013
[6]

[EN22b] , Quantitative stability of regularized optimal transport and convergence of Sinkhorn’s algorithm, SIAM J

[EN22a] Stephan Eckstein and Marcel Nutz, Convergence rates for regularized optimal transport via quantization , arXiv preprint arXiv:2208.14391 (2022). [EN22b] , Quantitative stability of regularized optimal transport and convergence of Sinkhorn’s algorithm, SIAM J. Math. Anal. 54 (2022), no. 6, 5922–5948. MR 4506579 [FL89] Joel Franklin and Jens Lorenz,...

work page arXiv 2022
[7]

MR 4460341 [GT18] Nicola Gigli and Luca Tamanini, Second order differentiation formula on RCD(K, N) spaces, Atti Accad. Naz. Lincei Rend. Lincei Mat. Appl. 29 (2018), no. 2, 377–386. MR 3797990 [HJ94] Roger A Horn and Charles R Johnson, Topics in matrix analysis , Cambridge university press,

work page 2018
[8]

[HS87] Richard Holley and Daniel Stroock, Logarithmic Sobolev inequalities and stochastic Ising models, J. Statist. Phys.46 (1987), no. 5-6, 1159–

work page 1987
[9]

[Jab14] Pierre-Emmanuel Jabin, A review of the mean field limits for Vlasov equations, Kinetic and Related Models 7 (2014), no

MR 893137 [HY97] Shima Hirohiko and Katsumi Yagi, Geometry of Hessian maifolds , Differential Geometry and its Applications 7 (1997), 277–290. [Jab14] Pierre-Emmanuel Jabin, A review of the mean field limits for Vlasov equations, Kinetic and Related Models 7 (2014), no. 4, 661–711. [JKO98] Richard Jordan, David Kinderlehrer, and Felix Otto, The variationa...

work page 1997
[10]

on the theory of Brownian motion

[KSW12] Young-Heon Kim, Jeffrey Streets, and Micah Warren, Parabolic op- timal transport equations on manifolds , International Mathematics Research Notices 2012 (2012), no. 19, 4325–4350. [L´ eg21] Flavien L´ eger,A gradient descent perspective on Sinkhorn , Applied Mathematics & Optimization 84 (2021), no. 2, 1843–1855. [L´ eo12] Christian L´ eonard,Fro...

work page 2012
[11]

[LM00] Beatrice Laurent and Pascal Massart, Adaptive estimation of a qua- dratic functional by model selection, Annals of Statistics (2000), 1302–

work page 2000
[12]

1, 363–389

[MC23] Caroline Moosm¨ uller and Alexander Cloninger,Linear optimal trans- port embedding: provable Wasserstein classification for certain rigid transformations and perturbations , Information and Inference: A Journal of the IMA 12 (2023), no. 1, 363–389. [McC95] Robert J. McCann, Existence and uniqueness of monotone measure- preserving maps, Duke Mathema...

work page 2023
[13]

[McK75] H. P. McKean, Fluctuations in the kinetic theory of gases , Communi- cations on Pure and Applied Mathematics 28 (1975), no. 4, 435–455. [Mik04] Toshio Mikami, Monge’s problem with a quadratic cost by the zero- noise limit of h-path processes, Probab. Theory Related Fields 129 (2004), no. 2, 245–260. MR 2063377 [MJ66] Henry P McKean Jr, A class of ...

work page 1975
[14]

[OV00] F

[Ott01] Felix Otto, The geometry of dissipative evolution equations: The porous medium equation , Communications in Partial Differential Equations (2001). [OV00] F. Otto and C. Villani, Generalization of an inequality by Talagrand and links with the logarithmic Sobolev inequality , J. Funct. Anal. 173 (2000), no. 2, 361–400. MR 1760620 [Pal19] Soumik Pal,...

work page arXiv 2001
[15]

[RW23] Cale Rankin and Leonard Wong, Bregman-Wasserstein divergence: geometry and applications , Math arxiv preprint arXiv:2302.05833,

[R¨ us95] Ludger R¨ uschendorf,Convergence of the iterative proportional fitting procedure, The Annals of Statistics (1995), 1160–1174. [RW23] Cale Rankin and Leonard Wong, Bregman-Wasserstein divergence: geometry and applications , Math arxiv preprint arXiv:2302.05833,

work page arXiv 1995
[16]

[San15] Filippo Santambrogio, Optimal transport for applied mathematicians, Birk¨ auser, NY55 (2015), no. 58-63,

work page 2015
[17]

Score-Based Generative Modeling through Stochastic Differential Equations

[SDWMG15] Jascha Sohl-Dickstein, Eric Weiss, Niru Maheswaranathan, and Surya Ganguli, Deep unsupervised learning using nonequilibrium thermody- namics, International Conference on Machine Learning, PMLR, 2015, pp. 2256–2265. [Sin67] Richard Sinkhorn, Diagonal equivalence to matrices with prescribed row and column sums , The American Mathematical Monthly 7...

work page internal anchor Pith review Pith/arXiv arXiv 2015
[18]

Villani, Optimal transportation, dissipative PDE’s and func- tional inequalities , Unpublished lecture notes

MR 1964483 [Vil12] C. Villani, Optimal transportation, dissipative PDE’s and func- tional inequalities , Unpublished lecture notes. Accessed from https://cedricvillani.org/sites/dev/files/old_images/ 2012/08/B04.MFranca.pdf,

work page 2012
[19]

Operator Theory 46 (2001), no

[Wan01] Feng-Yu Wang, Logarithmic Sobolev inequalities: conditions and counterexamples, J. Operator Theory 46 (2001), no. 1, 183–197. MR 1862186 [Wil18] Ashia Wilson, Lyapunov arguments in optimization , Phd thesis, UC Berkeley, 2018, Available at https://escholarship.org/uc/item/ 1116c975. [WL20] Yifei Wang and Wuchen Li, Information Newton’s flow: secon...

work page arXiv 2001
[20]

Ozolek, and Gus- tavo K

[WSB+13] Wei Wang, Dejan Slepˇ cev, Saurav Basu, John A. Ozolek, and Gus- tavo K. Rohde, A linear optimal transportation framework for quan- tifying and visualizing variations in sets of images , Int. J. Comput. Vis. 101 (2013), no. 2, 254–269. MR 3021062 [ZPFP20] Kelvin Shuangjian Zhang, Gabriel Peyr´ e, Jalal Fadili, and Marcelo Pereyra, Wasserstein con...

work page 2013

[1] [1]

[ACB17] Martin Arjovsky, Soumith Chintala, and L´ eon Bottou, Wasserstein generative adversarial networks, International conference on machine learning, PMLR, 2017, pp

[AC21] Kwangjun Ahn and Sinho Chewi, Efficient constrained sampling via the mirror-Langevin algorithm, Advances in Neural Information Pro- cessing Systems 34 (2021), 28405–28418. [ACB17] Martin Arjovsky, Soumith Chintala, and L´ eon Bottou, Wasserstein generative adversarial networks, International conference on machine learning, PMLR, 2017, pp. 214–223. ...

work page 2021

[2] [2]

1-2, 43–100

[AMTU01] Anton Arnold, Peter Markowich, Giuseppe Toscani, and Andreas Un- terreiter, On convex Sobolev inequalities and the rate of convergence to equilibrium for Fokker-Planck type equations , Communications in Partial Differential Equations 26 (2001), no. 1-2, 43–100. [AS07] Luigi Ambrosio and Giuseppe Savar´ e, Gradient flows of probability measures, H...

work page 2001

[3] [3]

11, 116019

[CCCC20] Tianji Cai, Junyi Cheng, Nathaniel Craig, and Katy Craig, Linearized optimal transport for collider events , Physical Review D 102 (2020), no. 11, 116019. [CCGT22] Alberto Chiarini, Giovanni Conforti, Giacomo Greco, and Luca Tamanini, Gradient estimates for the Schr¨ odinger potentials: conver- gence to the Brenier map and quantitative stability ...

work page arXiv 2020

[4] [4]

, Kinetic and Related Models 15 (2022), no

MR 4316725 [CD22] Louis-Pierre Chaintron and Antoine Diez, Propagation of chaos: A review of models, methods and applications. , Kinetic and Related Models 15 (2022), no. 6, 1017–1173. [CGP16] Yongxin Chen, Tryphon Georgiou, and Michele Pavon, Entropic and displacement interpolation: a computational approach using the Hilbert metric , SIAM J. Appl. Math. ...

work page arXiv 2022

[5] [5]

MR 4232667 [Cut13] Marco Cuturi, Sinkhorn distances: Lightspeed computation of opti- mal transport, Advances in neural information processing systems 26 46 NABARUN DEB, YOUNG-HEON KIM, SOUMIK PAL, AND GEOFFREY SCHIEBINGER (2013). [CZHS22] L´ ena¨ ıc Chizat, Stephen Zhang, Matthieu Heitz, and Geoffrey Schiebinger, Trajectory inference via mean-field Langev...

work page arXiv 2013

[6] [6]

[EN22b] , Quantitative stability of regularized optimal transport and convergence of Sinkhorn’s algorithm, SIAM J

[EN22a] Stephan Eckstein and Marcel Nutz, Convergence rates for regularized optimal transport via quantization , arXiv preprint arXiv:2208.14391 (2022). [EN22b] , Quantitative stability of regularized optimal transport and convergence of Sinkhorn’s algorithm, SIAM J. Math. Anal. 54 (2022), no. 6, 5922–5948. MR 4506579 [FL89] Joel Franklin and Jens Lorenz,...

work page arXiv 2022

[7] [7]

MR 4460341 [GT18] Nicola Gigli and Luca Tamanini, Second order differentiation formula on RCD(K, N) spaces, Atti Accad. Naz. Lincei Rend. Lincei Mat. Appl. 29 (2018), no. 2, 377–386. MR 3797990 [HJ94] Roger A Horn and Charles R Johnson, Topics in matrix analysis , Cambridge university press,

work page 2018

[8] [8]

[HS87] Richard Holley and Daniel Stroock, Logarithmic Sobolev inequalities and stochastic Ising models, J. Statist. Phys.46 (1987), no. 5-6, 1159–

work page 1987

[9] [9]

[Jab14] Pierre-Emmanuel Jabin, A review of the mean field limits for Vlasov equations, Kinetic and Related Models 7 (2014), no

MR 893137 [HY97] Shima Hirohiko and Katsumi Yagi, Geometry of Hessian maifolds , Differential Geometry and its Applications 7 (1997), 277–290. [Jab14] Pierre-Emmanuel Jabin, A review of the mean field limits for Vlasov equations, Kinetic and Related Models 7 (2014), no. 4, 661–711. [JKO98] Richard Jordan, David Kinderlehrer, and Felix Otto, The variationa...

work page 1997

[10] [10]

on the theory of Brownian motion

[KSW12] Young-Heon Kim, Jeffrey Streets, and Micah Warren, Parabolic op- timal transport equations on manifolds , International Mathematics Research Notices 2012 (2012), no. 19, 4325–4350. [L´ eg21] Flavien L´ eger,A gradient descent perspective on Sinkhorn , Applied Mathematics & Optimization 84 (2021), no. 2, 1843–1855. [L´ eo12] Christian L´ eonard,Fro...

work page 2012

[11] [11]

[LM00] Beatrice Laurent and Pascal Massart, Adaptive estimation of a qua- dratic functional by model selection, Annals of Statistics (2000), 1302–

work page 2000

[12] [12]

1, 363–389

[MC23] Caroline Moosm¨ uller and Alexander Cloninger,Linear optimal trans- port embedding: provable Wasserstein classification for certain rigid transformations and perturbations , Information and Inference: A Journal of the IMA 12 (2023), no. 1, 363–389. [McC95] Robert J. McCann, Existence and uniqueness of monotone measure- preserving maps, Duke Mathema...

work page 2023

[13] [13]

[McK75] H. P. McKean, Fluctuations in the kinetic theory of gases , Communi- cations on Pure and Applied Mathematics 28 (1975), no. 4, 435–455. [Mik04] Toshio Mikami, Monge’s problem with a quadratic cost by the zero- noise limit of h-path processes, Probab. Theory Related Fields 129 (2004), no. 2, 245–260. MR 2063377 [MJ66] Henry P McKean Jr, A class of ...

work page 1975

[14] [14]

[OV00] F

[Ott01] Felix Otto, The geometry of dissipative evolution equations: The porous medium equation , Communications in Partial Differential Equations (2001). [OV00] F. Otto and C. Villani, Generalization of an inequality by Talagrand and links with the logarithmic Sobolev inequality , J. Funct. Anal. 173 (2000), no. 2, 361–400. MR 1760620 [Pal19] Soumik Pal,...

work page arXiv 2001

[15] [15]

[RW23] Cale Rankin and Leonard Wong, Bregman-Wasserstein divergence: geometry and applications , Math arxiv preprint arXiv:2302.05833,

[R¨ us95] Ludger R¨ uschendorf,Convergence of the iterative proportional fitting procedure, The Annals of Statistics (1995), 1160–1174. [RW23] Cale Rankin and Leonard Wong, Bregman-Wasserstein divergence: geometry and applications , Math arxiv preprint arXiv:2302.05833,

work page arXiv 1995

[16] [16]

[San15] Filippo Santambrogio, Optimal transport for applied mathematicians, Birk¨ auser, NY55 (2015), no. 58-63,

work page 2015

[17] [17]

Score-Based Generative Modeling through Stochastic Differential Equations

[SDWMG15] Jascha Sohl-Dickstein, Eric Weiss, Niru Maheswaranathan, and Surya Ganguli, Deep unsupervised learning using nonequilibrium thermody- namics, International Conference on Machine Learning, PMLR, 2015, pp. 2256–2265. [Sin67] Richard Sinkhorn, Diagonal equivalence to matrices with prescribed row and column sums , The American Mathematical Monthly 7...

work page internal anchor Pith review Pith/arXiv arXiv 2015

[18] [18]

Villani, Optimal transportation, dissipative PDE’s and func- tional inequalities , Unpublished lecture notes

MR 1964483 [Vil12] C. Villani, Optimal transportation, dissipative PDE’s and func- tional inequalities , Unpublished lecture notes. Accessed from https://cedricvillani.org/sites/dev/files/old_images/ 2012/08/B04.MFranca.pdf,

work page 2012

[19] [19]

Operator Theory 46 (2001), no

[Wan01] Feng-Yu Wang, Logarithmic Sobolev inequalities: conditions and counterexamples, J. Operator Theory 46 (2001), no. 1, 183–197. MR 1862186 [Wil18] Ashia Wilson, Lyapunov arguments in optimization , Phd thesis, UC Berkeley, 2018, Available at https://escholarship.org/uc/item/ 1116c975. [WL20] Yifei Wang and Wuchen Li, Information Newton’s flow: secon...

work page arXiv 2001

[20] [20]

Ozolek, and Gus- tavo K

[WSB+13] Wei Wang, Dejan Slepˇ cev, Saurav Basu, John A. Ozolek, and Gus- tavo K. Rohde, A linear optimal transportation framework for quan- tifying and visualizing variations in sets of images , Int. J. Comput. Vis. 101 (2013), no. 2, 254–269. MR 3021062 [ZPFP20] Kelvin Shuangjian Zhang, Gabriel Peyr´ e, Jalal Fadili, and Marcelo Pereyra, Wasserstein con...

work page 2013