Stability of the Monge Map in Semi-Dual Optimal Transport
Pith reviewed 2026-05-20 23:46 UTC · model grok-4.3
The pith
The semi-dual optimal transport problem equates to constrained optimization due to its degenerate saddle-point structure.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The semi-dual formulation of the optimal transport problem has a degenerate saddle-point structure, and its numerical solution is equivalent to solving a constrained optimization problem. Necessary and sufficient conditions for the convergence of Monge maps can be derived without requiring optimality of the dual potential.
What carries the argument
The degenerate saddle-point structure of the semi-dual optimal transport formulation, which permits equivalence to a constrained optimization problem and supports derivation of Monge-map convergence conditions independent of dual optimality.
If this is right
- Numerical algorithms can update the Monge map using convergence conditions that ignore whether the dual potential has reached optimality.
- The observed need for extra iterations on the transport map in practice follows directly from the degenerate saddle-point structure.
- Stability analysis of the Monge map becomes possible in settings where dual optimality is hard to certify or enforce.
Where Pith is reading between the lines
- The same structural argument may apply to other transport problems that exhibit degenerate saddle points, such as certain unbalanced or entropic variants.
- Implementations could deliberately relax dual-potential updates once the derived conditions on the map are met, potentially reducing total computation time.
- Testing the conditions on low-dimensional examples with known exact maps would provide a direct numerical check of the equivalence claim.
Load-bearing premise
The semi-dual optimal transport problem admits a degenerate saddle-point structure that makes its numerical solution equivalent to a constrained optimization problem.
What would settle it
A concrete counter-example in which the semi-dual problem fails to reduce to constrained optimization, or in which the Monge map converges even though the stated necessary and sufficient conditions are violated.
Figures
read the original abstract
This paper shows that the semi-dual formulation of the optimal transport problem has a degenerate saddle-point structure, and that its numerical solution is equivalent to solving a constrained optimization problem. We derive necessary and sufficient conditions for the convergence of Monge maps without requiring optimality of the dual potential. This analysis helps explain why, in practice, numerical algorithms often require more iterations to update the transport map than the potential.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript shows that the semi-dual formulation of optimal transport possesses a degenerate saddle-point structure whose numerical solution is equivalent to a constrained optimization problem. It derives necessary and sufficient conditions for convergence of the Monge map that do not require optimality of the dual potential, and uses this to explain why practical algorithms typically need more iterations to stabilize the map than the potential.
Significance. If the central derivations are correct, the work supplies a useful theoretical account of observed numerical behavior in semi-dual OT solvers and separates map stability from full dual optimality. The reformulation as a constrained problem and the convergence criteria could inform algorithm design and stability analysis in applications of optimal transport.
major comments (1)
- §3.1, the equivalence argument: the reduction from the degenerate saddle-point to the constrained problem is stated at the level of the continuous formulation; it is not shown whether the same equivalence holds exactly for the discrete or semi-discrete discretizations that are used in the numerical section, which would be needed to justify the iteration-count explanation.
minor comments (2)
- Notation for the semi-dual functional and the Monge map is introduced without a consolidated table of symbols; adding one would improve readability.
- The numerical experiments section would benefit from a brief statement of the discretization scheme and the precise stopping criterion used to count iterations for the map versus the potential.
Simulated Author's Rebuttal
We thank the referee for the positive assessment, the recommendation of minor revision, and the constructive comment on the equivalence argument. We address the point below and will incorporate a clarification in the revised manuscript.
read point-by-point responses
-
Referee: §3.1, the equivalence argument: the reduction from the degenerate saddle-point to the constrained problem is stated at the level of the continuous formulation; it is not shown whether the same equivalence holds exactly for the discrete or semi-discrete discretizations that are used in the numerical section, which would be needed to justify the iteration-count explanation.
Authors: We agree that the link to the numerical experiments would be stronger if the equivalence were stated explicitly for the discrete setting. The reduction itself is purely algebraic and follows identically when the measures are atomic: the degenerate saddle-point structure reduces to a constrained finite-dimensional optimization problem by replacing integrals with finite sums over the support points. This is the mechanism that produces the observed lag between potential and map stabilization in the experiments of Section 4. In the revised manuscript we will insert a short remark (or a brief appendix paragraph) confirming that the same equivalence holds verbatim for the discrete and semi-discrete formulations used in the numerics, thereby directly justifying the iteration-count discussion. revision: yes
Circularity Check
No significant circularity; derivation self-contained
full rationale
The paper's central claims rest on observing the degenerate saddle-point structure of the semi-dual OT formulation and deriving necessary and sufficient convergence conditions for Monge maps that do not presuppose dual optimality. These steps are presented as direct consequences of the problem's convexity and marginal constraints rather than reductions to fitted inputs, self-citations, or definitional equivalences. No load-bearing premise collapses to a prior result by the same authors or to a renamed empirical pattern; the explanation for iteration counts follows from the structure without circular forcing. The analysis is therefore independent of its inputs.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The semi-dual formulation of optimal transport admits a degenerate saddle-point structure.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the semi-dual optimal transport problem possesses a degenerate saddle-point structure: at the optimal transport map, the objective functional becomes independent of the potential
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Theorem 3 … F(ψ⋆,T⋆)=F(ψ,T⋆) … flatness of the functional in the dual variable
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
- [1]
- [2]
-
[3]
Kantorovich, L. V. and Rubinshtein G. Sh. , title =. Dokl. Akad. Nauk SSSR , year =
-
[4]
Brenier, Yann , title =. Comm. Pure Appl. Math. , year =
- [5]
-
[6]
Duke Mathematical Journal , year =
McCann, Robert , title =. Duke Mathematical Journal , year =
-
[7]
Pavliotis, G. A. , title =
-
[8]
Lipman, Yaron and Chen, Ricky T. Q. and Ben-Hamu, Heli and Nickel, Maximilian and Le Matt , title =. https://arxiv.org/abs/2210.02747 , year =
work page internal anchor Pith review Pith/arXiv arXiv
- [9]
- [10]
- [11]
-
[12]
Vershik, A. M. , title =. Russian Mathematical Surveys , volume =
-
[13]
Kantorovich, L. V. , title =. Uspekhi Matematicheskikh Nauk , volume =
-
[14]
Bogachev, V. I. and R\". Kolmogorov Problems on Equations for Stationary and Transition Probabilities of Diffusion Processes , journal =. 2023 , doi =
work page 2023
-
[15]
Annales de l'Institut Henri Poincaré (C) Analyse Non Linéaire , volume =
Pratelli, Aldo , title =. Annales de l'Institut Henri Poincaré (C) Analyse Non Linéaire , volume =
-
[16]
Bogachev, V. I. and Kalinin, A. N. and Popova, S. N. , title =. Journal of Mathematical Sciences , volume =. 2019 , note =
work page 2019
-
[17]
Ambrosio, L. and Pratelli, A. , title =. Optimal Transportation and Applications , series =. 2003 , pages =
work page 2003
- [18]
-
[19]
Dobrushin, R. L. , title =. Theory of Probability and Its Applications , volume =
-
[20]
Fr\'echet, M , title =. C. R. Acad. Sci. Paris , volume =
-
[21]
Wasserstein Generative Adversarial Networks , booktitle =
Mart. Wasserstein Generative Adversarial Networks , booktitle =. 2017 , publisher =
work page 2017
-
[22]
Improved Training of Wasserstein GANs , booktitle =
Ishaan Gulrajani and Faruk Ahmed and Mart. Improved Training of Wasserstein GANs , booktitle =. 2017 , url =
work page 2017
-
[23]
Brian D. O. Anderson , title =. Stochastic Processes and their Applications , volume =. 1982 , doi =
work page 1982
-
[24]
Rockafellar, R. Tyrrell and Wets, Roger J.-B. , title =. Pacific Journal of Mathematics , volume =
-
[25]
arXiv preprint arXiv:2110.02999 , year =
Generative Modeling with Optimal Transport Maps , author =. arXiv preprint arXiv:2110.02999 , year =
-
[26]
Calculus of Variations and Partial Differential Equations , volume =
Existence, Duality, and Cyclical Monotonicity for Weak Transport Costs , author =. Calculus of Variations and Partial Differential Equations , volume =. 2019 , doi =
work page 2019
-
[27]
Rockafellar, R. Tyrrell , title =. Nonlinear Operators and the Calculus of Variations , editor =. 1976 , series =
work page 1976
- [28]
-
[29]
Advances in Neural Information Processing Systems , volume=
Sinkhorn Distances: Lightspeed Computation of Optimal Transport , author=. Advances in Neural Information Processing Systems , volume=
-
[30]
The American Mathematical Monthly , volume =
Sinkhorn, Richard , title =. The American Mathematical Monthly , volume =
-
[31]
Training-Free Voice Conversion with Factorized Optimal Transport , author=. Interspeech , year=
- [32]
-
[33]
X-vectors: Robust DNN embeddings for speaker recognition , author=. ICASSP , year=
-
[34]
Advances in Neural Information Processing Systems , volume=
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis , author=. Advances in Neural Information Processing Systems , volume=
-
[35]
International Conference on Learning Representations (ICLR) , year=
DiffWave: A Versatile Diffusion Model for Audio Synthesis , author=. International Conference on Learning Representations (ICLR) , year=
-
[36]
WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching , author=. Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) , pages=
- [37]
-
[38]
Score-Based Generative Modeling through Stochastic Differential Equations , author=. ICLR , year=
-
[39]
Schr. Sitzungsberichte der Preussischen Akademie der Wissenschaften, Physikalisch-mathematische Klasse , year =
-
[40]
Chetrite, Rapha. E. Schr. The European Physical Journal H , volume =
-
[41]
Discrete and Continuous Dynamical Systems , year=
A survey of the Schrödinger problem and some of its connections with optimal transport , author=. Discrete and Continuous Dynamical Systems , year=
-
[42]
arXiv preprint arXiv:2106.01357 , year =
Peluchetti, Stefano , title =. arXiv preprint arXiv:2106.01357 , year =
-
[43]
Hitchcock, F. L. , title =. Journal of Mathematics and Physics , volume =. 1941 , doi =
work page 1941
- [44]
- [45]
-
[46]
Rubinshtein, G. S. , title =. Vestnik Leningrad University. Mathematics, Mechanics, Astronomy , volume =. 1958 , note =
work page 1958
- [47]
-
[48]
A Statistical Learning Perspective on Semi-dual Adversarial Neural Optimal Transport Solvers , author=. Proc. ICLR , year=
-
[49]
Villani, C\'. Optimal Transport. Old and New , publisher =. 2009 , OPTkey =
work page 2009
-
[50]
Neural Optimal Transport , author=. Proc. ICLR , year=
-
[51]
Proceedings of the 37th International Conference on Machine Learning , pages =
Optimal Transport Mapping via Input Convex Neural Networks , author =. Proceedings of the 37th International Conference on Machine Learning , pages =. 2020 , editor =
work page 2020
-
[52]
Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2 Benchmark , author=. NeurIPS , year=
-
[53]
Optimal Transport Mapping via Input Convex Neural Networks , author=. ICML , year=
-
[54]
arXiv preprint arXiv:1909.13082 , year=
Wasserstein-2 Generative Networks , author=. arXiv preprint arXiv:1909.13082 , year=
-
[55]
Semi-dual Regularized Optimal Transport , author=. SIAM Review , year=
-
[56]
A Statistical Learning Perspective on Semi-dual Adversarial Neural Optimal Transport Solvers , author=. arXiv , year=
-
[57]
Parameter Tuning and Model Selection in Optimal Transport with Semi-dual Brenier Formulation , author=. arXiv:2112.07275 , year=
-
[58]
Riemannian Neural Optimal Transport , author=. arXiv:2602.03566 , year=
- [59]
- [60]
-
[61]
arXiv preprint arXiv:2106.03812 , year =
Neural Monge Map Estimation and Its Applications , author =. arXiv preprint arXiv:2106.03812 , year =
-
[62]
Generative Modeling through the Semi-dual Formulation of Unbalanced Optimal Transport , author =. ICLR , year =
-
[63]
2-wasserstein approximation via restricted convex potentials with application to improved training for gans , author =. arXiv preprint:1902.07197 , year =
work page internal anchor Pith review Pith/arXiv arXiv 1902
- [64]
-
[65]
Overcoming Spurious Solutions in Semi-Dual Neural Optimal Transport: A Smoothing Approach for Learning the Optimal Transport Plan , author =. NerISP , year =
-
[66]
Proceedings of the Edinburgh Mathematical Society , author=
On. Proceedings of the Edinburgh Mathematical Society , author=. 2011 , pages=. doi:10.1017/S001309150800117X , number=
-
[67]
The. Bull. Amer. Math. Soc. , author=. 2014 , pages=. doi:https://doi.org/10.1090/S0273-0979-2014-01459-4 , number=
-
[68]
(q,p)-Wasserstein GANs: Comparing Ground Metrics for Wasserstein GANs
(q,p)-Wasserstein GANs: Comparing Ground Metrics for Wasserstein GANs , author=. arXiv preprint arXiv:1902.03642 , year=
work page internal anchor Pith review Pith/arXiv arXiv 1902
- [69]
- [70]
-
[71]
Annals of Mathematics , volume =
von Neumann, John , title =. Annals of Mathematics , volume =
-
[72]
GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium
Martin Heusel and Hubert Ramsauer and Thomas Unterthiner and Bernhard Nessler and G. GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium , journal =. 2017 , url =. 1706.08500 , timestamp =
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[73]
Minimax estimation of smooth optimal transport maps , journal =
H\". Minimax estimation of smooth optimal transport maps , journal =
-
[74]
Bogachev, Vladimir I. and Kolesnikov, Aleksandr V. , title =. Russian Math. Surveys , year =
- [75]
- [76]
-
[77]
arXiv preprint arXiv:2201.12220 , year=
Neural optimal transport , author=. arXiv preprint arXiv:2201.12220 , year=
-
[78]
Transactions on Machine Learning Research , pages=
Neural monge map estimation and its applications , author=. Transactions on Machine Learning Research , pages=
-
[79]
arXiv preprint arXiv:2502.01310 , year=
A Statistical Learning Perspective on Semi-dual Adversarial Neural Optimal Transport Solvers , author=. arXiv preprint arXiv:2502.01310 , year=
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.