A Proximal Primal-Dual Approach to Generalized JKO Schemes for Doubly Nonlinear Parabolic Equations
Pith reviewed 2026-05-10 01:20 UTC · model grok-4.3
The pith
A proximal primal-dual method yields explicit formulas for proximal operators in generalized JKO schemes for doubly nonlinear equations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By recasting the generalized JKO scheme as a proximal primal-dual optimization problem, explicit formulas for the proximal operators with general costs are obtained, which are then used to approximate the solutions of doubly nonlinear parabolic equations in a computationally tractable manner.
What carries the argument
The proximal primal-dual algorithm applied to the generalized JKO scheme, which derives and applies proximal operators for general costs to discretize the gradient-flow dynamics.
If this is right
- The method produces numerical solutions for the p-Laplace equation as a special case.
- Flux-limited equations including the relativistic heat equation can be handled within the same framework.
- Validation occurs by recovering the qualitative behavior of known steepest descent evolutions.
- The scheme extends variational optimization techniques to a broad family of doubly nonlinear parabolic PDEs.
Where Pith is reading between the lines
- The explicit proximal formulas may simplify implementation for other gradient flows with similar cost structures not explicitly tested here.
- This optimization lens could connect to related numerical methods for measure-valued or nonlinear diffusion problems.
- Parameter studies on the general cost functions might become feasible in applications where direct discretization is costly.
Load-bearing premise
Explicit and efficiently computable formulas for the proximal operators exist for the general family of costs considered and the discrete scheme reproduces the continuous dynamics without uncontrolled artifacts.
What would settle it
A numerical test on a known case such as the heat equation or p-Laplace flow where the computed solutions deviate from the exact or expected qualitative behavior would falsify the approach.
Figures
read the original abstract
Variational methods based on optimization strategies are proposed to numerically solve a large family of nonlinear partial differential equations. They are all particular instances of gradient flows with general costs, including the $p$-Laplace equation and flux-limited equations such as the relativistic heat equation. This is achieved by computing explicit formulas for proximal operators with general costs amenable to efficient numerical approximation. We showcase our numerical approach via validation of the results by recovering the qualitative behavior of particular known cases of this large family of steepest descent evolutions.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a proximal primal-dual variational method to discretize generalized JKO gradient flows for doubly nonlinear parabolic equations, including the p-Laplace equation and flux-limited equations such as the relativistic heat equation. Explicit formulas for proximal operators associated with general costs are derived to enable efficient numerical approximation, with validation performed by recovering the expected qualitative features (e.g., finite propagation speed) on a small number of textbook cases.
Significance. If the explicit proximal formulas are correct and the resulting scheme is shown to be consistent and convergent, the approach would provide a useful optimization-based framework for treating a broad family of nonlinear PDEs as gradient flows with general costs, extending classical JKO schemes to doubly nonlinear settings.
major comments (2)
- [Numerical validation] Numerical validation section: the scheme is tested only by qualitative recovery of known behaviors (finite propagation speed, etc.) on a few instances; no a-priori error estimates, no convergence tables under mesh/time-step refinement, and no quantitative comparison to exact solutions or high-resolution references are provided. This directly affects the central claim that the proximal discretization faithfully reproduces the continuous dynamics without uncontrolled artifacts.
- [§3] Proximal-operator derivations (abstract and §3): while explicit formulas are asserted to exist and be amenable to efficient approximation for general costs, the manuscript supplies neither the full derivations nor an analysis of their computational cost or stability for arbitrary costs beyond the validated special cases. This is load-bearing for the claim of broad applicability.
minor comments (2)
- [Introduction] Notation for the general cost functional and the associated proximal operator could be introduced with a single consolidated definition early in the paper to improve readability.
- [Introduction] A few references to recent work on proximal methods for flux-limited equations appear to be missing; adding them would strengthen the literature context.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive feedback on our manuscript. We address the two major comments point by point below, indicating the revisions we intend to incorporate.
read point-by-point responses
-
Referee: Numerical validation section: the scheme is tested only by qualitative recovery of known behaviors (finite propagation speed, etc.) on a few instances; no a-priori error estimates, no convergence tables under mesh/time-step refinement, and no quantitative comparison to exact solutions or high-resolution references are provided. This directly affects the central claim that the proximal discretization faithfully reproduces the continuous dynamics without uncontrolled artifacts.
Authors: We acknowledge that the present numerical section emphasizes qualitative recovery of features such as finite propagation speed to illustrate the scheme's fidelity to the underlying doubly nonlinear dynamics. For many equations in the family, closed-form exact solutions are unavailable, which limits direct quantitative error measurement. Nevertheless, we agree that additional quantitative evidence would strengthen the central claim. In the revised manuscript we will add a new subsection containing mesh-refinement studies and L^1-error tables for the linear heat equation (where exact solutions exist) and for the p-Laplacian with p=2, together with comparisons against high-resolution finite-volume references for the relativistic heat equation. We will also explicitly note that a priori error estimates lie outside the scope of this work, whose focus is the construction of explicit proximal operators, and flag this as an important direction for future analysis. revision: partial
-
Referee: Proximal-operator derivations (abstract and §3): while explicit formulas are asserted to exist and be amenable to efficient approximation for general costs, the manuscript supplies neither the full derivations nor an analysis of their computational cost or stability for arbitrary costs beyond the validated special cases. This is load-bearing for the claim of broad applicability.
Authors: Section 3 does derive the proximal operators by reducing the variational problem to a scalar optimization whose solution yields an explicit formula for each cost. To address the referee's concern, we will expand §3 with a complete, self-contained derivation (including the first-order optimality conditions and the reduction to a one-dimensional problem) and move the most technical steps to a new appendix. We will also add a short subsection that quantifies the computational cost: for costs satisfying standard convexity and growth assumptions the proximal step is solved by a safeguarded Newton or bisection method whose complexity is linear in the number of spatial degrees of freedom. Stability of the resulting time-stepping scheme under a mild CFL-type restriction derived from the proximal mapping will likewise be stated. These additions will make the general applicability explicit while retaining the validated special cases as illustrative examples. revision: yes
Circularity Check
No circularity: derivation relies on standard proximal-operator theory applied to external PDE models
full rationale
The paper computes explicit proximal-operator formulas for general costs and applies them to discretize doubly nonlinear gradient flows. These formulas are derived directly from the given cost functions without re-using the target PDE solution or fitted parameters as inputs. Validation consists of qualitative reproduction of known behaviors on textbook cases, which constitutes external checking rather than self-referential fitting. No self-definitional equations, no prediction of fitted quantities, and no load-bearing self-citations appear in the derivation chain. The scheme is therefore self-contained against the stated assumptions.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Existence and uniqueness of proximal operators for the chosen family of costs and energies.
Reference graph
Works this paper leans on
-
[1]
M. Agueh. Existence of solutions to degenerate paraboli c equations via the Monge-Kantorovich theory. Adv. Differential Equations , 10(3):309–360, 2005
2005
-
[2]
M. Agueh. Rates of decay to equilibria for p-Laplacian type equations. Nonlinear Anal. , 68(7):1909–1927, 2008
1909
-
[3]
Agueh, A
M. Agueh, A. Blanchet, and J. A. Carrillo. Large time asym ptotics of the doubly nonlinear equation in the non-displacement convexity regime. 10(1): 59–84
-
[4]
Ambrosio, N
L. Ambrosio, N. Gigli, and G. Savar´ e. Gradient flows: in metric spaces and in the space of probability measures. Springer Science & Business Media, 2008
2008
-
[5]
Andreu, V
F. Andreu, V. Caselles, J. M. Maz´ on, and J. S. Moll. A diffu sion equation in transparent media. J. Evol. Equ. , 7(1):113–143, 2007
2007
-
[6]
Andreu, V
F. Andreu, V. Caselles, J. M. Maz´ on, and S. Moll. Finite p ropagation speed for limited flux diffusion equations. Arch. Ration. Mech. Anal. , 182(2):269–297, 2006
2006
-
[7]
H. H. Bauschke and P. L. Combettes. Convex analysis and monotone operator theory in Hilbert spaces. CMS Books in Mathematics/Ouvrages de Math´ ematiques de la SMC. Springer, Cham, second edition, 2017. With a foreword by H´ edy Attouch . 24
2017
-
[8]
Bellomo, A
N. Bellomo, A. Bellouquid, J. Nieto, and J. Soler. On the a symptotic theory from microscopic to macroscopic growing tissue models: an overview with perspe ctives. Math. Models Methods Appl. Sci. , 22(1):1130001, 37, 2012
2012
-
[9]
Benamou and Y
J.-D. Benamou and Y. Brenier. A computational fluid mecha nics solution to the monge- kantorovich mass transfer problem. Numerische Mathematik , 84(3):375–393, 2000
2000
-
[10]
Benamou and G
J.-D. Benamou and G. Carlier. Augmented Lagrangian met hods for transport optimization, mean field games and degenerate elliptic equations. J. Optim. Theory Appl. , 167(1):1–26, 2015
2015
-
[11]
Bertrand, A
J. Bertrand, A. Pratelli, and M. Puel. Kantorovich pote ntials and continuity of total cost for relativistic cost functions. J. Math. Pures Appl. (9) , 110:93–122, 2018
2018
-
[12]
Bertrand and M
J. Bertrand and M. Puel. The optimal mass transport prob lem for relativistic costs. Calc. Var. Partial Differential Equations , 46(1-2):353–374, 2013
2013
-
[13]
Y. Brenier. Extended Monge-Kantorovich theory. In Optimal transportation and applications (Martina Franca, 2001) , volume 1813 of Lecture Notes in Math. , pages 91–121. Springer, Berlin, 2003
2001
-
[14]
L. M. Brice˜ no-Arias, P. L. Combettes, and F. J. Silva. P roximity operators of perspective functions with nonlinear scaling. SIAM J. Optim. , 34(4):3212–3234, 2024
2024
-
[15]
L. M. Brice˜ no-Arias, D. Kalise, and F. J. Silva. Proxim al methods for stationary mean field games with local couplings. SIAM J. Control Optim. , 56(2):801–836, 2018
2018
-
[16]
L. M. Brice˜ no-Arias, D. Kalise, Z. Kobeissi, M. Lauri` ere, A. M. Gonzalez, and F. J. Silva. On the implementation of a primal-dual algorithm for second or der time-dependent mean field games with local couplings. ESAIM: Proceedings and Surveys , 65:330–348, 2019
2019
-
[17]
Caillet and F
T. Caillet and F. Santambrogio. Doubly nonlinear diffus ive PDEs: new existence results via generalized Wasserstein gradient flows. SIAM J. Math. Anal. , 56(6):7043–7073, 2024
2024
-
[18]
Carlier, V
G. Carlier, V. Duval, G. Peyr´ e, and B. Schmitzer. Conve rgence of entropic schemes for optimal transport and gradient flows. SIAM Journal on Mathematical Analysis , 49(2):1385–1418, 2017
2017
-
[19]
J. A. Carrillo, V. Caselles, and S. Moll. On the relativi stic heat equation in one space dimension. Proc. Lond. Math. Soc. (3) , 107(6):1395–1423, 2013
2013
-
[20]
J. A. Carrillo, K. Craig, L. W ang, and C. W ei. Primal dual methods for wasserstein gradient flows. Foundations of Computational Mathematics , 22(2):389–443, 2022
2022
-
[21]
J. A. Carrillo, R. S. Gvalani, and J. S.-H. W u. An invaria nce principle for gradient flows in the space of probability measures. J. Differential Equations , 345:233–284, 2023
2023
-
[22]
J. A. Carrillo, D. Matthes, and M.-T. W olfram. Lagrangi an schemes for Wasserstein gradient flows. In Geometric partial differential equations. Part II , volume 22 of Handb. Numer. Anal., pages 271–311. Elsevier/North-Holland, Amsterdam, 2021
2021
-
[23]
J. A. Carrillo, R. McCann, and C. Villani. Kinetic equil ibration rates for granular media and related equations: entropy dissipation and mass transp ortation estimates. Rev. Mat. Iberoamericana, 19(3):971–1018, 2003
2003
-
[24]
J. A. Carrillo, R. J. McCann, and C. Villani. Contractio ns in the 2-Wasserstein length space and thermalization of granular media. Arch. Ration. Mech. Anal. , 179(2):217–263, 2006
2006
-
[25]
J. A. Carrillo and J. Moll. Numerical simulation of diffu sive and aggregation phenomena in nonlinear continuity equations by evolving diffeomorphi sms. SIAM J. Sci. Comput. , 31(6):4305–4329, 2009
2009
-
[26]
J. A. Carrillo, L. W ang, and C. W ei. Structure preservin g primal dual methods for gradient flows with nonlinear mobility transport distances. SIAM J. Numer. Anal. , 62(1):376–399, 2024
2024
-
[27]
Chambolle and T
A. Chambolle and T. Pock. A first-order primal-dual algo rithm for convex problems with application to imaging. J. Math. Imaging Vis. , 40:120–145, 2011
2011
-
[28]
P. L. Combettes and C. L. M¨ uller. Perspective function s: proximal calculus and applications in high-dimensional statistics. J. Math. Anal. Appl. , 457(2):1283–1306, 2018
2018
-
[29]
P. L. Combettes and C. L. M¨ uller. Perspective maximum l ikelihood-type estimation via prox- imal decomposition. Electron. J. Stat. , 14(1):207–238, 2020
2020
-
[30]
L. Condat. A primal-dual splitting method for convex op timization involving Lipschitzian, proximable and linear composite terms. J. Optim. Theory Appl. , 158(2):460–479, 2013
2013
-
[31]
Davis and W
D. Davis and W. Yin. A three-operator splitting scheme a nd its optimization applications. Set-valued and variational analysis , 25(4):829–858, 2017
2017
-
[32]
Fukushima, Z.-Q
M. Fukushima, Z.-Q. Luo, and P. Tseng. Smoothing functi ons for second-order-cone comple- mentarity problems. SIAM Journal on Optimization , 12(2):436–460, 2002
2002
-
[33]
Jordan, D
R. Jordan, D. Kinderlehrer, and F. Otto. The variationa l formulation of the Fokker–Planck equation. SIAM J. Math. Anal. , 29(1):1–17, 1998
1998
-
[34]
Junge, D
O. Junge, D. Matthes, and H. Osberger. A fully discrete v ariational scheme for solving nonlin- 25 ear Fokker–Planck equations in multiple space dimensions. SIAM Journal on Numerical Analysis, 55(1):419–443, 2017
2017
-
[35]
W. Li, J. Lu, and L. W ang. Fisher information regulariza tion schemes for Wasserstein gradient flows. J. Comput. Phys. , pages 109449, 24pp., 2020
2020
-
[36]
Matthes and B
D. Matthes and B. S¨ ollner. Discretization of flux-limi ted gradient flows: Γ-convergence and numerical schemes. Math. Comp. , 89(323):1027–1057, 2020
2020
-
[37]
R. J. McCann and M. Puel. Constructing a relativistic he at flow by transport time steps. Ann. Inst. H. Poincar´ e C Anal. Non Lin´ eaire, 26(6):2539–2580, 2009
2009
-
[38]
A. Mielke. On evolutionary Γ -convergence for gradient systems. In Macroscopic and large scale phenomena: coarse graining, mean field limits and ergo dicity, volume 3 of Lect. Notes Appl. Math. Mech. , pages 187–249. Springer, [Cham], 2016
2016
-
[39]
F. Otto. Double degenerate diffusion equations as steep est descent. Sonderforschungsbereich, 256, 1996
1996
-
[40]
F. Otto. The geometry of dissipative evolution equatio ns: the porous medium equation. Comm. Partial Differential Equations , 26(1-2):101–174, 2001
2001
-
[41]
Papadakis, G
N. Papadakis, G. Peyre, and E. Oudet. Optimal transport with proximal splitting. SIAM. J. Image. Sci. , 7(1):212–238, 2014
2014
-
[42]
Perthame, N
B. Perthame, N. Vauchelet, and Z. W ang. The flux limited K eller-Segel system; properties and derivation from kinetic equations. Rev. Mat. Iberoam. , 36(2):357–386, 2020
2020
-
[43]
G. Peyr´ e. Entropic approximation of Wasserstein grad ient flows. SIAM Journal on Imaging Sciences, 8(4):2323–2351, 2015
2015
-
[44]
Peyr´ e, M
G. Peyr´ e, M. Cuturi, et al. Computational optimal tran sport: With applications to data science. Foundations and Trends ® in Machine Learning , 11(5-6):355–607, 2019
2019
-
[45]
W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. F lannery. Numerical recipes in Fortran 77 and Fortran 90 . Cambridge University Press, Cambridge, second edition,
-
[46]
The art of scientific and parallel computing
-
[47]
R. T. Rockafellar. Level sets and continuity of conjuga te convex functions. Trans. Amer. Math. Soc., 123:46–63, 1966
1966
-
[48]
Santambrogio
F. Santambrogio. {Euclidean, metric, and Wasserstein } gradient flows: an overview. Bull. Math. Sci. , 7(1):87–154, 2017
2017
-
[49]
S. Serfaty. Gamma-convergence of gradient flows on Hilb ert and metric spaces and applications. Discrete Contin. Dyn. Syst. , 31(4):1427–1451, 2011
2011
-
[50]
B. a. C. V˜ u. A splitting algorithm for dual monotone inc lusions involving cocoercive operators. Adv. Comput. Math. , 38(3):667–681, 2013
2013
-
[51]
J. Vazquez. The Porous Medium Equation . Oxford Mathematical Monographs. Oxford Uni- versity Press, 2007. Oxford, UK
2007
-
[52]
J. L. V´ azquez. The evolution fractional p-Laplacian equation in RN . Fundamental solution and asymptotic behaviour. Nonlinear Anal. , 199:112034, 32, 2020
2020
-
[53]
C. Villani. Topics in Optimal Transport . 58 AMS. Grad. Stud. Math., 2003. Providence, RI
2003
-
[54]
M. Yan. A new primal-dual algorithm for minimizing the s um of three functions with a linear operator. J. Sci. Comput. , 76(3):1698–1717, 2018. 26
2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.