The adjoint state method for parametric definable optimization without smoothness or uniqueness
Pith reviewed 2026-05-14 22:24 UTC · model grok-4.3
The pith
Nonconvex definable parametric optimization problems admit an adjoint state formula under a qualification condition without smoothness or unique solutions.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We establish that nonconvex definable parametric optimization problems with possibly nonsmooth objectives, inequality constraints, conic constraint systems, and non-unique primal and dual solutions admit an adjoint state formula under a mere qualification condition. The adjoint construction yields a selection of a conservative field for the value function, providing a computable first-order object without requiring differentiation of the solution mapping.
What carries the argument
Adjoint state formula selecting a conservative field for the value function in definable parametric optimization.
Load-bearing premise
The optimization problems must be definable in an o-minimal structure and satisfy a qualification condition.
What would settle it
A counterexample of a smooth parametric optimization problem that is not definable where the adjoint state does not provide a conservative field.
Figures
read the original abstract
We establish that nonconvex definable parametric optimization problems with possibly nonsmooth objectives, inequality constraints, conic constraint systems, and non-unique primal and dual solutions admit an adjoint state formula under a mere qualification condition. The adjoint construction yields a selection of a conservative field for the value function, providing a computable first-order object without requiring differentiation of the solution mapping. Through examples, we show that even in smooth problems, the formal adjoint construction fails without conservativity or definability, illustrating the relevance of these concepts to grasp theoretical aspects of the method. This work provides a tool which can be directly combined with existing primal-dual solvers for a wide range of parametric optimization problems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript establishes that nonconvex parametric optimization problems definable in an o-minimal structure admit an adjoint-state formula that selects a conservative field for the value function. The result applies to problems with nonsmooth objectives, inequality constraints, conic constraint systems, and non-unique primal and dual solutions, provided only a qualification condition holds. The construction avoids explicit differentiation of the solution mapping and is shown to be compatible with existing primal-dual solvers; counterexamples illustrate that definability and conservativity are essential.
Significance. If the central theorem holds, the work supplies a first-order sensitivity tool for a wide family of non-smooth, non-unique parametric problems that previously lacked rigorous adjoint formulas. The explicit separation of definability from smoothness, together with the conservative-field guarantee, fills a documented gap and directly supports numerical sensitivity analysis without additional regularity assumptions.
minor comments (3)
- [Abstract] The abstract states that examples demonstrate failure of the formal adjoint without definability or conservativity; please add a brief parenthetical remark indicating the dimension or constraint type of the counterexamples so readers can locate them quickly.
- [Section 2] Notation for the conservative field (e.g., the symbol used for the selection of the limiting subdifferential) should be introduced once in the preliminaries and used consistently thereafter; occasional re-definition in later sections disrupts readability.
- [Theorem 3.1] The statement of the main theorem would benefit from an explicit remark that the qualification condition is independent of uniqueness of the primal-dual pair; this is already implicit but worth highlighting for readers familiar with classical KKT theory.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of the manuscript and the recommendation for minor revision. No specific major comments were provided in the report.
Circularity Check
No significant circularity detected
full rationale
The paper establishes an existence result for an adjoint-state formula in nonconvex definable parametric optimization problems under a qualification condition. The derivation chain relies on o-minimal definability to guarantee conservative fields for the value function, without any reduction of the claimed formula to a fitted parameter, a self-referential definition, or a load-bearing self-citation. The abstract and stated claims isolate definability as an essential external assumption (with counter-examples for non-definable smooth cases) and position the result as compatible with existing solvers. No step in the provided derivation chain equates the output to its inputs by construction.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption The parametric optimization problem is definable in an o-minimal structure
- domain assumption A qualification condition holds for the constraint system
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.lean (J-uniqueness); Foundation/AbsoluteFloorClosure.lean (distinction forcing)reality_from_one_distinction; washburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We establish that nonconvex definable parametric optimization problems ... admit an adjoint state formula under a mere qualification condition. The adjoint construction yields a selection of a conservative field for the value function
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean; AlexanderDuality.leanLogicNat recovery; alexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
all the mappings F,G,H belong to a common o-minimal structure ... conservative calculus
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Grégoire Allaire. A review of adjoint methods for sensitivity analysis, uncertainty quantification and optimization in numerical codes.Ingénieurs de l’Automobile, 836: 33–36, 2015
work page 2015
-
[2]
Optnet: Differentiable optimization as a layer in neural networks
Brandon Amos and J Zico Kolter. Optnet: Differentiable optimization as a layer in neural networks. InInternational conference on machine learning, pages 136–145. PMLR, 2017
work page 2017
-
[3]
Auslender.Differentiable stability in non convex and non differentiable program- ming, pages 29–41
A. Auslender.Differentiable stability in non convex and non differentiable program- ming, pages 29–41. Springer Berlin Heidelberg, Berlin, Heidelberg, 1979. ISBN 978-3-642-00798-9. doi: 10.1007/BFb0120841. URL https://doi.org/10.1007/ BFb0120841
-
[4]
Deep equilibrium models.Advances in neural information processing systems, 32, 2019
Shaojie Bai, J Zico Kolter, and Vladlen Koltun. Deep equilibrium models.Advances in neural information processing systems, 32, 2019
work page 2019
-
[5]
Robust optimization–methodology and applications.Mathematical programming, 92(3):453–480, 2002
Aharon Ben-Tal and Arkadi Nemirovski. Robust optimization–methodology and applications.Mathematical programming, 92(3):453–480, 2002
work page 2002
-
[6]
Jérôme Bolte and Edouard Pauwels. A mathematical model for automatic differenti- ation in machine learning.Advances in Neural Information Processing Systems, 33: 10809–10819, 2020
work page 2020
-
[7]
Jérôme Bolte and Edouard Pauwels. Conservative set valued fields, automatic differ- entiation, stochastic gradient methods and deep learning.Mathematical Programming, 188(1):19–51, 2021
work page 2021
-
[8]
Jérôme Bolte, Antoine Hochart, and Edouard Pauwels. Qualification conditions in semialgebraic programming.SIAM Journal on Optimization, 28(2):1867–1891, 2018
work page 2018
-
[9]
Jérôme Bolte, Tam Le, and Edouard Pauwels. Subgradient sampling for nonsmooth nonconvex minimization.SIAM Journal on Optimization, 33(4):2542–2569, 2023
work page 2023
-
[10]
Optimization problems with perturba- tions: A guided tour.SIAM review, 40(2):228–264, 1998
J Frédéric Bonnans and Alexander Shapiro. Optimization problems with perturba- tions: A guided tour.SIAM review, 40(2):228–264, 1998
work page 1998
-
[11]
Springer Science & Business Media, 2000
J Frédéric Bonnans and Alexander Shapiro.Perturbation analysis of optimization problems. Springer Science & Business Media, 2000
work page 2000
-
[12]
An optimal engineering design method with failure rate constraints and sensitivity analysis
C Castillo, R Mínguez, E Castillo, and MA Losada. An optimal engineering design method with failure rate constraints and sensitivity analysis. application to composite breakwaters.Coastal Engineering, 53(1):1–25, 2006. 24
work page 2006
-
[13]
Jean Céa. Conception optimale ou identification de formes, calcul rapide de la dérivée directionnelle de la fonction coût.ESAIM: Modélisation mathématique et analyse numérique, 20(3):371–402, 1986
work page 1986
-
[14]
Neural ordinary differential equations.Advances in neural information processing systems, 31, 2018
Ricky TQ Chen, Yulia Rubanova, Jesse Bettencourt, and David K Duvenaud. Neural ordinary differential equations.Advances in neural information processing systems, 31, 2018
work page 2018
-
[15]
Clarke.Optimization and Nonsmooth Analysis
Frank H. Clarke.Optimization and Nonsmooth Analysis. Canadian Mathematical Society series of monographs and advanced texts. John Wiley & Sons, 1983. ISBN 9780471875048
work page 1983
-
[16]
Directional derivative of a minimax function
Rafael Correa and Alberto Seeger. Directional derivative of a minimax function. Nonlinear Analysis: Theory, Methods & Applications, 9(1):13–22, 1985
work page 1985
-
[17]
Istituti editoriali e poligrafici internazionali Pisa, 2000
Michel Coste.An introduction to o-minimal geometry. Istituti editoriali e poligrafici internazionali Pisa, 2000
work page 2000
-
[18]
Damek Davis, Dmitriy Drusvyatskiy, Sham Kakade, and Jason D Lee. Stochastic subgradient method converges on tame functions.Foundations of computational mathematics, 20(1):119–154, 2020
work page 2020
-
[19]
Shape sensitivity analysis via min max differentiability
MC Delfour and J-P Zolésio. Shape sensitivity analysis via min max differentiability. SIAM Journal on Control and Optimization, 26(4):834–862, 1988
work page 1988
-
[20]
Control, shape, and topological derivatives via minimax differen- tiability of lagrangians
Michel C Delfour. Control, shape, and topological derivatives via minimax differen- tiability of lagrangians. InNumerical Methods for Optimal Control Problems, pages 137–164. Springer, 2019
work page 2019
-
[21]
Michel C Delfour. Semidifferential of minimax of lagrangians with respect to parame- ters, controls, or geometric variables.SIAM Journal on Control and Optimization, 63(2):936–965, 2025
work page 2025
-
[22]
Parametric semidifferentiability of minimax of lagrangians: averaged adjoint approach.J
Michel C Delfour and Kevin Sturm. Parametric semidifferentiability of minimax of lagrangians: averaged adjoint approach.J. Convex Anal, 24(4):1117–1142, 2017
work page 2017
-
[23]
Victor DeMiguel and Francisco J Nogales. On decomposition methods for a class of partially separable nonlinear programs.Mathematics of Operations Research, 33(1): 119–139, 2008
work page 2008
-
[24]
Stochastic bregman subgradient methods for nonsmooth nonconvex optimization problems: K
Kuangyu Ding and Kim-Chuan Toh. Stochastic bregman subgradient methods for nonsmooth nonconvex optimization problems: K. ding, k.-c. toh.Journal of Optimization Theory and Applications, 206(3):67, 2025
work page 2025
-
[25]
I. Ekeland and R. Temam.Analyse convexe et problèmes variationnels. Etudes mathématiques. Dunod, 1974. ISBN 9782040073688
work page 1974
-
[26]
Anthony V Fiacco and William P Hutzler. Extensions of the gauvin-tolle optimal value differential stability results to general mathematical programs (no. serialt393). Technical report, Institute for Management Science and Engineering, School of Engineering and Applied Science, The George Washington University, 1979. 25
work page 1979
-
[27]
Differential stability in nonlinear programming
Jacques Gauvin and Jon W Tolle. Differential stability in nonlinear programming. SIAM Journal on Control and Optimization, 15(2):294–311, 1977
work page 1977
-
[28]
The methodof parametric decomposition in mathematical programming: The nonconvex case
Jean Gauvin. The methodof parametric decomposition in mathematical programming: The nonconvex case. InIIASA Proceedings Series, page 131, 1978
work page 1978
-
[29]
Snopt: An sqp algorithm for large-scale constrained optimization.SIAM review, 47(1):99–131, 2005
Philip E Gill, Walter Murray, and Michael A Saunders. Snopt: An sqp algorithm for large-scale constrained optimization.SIAM review, 47(1):99–131, 2005
work page 2005
-
[30]
Andreas Griewank and Andrea Walther.Evaluating derivatives: principles and techniques of algorithmic differentiation. SIAM, 2008
work page 2008
-
[31]
SpringerBriefs on PDEs and Data Science
Frédéric Hecht, Gontran Lance, and Emmanuel Trélat.PDE-Constrained Optimiza- tion within FreeFEM. SpringerBriefs on PDEs and Data Science. Springer Singapore, Singapore, 2026. ISBN 978-981-95-7048-5. eBook ISBN: 978-981-95-7049-2
work page 2026
-
[32]
JB Hiriart-Urruty. Gradients généralisés de fonctions marginales.SIAM Journal on Control and Optimization, 16(2):301–316, 1978
work page 1978
-
[33]
Springer Berlin Heidelberg, Berlin, Heidelberg, 1984
Robert Janin.Directional derivative of the marginal function in nonlinear program- ming, pages 110–126. Springer Berlin Heidelberg, Berlin, Heidelberg, 1984. ISBN 978-3-642-00913-6. doi: 10.1007/BFb0121214. URL https://doi.org/10.1007/ BFb0121214
-
[34]
A Jourani. Constraint qualifications and lagrange multipliers in nondifferentiable programming problems.Journal of Optimization Theory and Applications, 81(3): 533–548, 1994
work page 1994
-
[35]
Tam Le. Nonsmooth nonconvex stochastic heavy ball.Journal of Optimization Theory and Applications, 201(2):699–719, 2024
work page 2024
-
[36]
doi: https: //doi.org/10.1007/978-1-4419-9982-5
John M. Lee.Introduction to Smooth Manifolds, volume 218 ofGraduate Texts in Mathematics. Springer, New York, NY, 2 edition, 2012. ISBN 978-1-4419-9981-8. doi: 10.1007/978-1-4419-9982-5
-
[37]
Ana P Lemos-Paião, Sofia O Lopes, and MDR de Pinho. A parametric optimal control problem applied to daily irrigation.Mathematical Modelling of Natural Phenomena, 20:2, 2025
work page 2025
-
[38]
Boris Mordukhovich. Lipschitzian stability of constraint systems and generalized equations.Nonlinear Analysis: Theory, Methods & Applications, 22(2):173–206, 1994
work page 1994
-
[39]
François Pacaud. Sensitivity analysis for parametric nonlinear programming: A tutorial.arXiv preprint arXiv:2504.15851, 2025
-
[40]
Edouard Pauwels. Conservative parametric optimality and the ridge method for tame min-max problems.Set-Valued and Variational Analysis, 31(3):19, 2023
work page 2023
-
[41]
Hyperparameter optimization with approximate gradient
Fabian Pedregosa. Hyperparameter optimization with approximate gradient. In International conference on machine learning, pages 737–746. PMLR, 2016
work page 2016
-
[42]
Princeton university press, 1970
R Tyrrell Rockafellar.Convex analysis, volume 28. Princeton university press, 1970. 26
work page 1970
- [43]
-
[44]
R Tyrrell Rockafellar and Roger JB Wets.Variational analysis. Springer, 1998
work page 1998
-
[45]
Kim-Chuan Toh, Michael J Todd, and Reha H Tütüncü. Sdpt3—a matlab software packageforsemidefiniteprogramming, version1.3.Optimization methods and software, 11(1-4):545–581, 1999
work page 1999
-
[46]
Geometric categories and o-minimal structures
Lou Van den Dries and Chris Miller. Geometric categories and o-minimal structures. Duke Mathematical Journal, 84(2):497–540, 1996
work page 1996
-
[47]
Andreas Wächter and Lorenz T Biegler. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming.Mathematical programming, 106(1):25–57, 2006
work page 2006
-
[48]
Nachuan Xiao, Xiaoyin Hu, Xin Liu, and Kim-Chuan Toh. Adam-family methods for nonsmooth optimization with convergence guarantees.Journal of Machine Learning Research, 25(48):1–53, 2024
work page 2024
-
[49]
Makoto Yamashita, Katsuki Fujisawa, and Masakazu Kojima. Implementation and evaluation of sdpa 6.0 (semidefinite programming algorithm 6.0).Optimization Methods and Software, 18(4):491–505, 2003. doi: 10.1080/1055678031000118482. URLhttps://doi.org/10.1080/1055678031000118482
-
[50]
Shapes and geometries: Metrics, analysis, differential calculus, and optimization, 2011
Jean-Paul Zolésio and MC Delfour. Shapes and geometries: Metrics, analysis, differential calculus, and optimization, 2011. 27
work page 2011
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.