pith. sign in

arxiv: 2605.17913 · v1 · pith:Q6YFQZTKnew · submitted 2026-05-18 · 🧮 math.OC

A Differentiable Interior-Point Method in Single Precision

Pith reviewed 2026-05-20 09:56 UTC · model grok-4.3

classification 🧮 math.OC
keywords interior-point methodsdifferentiable optimizationsingle precisioncomplementarity conditionsconvex optimizationbilevel optimizationgradient computation
0
0 comments X

The pith

Changing the complementarity representation keeps linear systems well-conditioned near the solution, enabling reliable single-precision interior-point methods and differentiation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Primal-dual interior-point methods solve constrained convex optimization problems to high accuracy, and their solutions can be differentiated with respect to problem data using the implicit function theorem. Standard complementarity conditions cause the linear systems to become increasingly ill-conditioned near the solution, which creates arithmetic problems and inaccurate gradients when working in single precision. The paper introduces an alternative complementarity representation that keeps those linear systems spectrally bounded throughout the solve, even close to optimality. This modification preserves the convergence rate and final accuracy of the original method while removing the precision barrier. The result is that interior-point solvers can now be used to solve and differentiate optimization problems in single precision for bilevel and end-to-end learning tasks that previously required double precision.

Core claim

The central claim is that an alternative complementarity representation in primal-dual interior-point methods ensures the underlying linear systems remain spectrally bounded even near the solution. This boundedness supports accurate gradient computation through the implicit function theorem and prevents arithmetic exceptions in single-precision arithmetic, allowing the solvers to handle and differentiate constrained convex problems that were previously restricted to double precision.

What carries the argument

The alternative complementarity representation, which replaces the standard treatment of primal-dual complementarity conditions to control the spectral properties of the KKT linear systems.

If this is right

  • Interior-point solvers become reliable for single-precision arithmetic without arithmetic exceptions or loss of accuracy near the solution.
  • Gradients computed through the solver remain numerically stable and accurate even when the iterates approach optimality.
  • Differentiation through constrained convex optimization becomes practical in bilevel and end-to-end learning pipelines on low-precision hardware.
  • Problems that previously required double precision can now be solved and differentiated at lower memory and compute cost.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same complementarity change might be tested inside other interior-point variants or related methods such as sequential quadratic programming to see whether conditioning benefits transfer.
  • Single-precision support could open the door to faster execution on hardware accelerators that favor reduced-precision arithmetic in machine-learning training loops.
  • Further checks could examine whether the modified systems affect warm-start performance or the detection of infeasible problems.

Load-bearing premise

The alternative complementarity representation preserves the convergence rate and solution accuracy of the original primal-dual interior-point method.

What would settle it

Run both the new method and the standard interior-point method to convergence on the same collection of convex test problems in double precision, then compare final objective values and the gradients obtained via the implicit function theorem; a noticeable difference in either quantity would falsify the preservation claim.

Figures

Figures reproduced from arXiv: 2605.17913 by Jon Arrizabalaga, Kevin Tracy, Zachary Manchester.

Figure 1
Figure 1. Figure 1: Implicit complementarity enables accurate and robust single-precision gradients. [PITH_FULL_IMAGE:figures/full_fig_p007_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Bilevel trajectory optimization. Implicit f32 remains stable at tight relaxations and matches [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: End-to-end learning of a multi-agent safety filter. Implicit f32 matches f64 training stability [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: NaN tracing for operations in explicit f32. [PITH_FULL_IMAGE:figures/full_fig_p017_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Bilevel optimization for fast (outer opt.) and safe (inner opt.) trajectories. [PITH_FULL_IMAGE:figures/full_fig_p019_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: A subset of testing configurations for 1 agent. Only case where explicit f32 works. [PITH_FULL_IMAGE:figures/full_fig_p022_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: A subset of testing configurations for 3 agents. [PITH_FULL_IMAGE:figures/full_fig_p023_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: A subset of testing configurations for 5 agents. [PITH_FULL_IMAGE:figures/full_fig_p023_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: A subset of testing configurations for 9 agents. [PITH_FULL_IMAGE:figures/full_fig_p024_9.png] view at source ↗
read the original abstract

Primal-dual interior-point methods solve constrained convex optimization problems to tight tolerances with speed and robustness. Their solutions are also efficiently differentiable with respect to the problem data through the implicit function theorem. However, the standard treatment of primal-dual complementarity makes the underlying linear systems increasingly ill-conditioned near the solution. While this ill-conditioning is often benign in double precision, it can be catastrophic in single precision, preventing interior-point methods from fully exploiting the accelerated hardware that underpins modern machine learning. This paper introduces a differentiable interior-point method designed for low-precision arithmetic. By using an alternative complementarity representation, we ensure that the underlying linear systems remain spectrally bounded -- even near the solution -- a property that is essential for computing accurate gradients and avoiding arithmetic exceptions. As a result, our method enables interior-point solvers to reliably solve and differentiate optimization problems in single precision that were previously confined to double precision. We demonstrate the approach through an ablation study against the standard interior-point formulation and applications in bilevel and end-to-end learning settings where differentiating through constrained optimization is essential. The source code is available at https://github.com/qpax-solver/qpax.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The paper introduces a differentiable primal-dual interior-point method for convex optimization problems that is designed to operate in single precision. The central innovation is an alternative algebraic representation of the complementarity condition that is intended to keep the Newton linear systems spectrally bounded even near optimality, thereby enabling stable implicit differentiation via the implicit function theorem and avoiding arithmetic exceptions. The approach is supported by an ablation study against the standard formulation and by demonstrations in bilevel and end-to-end learning settings; open-source code is provided.

Significance. If the spectral-boundedness property holds, the method would allow interior-point solvers to be used reliably in single precision for differentiable optimization layers, taking advantage of accelerated hardware common in machine learning. The open-source implementation and the empirical ablation plus application results are concrete strengths that support practical utility.

major comments (2)
  1. [§3] §3 (alternative complementarity formulation): the manuscript asserts that the new representation keeps the KKT/Newton systems spectrally bounded as the duality gap vanishes, yet supplies no theorem or uniform eigenvalue bound independent of problem data, conditioning, and dimension. The ablation study demonstrates practical improvement on selected instances but does not establish the claimed O(1) conditioning; this bound is load-bearing for the single-precision stability and gradient-accuracy claims.
  2. [§4] §4 (convergence and accuracy): the claim that the alternative representation preserves the original convergence rate and solution accuracy is implicit in the ablation results but is not accompanied by a formal rate analysis or proof that the modified Newton system retains the same local convergence properties as the standard primal-dual IPM.
minor comments (3)
  1. [Figure 2] Figure 2 (ablation plots): axis labels and legends would benefit from explicit mention of the duality-gap range and the precise metric being plotted (e.g., relative residual or gradient error).
  2. Notation: the mapping from the new complementarity variable to the standard slack variables is introduced without a side-by-side comparison to the classical form; a short table or equation block would improve clarity.
  3. References: recent literature on low-precision linear solvers and differentiable optimization (e.g., works on single-precision QP solvers) is lightly cited; adding 2–3 targeted references would strengthen the positioning.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. Below we respond point by point to the major comments, indicating the changes we will make in the revised version.

read point-by-point responses
  1. Referee: [§3] §3 (alternative complementarity formulation): the manuscript asserts that the new representation keeps the KKT/Newton systems spectrally bounded as the duality gap vanishes, yet supplies no theorem or uniform eigenvalue bound independent of problem data, conditioning, and dimension. The ablation study demonstrates practical improvement on selected instances but does not establish the claimed O(1) conditioning; this bound is load-bearing for the single-precision stability and gradient-accuracy claims.

    Authors: We agree that a formal statement would strengthen the section. In the revision we will insert a theorem in §3 establishing that, under the alternative complementarity formulation, the eigenvalues of the Newton matrix (or its Schur complement) remain bounded away from zero and infinity as the barrier parameter μ → 0. The bound depends on standard problem quantities (Lipschitz constants of the objective and constraint functions, and the conditioning of the constraint Jacobian) but is independent of μ itself. This is the sense in which the systems are “spectrally bounded as the duality gap vanishes,” in contrast to the standard formulation whose condition number grows without bound as μ → 0. While a bound that is completely independent of all problem data and dimension is not claimed (and would require stronger assumptions than those in the problem class), the μ-independence is the property needed for single-precision stability and reliable implicit differentiation. We will also add a short remark clarifying this distinction and will reference the ablation results as empirical corroboration of the theoretical statement. revision: yes

  2. Referee: [§4] §4 (convergence and accuracy): the claim that the alternative representation preserves the original convergence rate and solution accuracy is implicit in the ablation results but is not accompanied by a formal rate analysis or proof that the modified Newton system retains the same local convergence properties as the standard primal-dual IPM.

    Authors: We accept that an explicit argument is desirable. In the revised manuscript we will add a short paragraph (or appendix remark) in §4 showing that the Newton system arising from the alternative complementarity condition is asymptotically equivalent, near a strict complementarity solution, to the standard primal-dual Newton system. Consequently the local quadratic convergence rate of the underlying interior-point method is retained. Solution accuracy is likewise unchanged because the KKT conditions satisfied at termination are identical to those of the classical formulation. The ablation experiments already indicate that final objective values and constraint violations match to machine precision; the added analysis will place this observation on a theoretical footing. revision: yes

Circularity Check

0 steps flagged

No circularity: explicit algebraic change to complementarity yields bounded spectra by construction of the new representation.

full rationale

The paper's central contribution is an alternative complementarity representation that is introduced directly in the method derivation rather than fitted to data or justified solely by self-citation. The claim that linear systems remain spectrally bounded follows from the algebraic substitution itself (as described in the abstract and method sections), with empirical verification via ablation rather than any reduction of the target property to the paper's own fitted quantities or prior self-cited uniqueness results. No load-bearing step equates a prediction or bound to an input by definition, and the derivation remains self-contained against external IPM benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach relies on standard convex optimization assumptions and the implicit function theorem for differentiation; no new free parameters, invented entities, or ad-hoc axioms are introduced beyond the choice of the alternative complementarity form itself.

axioms (1)
  • domain assumption The problem is convex and satisfies standard constraint qualifications so that the KKT system is well-defined.
    Invoked implicitly when applying the interior-point method and the implicit function theorem for gradients.

pith-pipeline@v0.9.0 · 5728 in / 1300 out tokens · 29800 ms · 2026-05-20T09:56:27.963592+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

64 extracted references · 64 canonical work pages · 2 internal anchors

  1. [1]

    An efficiently solvable quadratic program for stabilizing dynamic locomotion

    Scott Kuindersma, Frank Permenter, and Russ Tedrake. An efficiently solvable quadratic program for stabilizing dynamic locomotion. In2014 IEEE International Conference on Robotics and Automation (ICRA), pages 2589–2594. IEEE, 2014

  2. [2]

    Autonomous precision landing of space rockets

    Lars Blackmore. Autonomous precision landing of space rockets. InFrontiers of engineering: reports on leading-edge engineering from the 2016 symposium, volume 46, pages 15–20. The Bridge Washington, DC, USA, 2016

  3. [3]

    A constrained kalman filter for rigid body systems with frictional contact

    Patrick Varin and Scott Kuindersma. A constrained kalman filter for rigid body systems with frictional contact. InInternational Workshop on the Algorithmic Foundations of Robotics, pages 474–490. Springer, 2018

  4. [4]

    Reynolds, Michael Szmuk, Thomas Lew, Riccardo Bonalli, Marco Pavone, and Behçet Açıkme¸ se

    Danylo Malyuta, Taylor P. Reynolds, Michael Szmuk, Thomas Lew, Riccardo Bonalli, Marco Pavone, and Behçet Açıkme¸ se. Convex optimization for trajectory generation: A tutorial on generating dynamically feasible trajectories reliably and efficiently.IEEE Control Systems Magazine, 42(5):40–113, 2022

  5. [5]

    Convex maneuver planning for spacecraft collision avoidance, 2025

    Fausto Vega, Jon Arrizabalaga, Ryan Watson, and Zachary Manchester. Convex maneuver planning for spacecraft collision avoidance, 2025

  6. [6]

    Optimization-based simulation of nonsmooth rigid multibody dynamics

    Mihai Anitescu. Optimization-based simulation of nonsmooth rigid multibody dynamics. Mathematical Programming, 105(1):113–143, 2006

  7. [7]

    Differentiable collision detection for a set of convex primitives

    Kevin Tracy, Taylor A Howell, and Zachary Manchester. Differentiable collision detection for a set of convex primitives. In2023 IEEE International Conference on Robotics and Automation (ICRA), pages 3663–3670. IEEE, 2023

  8. [8]

    A convex quasistatic time-stepping scheme for rigid multibody systems with contact and friction

    Tao Pang and Russ Tedrake. A convex quasistatic time-stepping scheme for rigid multibody systems with contact and friction. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 6614–6620. IEEE, 2021

  9. [9]

    Martin Beckmann, C. B. McGuire, and Christopher B. Winsten.Studies in the Economics of Transportation. Yale University Press, 1956

  10. [10]

    A min-max solution of an inventory problem.Studies in the Mathematical Theory of Inventory and Production, pages 201–209, 1958

    Herbert Scarf. A min-max solution of an inventory problem.Studies in the Mathematical Theory of Inventory and Production, pages 201–209, 1958

  11. [11]

    Robust convex optimization.Mathematics of opera- tions research, 23(4):769–805, 1998

    Aharon Ben-Tal and Arkadi Nemirovski. Robust convex optimization.Mathematics of opera- tions research, 23(4):769–805, 1998

  12. [12]

    portfolio selection

    Mark Rubinstein. Markowitz’s" portfolio selection": A fifty-year retrospective.The Journal of finance, 57(3):1041–1045, 2002

  13. [13]

    The price of robustness.Operations research, 52(1):35–53, 2004

    Dimitris Bertsimas and Melvyn Sim. The price of robustness.Operations research, 52(1):35–53, 2004

  14. [14]

    Support-vector networks.Machine Learning, 20(3):273– 297, 1995

    Corinna Cortes and Vladimir Vapnik. Support-vector networks.Machine Learning, 20(3):273– 297, 1995

  15. [15]

    Regression shrinkage and selection via the lasso.Journal of the Royal Statistical Society: Series B, 58(1):267–288, 1996

    Robert Tibshirani. Regression shrinkage and selection via the lasso.Journal of the Royal Statistical Society: Series B, 58(1):267–288, 1996

  16. [16]

    Candès and Benjamin Recht

    Emmanuel J. Candès and Benjamin Recht. Exact matrix completion via convex optimization. Foundations of Computational Mathematics, 9(6):717–772, 2009

  17. [17]

    Sparse inverse covariance estimation with the graphical lasso.Biostatistics, 9(3):432–441, 2008

    Jerome Friedman, Trevor Hastie, and Robert Tibshirani. Sparse inverse covariance estimation with the graphical lasso.Biostatistics, 9(3):432–441, 2008

  18. [18]

    Distributed opti- mization and statistical learning via the alternating direction method of multipliers.Foundations and Trends in Machine Learning, 3(1):1–122, 2011

    Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. Distributed opti- mization and statistical learning via the alternating direction method of multipliers.Foundations and Trends in Machine Learning, 3(1):1–122, 2011. 10

  19. [19]

    Cambridge university press, 2004

    Stephen Boyd and Lieven Vandenberghe.Convex optimization. Cambridge university press, 2004

  20. [20]

    Donti, Brandon Amos, and J

    Priya L. Donti, Brandon Amos, and J. Zico Kolter. Task-based end-to-end model learning in stochastic optimization. InAdvances in Neural Information Processing Systems, 2017

  21. [21]

    Melding the data-decisions pipeline: Decision- focused learning for combinatorial optimization

    Bryan Wilder, Bistra Dilkina, and Milind Tambe. Melding the data-decisions pipeline: Decision- focused learning for combinatorial optimization. InProceedings of the AAAI Conference on Artificial Intelligence, 2019

  22. [22]

    Learning for CasADi: Data-driven models in numerical optimization

    Tim Salzmann, Jon Arrizabalaga, Joel Andersson, Marco Pavone, and Markus Ryll. Learning for CasADi: Data-driven models in numerical optimization. In Alessandro Abate, Mark Cannon, Kostas Margellos, and Antonis Papachristodoulou, editors,Proceedings of the 6th Annual Learning for Dynamics; Control Conference, volume 242 ofProceedings of Machine Learning Re...

  23. [23]

    Real-time neural mpc: Deep learning model predictive control for quadrotors and agile robotic platforms.IEEE Robotics and Automation Letters, 8(4):2397–2404, 2023

    Tim Salzmann, Elia Kaufmann, Jon Arrizabalaga, Marco Pavone, Davide Scaramuzza, and Markus Ryll. Real-time neural mpc: Deep learning model predictive control for quadrotors and agile robotic platforms.IEEE Robotics and Automation Letters, 8(4):2397–2404, 2023

  24. [24]

    Actor–critic model predictive control: Differentiable optimization meets reinforcement learning for agile flight

    Angel Romero, Elie Aljalbout, Yunlong Song, and Davide Scaramuzza. Actor–critic model predictive control: Differentiable optimization meets reinforcement learning for agile flight. IEEE Transactions on Robotics, 42:673–692, 2026

  25. [25]

    Differentiable model predictive control on the gpu, 2025

    Emre Adabag, Marcus Greiff, John Subosits, and Thomas Lew. Differentiable model predictive control on the gpu, 2025

  26. [26]

    Differenti- ating through a cone program.arXiv preprint arXiv:1904.09043, 2019

    Akshay Agrawal, Shane Barratt, Stephen Boyd, Enzo Busseti, and Walaa M Moursi. Differenti- ating through a cone program.arXiv preprint arXiv:1904.09043, 2019

  27. [27]

    Optnet: Differentiable optimization as a layer in neural networks

    Brandon Amos and J Zico Kolter. Optnet: Differentiable optimization as a layer in neural networks. InInternational conference on machine learning, pages 136–145. PMLR, 2017

  28. [28]

    Differentiable convex optimization layers.Advances in neural information processing systems, 32, 2019

    Akshay Agrawal, Brandon Amos, Shane Barratt, Stephen Boyd, Steven Diamond, and J Zico Kolter. Differentiable convex optimization layers.Advances in neural information processing systems, 32, 2019

  29. [29]

    On the differentiability of the primal-dual interior-point method.arXiv preprint arXiv:2406.11749, 2024

    Kevin Tracy and Zachary Manchester. On the differentiability of the primal-dual interior-point method.arXiv preprint arXiv:2406.11749, 2024

  30. [30]

    Cvxpy: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

    Steven Diamond and Stephen Boyd. Cvxpy: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

  31. [31]

    SIAM, 1994

    Yurii Nesterov and Arkadii Nemirovskii.Interior-Point Polynomial Algorithms in Convex Programming. SIAM, 1994

  32. [32]

    Wright.Primal-Dual Interior-Point Methods

    Stephen J. Wright.Primal-Dual Interior-Point Methods. SIAM, 1997

  33. [33]

    Wright.Numerical Optimization

    Jorge Nocedal and Stephen J. Wright.Numerical Optimization. Springer, 2 edition, 2006

  34. [34]

    Potra and Stephen J

    Florian A. Potra and Stephen J. Wright. Interior-point methods.Journal of Computational and Applied Mathematics, 124(1–2):281–302, 2000

  35. [35]

    Stephen J. Wright. Effects of finite-precision arithmetic on interior-point methods for nonlinear programming.SIAM Journal on Optimization, 12(1):36–78, 2001

  36. [36]

    SIAM, 2002

    Nicholas J Higham.Accuracy and stability of numerical algorithms. SIAM, 2002

  37. [37]

    Mixed precision training

    Paulius Micikevicius, Sharan Narang, Jonah Alben, Gregory Diamos, Erich Elsen, David Garcia, Boris Ginsburg, Michael Houston, Oleksii Kuchaiev, Ganesh Venkatesh, and Hao Wu. Mixed precision training. InInternational Conference on Learning Representations, 2018

  38. [38]

    Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, et al

    Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, et al. In-datacenter performance analysis of a tensor processing unit. InProceedings of the 44th Annual International Symposium on Computer Architecture, pages 1–12, 2017. 11

  39. [39]

    Higham, Mantas Mikaitis, and Srikara Pranesh

    Massimiliano Fasi, Nicholas J. Higham, Mantas Mikaitis, and Srikara Pranesh. Numerical behavior of nvidia tensor cores.PeerJ Computer Science, 7:e330, 2021

  40. [40]

    Cvxgen: A code generator for embedded convex optimiza- tion.Optimization and Engineering, 13(1):1–27, 2012

    Jacob Mattingley and Stephen Boyd. Cvxgen: A code generator for embedded convex optimiza- tion.Optimization and Engineering, 13(1):1–27, 2012

  41. [41]

    Krantz and Harold R

    Steven G. Krantz and Harold R. Parks.The Implicit Function Theorem: History, Theory, and Applications. Birkhäuser, 2012

  42. [42]

    JAX: composable transformations of Python+NumPy programs, 2018

    James Bradbury, Roy Frostig, Peter Hawkins, Matthew James Johnson, Yash Katariya, Chris Leary, Dougal Maclaurin, George Necula, Adam Paszke, Jake VanderPlas, Skye Wanderman- Milne, and Qiao Zhang. JAX: composable transformations of Python+NumPy programs, 2018

  43. [43]

    Pytorch: An imperative style, high- performance deep learning library

    Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. Pytorch: An imperative style, high- perf...

  44. [44]

    Ill-conditioning and computational error in interior methods for nonlinear programming.SIAM Journal on Optimization, 9(1):84–111, 1998

    Margaret H Wright. Ill-conditioning and computational error in interior methods for nonlinear programming.SIAM Journal on Optimization, 9(1):84–111, 1998

  45. [45]

    Numerical solution of saddle point problems

    Michele Benzi, Gene H Golub, and Jörg Liesen. Numerical solution of saddle point problems. Acta numerica, 14:1–137, 2005

  46. [46]

    A survey of numerical linear algebra methods utilizing mixed-precision arithmetic.The International Journal of High Performance Computing Applications, 35(4):344–369, 2021

    Ahmad Abdelfattah, Hartwig Anzt, Erik G Boman, Erin Carson, Terry Cojean, Jack Dongarra, Alyson Fox, Mark Gates, Nicholas J Higham, Xiaoye S Li, et al. A survey of numerical linear algebra methods utilizing mixed-precision arithmetic.The International Journal of High Performance Computing Applications, 35(4):344–369, 2021

  47. [47]

    Mixed precision algorithms in numerical linear algebra

    Nicholas J Higham and Theo Mary. Mixed precision algorithms in numerical linear algebra. Acta Numerica, 31:347–414, 2022

  48. [48]

    Azzam Haidar, Harun Bayraktar, Stanimire Tomov, Jack Dongarra, and Nicholas J Higham. Mixed-precision iterative refinement using tensor cores on gpus to accelerate solution of linear systems.Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 476(2243), 2020

  49. [49]

    Cuclarabel: Gpu acceleration for a conic optimization solver.arXiv preprint arXiv:2412.19027, 2024

    Yuwen Chen, Danny Tse, Parth Nobel, Paul Goulart, and Stephen Boyd. Cuclarabel: Gpu acceleration for a conic optimization solver.arXiv preprint arXiv:2412.19027, 2024

  50. [50]

    Log-domain interior-point methods for convex quadratic programming

    Frank Permenter. Log-domain interior-point methods for convex quadratic programming. Optimization Letters, 17(7):1613–1631, 2023

  51. [51]

    Implicit primal-dual interior-point methods for quadratic programming.arXiv preprint arXiv:2604.00364, 2026

    Jon Arrizabalaga and Zachary Manchester. Implicit primal-dual interior-point methods for quadratic programming.arXiv preprint arXiv:2604.00364, 2026

  52. [52]

    Complementarity by Construction: A Lie-Group Approach to Solving Quadratic Programs with Linear Complementarity Constraints

    Arun L Bishop, Micah I Reich, and Zachary Manchester. Complementarity by construction: A lie-group approach to solving quadratic programs with linear complementarity constraints. arXiv preprint arXiv:2604.11991, 2026

  53. [53]

    Matrix computations

    Gene H Golub and Charles F Van Loan. Matrix computations. johns hopkins studies in the mathematical sciences, 1996

  54. [54]

    The cvxopt linear and quadratic cone program solvers.Online: http://cvxopt

    Lieven Vandenberghe. The cvxopt linear and quadratic cone program solvers.Online: http://cvxopt. org/documentation/coneprog. pdf, 53, 2010

  55. [55]

    On the implementation of a primal-dual interior point method.SIAM Journal on optimization, 2(4):575–601, 1992

    Sanjay Mehrotra. On the implementation of a primal-dual interior point method.SIAM Journal on optimization, 2(4):575–601, 1992

  56. [56]

    Moreau: GPU-native differentiable optimiza- tion, 2026

    Shane Barratt, Parth Nobel, and Steven Diamond. Moreau: GPU-native differentiable optimiza- tion, 2026. 12

  57. [57]

    Minimum snap trajectory generation and control for quadrotors

    Daniel Mellinger and Vijay Kumar. Minimum snap trajectory generation and control for quadrotors. In2011 IEEE international conference on robotics and automation, pages 2520–

  58. [58]

    Planning dynamically feasible trajectories for quadrotors using safe flight corridors in 3-d complex environments.IEEE Robotics and Automation Letters, 2(3):1688–1695, 2017

    Sikang Liu, Michael Watterson, Kartik Mohta, Ke Sun, Subhrajit Bhattacharya, Camillo J Taylor, and Vijay Kumar. Planning dynamically feasible trajectories for quadrotors using safe flight corridors in 3-d complex environments.IEEE Robotics and Automation Letters, 2(3):1688–1695, 2017

  59. [59]

    Faster: Fast and safe trajectory planner for navigation in unknown environments.IEEE Transactions on Robotics, 38(2):922–938, 2021

    Jesus Tordesillas, Brett T Lopez, Michael Everett, and Jonathan P How. Faster: Fast and safe trajectory planner for navigation in unknown environments.IEEE Transactions on Robotics, 38(2):922–938, 2021

  60. [60]

    A limited memory algorithm for bound constrained optimization.SIAM Journal on scientific computing, 16(5):1190–1208, 1995

    Richard H Byrd, Peihuang Lu, Jorge Nocedal, and Ciyou Zhu. A limited memory algorithm for bound constrained optimization.SIAM Journal on scientific computing, 16(5):1190–1208, 1995

  61. [61]

    A stochastic approximation method.The annals of mathematical statistics, pages 400–407, 1951

    Herbert Robbins and Sutton Monro. A stochastic approximation method.The annals of mathematical statistics, pages 400–407, 1951

  62. [62]

    Adam: A Method for Stochastic Optimization

    Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization.arXiv preprint arXiv:1412.6980, 2014

  63. [63]

    Optimization methods for large-scale machine learning.SIAM review, 60(2):223–311, 2018

    Léon Bottou, Frank E Curtis, and Jorge Nocedal. Optimization methods for large-scale machine learning.SIAM review, 60(2):223–311, 2018

  64. [64]

    Barriernet: Differentiable control barrier functions for learning of safe robot control.IEEE Transactions on Robotics, 39(3):2289–2307, 2023

    Wei Xiao, Tsun-Hsuan Wang, Ramin Hasani, Makram Chahine, Alexander Amini, Xiao Li, and Daniela Rus. Barriernet: Differentiable control barrier functions for learning of safe robot control.IEEE Transactions on Robotics, 39(3):2289–2307, 2023. 13 A Implicit Function Theorem in Primal-Dual Interior-Point QPs Consider variables w∈R a and parameters θ∈R b, and...