A Differentiable Interior-Point Method in Single Precision

Jon Arrizabalaga; Kevin Tracy; Zachary Manchester

arxiv: 2605.17913 · v1 · pith:Q6YFQZTKnew · submitted 2026-05-18 · 🧮 math.OC

A Differentiable Interior-Point Method in Single Precision

Jon Arrizabalaga , Kevin Tracy , Zachary Manchester This is my paper

Pith reviewed 2026-05-20 09:56 UTC · model grok-4.3

classification 🧮 math.OC

keywords interior-point methodsdifferentiable optimizationsingle precisioncomplementarity conditionsconvex optimizationbilevel optimizationgradient computation

0 comments

The pith

Changing the complementarity representation keeps linear systems well-conditioned near the solution, enabling reliable single-precision interior-point methods and differentiation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Primal-dual interior-point methods solve constrained convex optimization problems to high accuracy, and their solutions can be differentiated with respect to problem data using the implicit function theorem. Standard complementarity conditions cause the linear systems to become increasingly ill-conditioned near the solution, which creates arithmetic problems and inaccurate gradients when working in single precision. The paper introduces an alternative complementarity representation that keeps those linear systems spectrally bounded throughout the solve, even close to optimality. This modification preserves the convergence rate and final accuracy of the original method while removing the precision barrier. The result is that interior-point solvers can now be used to solve and differentiate optimization problems in single precision for bilevel and end-to-end learning tasks that previously required double precision.

Core claim

The central claim is that an alternative complementarity representation in primal-dual interior-point methods ensures the underlying linear systems remain spectrally bounded even near the solution. This boundedness supports accurate gradient computation through the implicit function theorem and prevents arithmetic exceptions in single-precision arithmetic, allowing the solvers to handle and differentiate constrained convex problems that were previously restricted to double precision.

What carries the argument

The alternative complementarity representation, which replaces the standard treatment of primal-dual complementarity conditions to control the spectral properties of the KKT linear systems.

If this is right

Interior-point solvers become reliable for single-precision arithmetic without arithmetic exceptions or loss of accuracy near the solution.
Gradients computed through the solver remain numerically stable and accurate even when the iterates approach optimality.
Differentiation through constrained convex optimization becomes practical in bilevel and end-to-end learning pipelines on low-precision hardware.
Problems that previously required double precision can now be solved and differentiated at lower memory and compute cost.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same complementarity change might be tested inside other interior-point variants or related methods such as sequential quadratic programming to see whether conditioning benefits transfer.
Single-precision support could open the door to faster execution on hardware accelerators that favor reduced-precision arithmetic in machine-learning training loops.
Further checks could examine whether the modified systems affect warm-start performance or the detection of infeasible problems.

Load-bearing premise

The alternative complementarity representation preserves the convergence rate and solution accuracy of the original primal-dual interior-point method.

What would settle it

Run both the new method and the standard interior-point method to convergence on the same collection of convex test problems in double precision, then compare final objective values and the gradients obtained via the implicit function theorem; a noticeable difference in either quantity would falsify the preservation claim.

Figures

Figures reproduced from arXiv: 2605.17913 by Jon Arrizabalaga, Kevin Tracy, Zachary Manchester.

**Figure 2.** Figure 2: Bilevel trajectory optimization. Implicit f32 remains stable at tight relaxations and matches [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗

**Figure 3.** Figure 3: End-to-end learning of a multi-agent safety filter. Implicit f32 matches f64 training stability [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗

**Figure 4.** Figure 4: NaN tracing for operations in explicit f32. [PITH_FULL_IMAGE:figures/full_fig_p017_4.png] view at source ↗

**Figure 5.** Figure 5: Bilevel optimization for fast (outer opt.) and safe (inner opt.) trajectories. [PITH_FULL_IMAGE:figures/full_fig_p019_5.png] view at source ↗

**Figure 6.** Figure 6: A subset of testing configurations for 1 agent. Only case where explicit f32 works. [PITH_FULL_IMAGE:figures/full_fig_p022_6.png] view at source ↗

**Figure 7.** Figure 7: A subset of testing configurations for 3 agents. [PITH_FULL_IMAGE:figures/full_fig_p023_7.png] view at source ↗

**Figure 8.** Figure 8: A subset of testing configurations for 5 agents. [PITH_FULL_IMAGE:figures/full_fig_p023_8.png] view at source ↗

**Figure 9.** Figure 9: A subset of testing configurations for 9 agents. [PITH_FULL_IMAGE:figures/full_fig_p024_9.png] view at source ↗

read the original abstract

Primal-dual interior-point methods solve constrained convex optimization problems to tight tolerances with speed and robustness. Their solutions are also efficiently differentiable with respect to the problem data through the implicit function theorem. However, the standard treatment of primal-dual complementarity makes the underlying linear systems increasingly ill-conditioned near the solution. While this ill-conditioning is often benign in double precision, it can be catastrophic in single precision, preventing interior-point methods from fully exploiting the accelerated hardware that underpins modern machine learning. This paper introduces a differentiable interior-point method designed for low-precision arithmetic. By using an alternative complementarity representation, we ensure that the underlying linear systems remain spectrally bounded -- even near the solution -- a property that is essential for computing accurate gradients and avoiding arithmetic exceptions. As a result, our method enables interior-point solvers to reliably solve and differentiate optimization problems in single precision that were previously confined to double precision. We demonstrate the approach through an ablation study against the standard interior-point formulation and applications in bilevel and end-to-end learning settings where differentiating through constrained optimization is essential. The source code is available at https://github.com/qpax-solver/qpax.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The paper introduces a differentiable primal-dual interior-point method for convex optimization problems that is designed to operate in single precision. The central innovation is an alternative algebraic representation of the complementarity condition that is intended to keep the Newton linear systems spectrally bounded even near optimality, thereby enabling stable implicit differentiation via the implicit function theorem and avoiding arithmetic exceptions. The approach is supported by an ablation study against the standard formulation and by demonstrations in bilevel and end-to-end learning settings; open-source code is provided.

Significance. If the spectral-boundedness property holds, the method would allow interior-point solvers to be used reliably in single precision for differentiable optimization layers, taking advantage of accelerated hardware common in machine learning. The open-source implementation and the empirical ablation plus application results are concrete strengths that support practical utility.

major comments (2)

[§3] §3 (alternative complementarity formulation): the manuscript asserts that the new representation keeps the KKT/Newton systems spectrally bounded as the duality gap vanishes, yet supplies no theorem or uniform eigenvalue bound independent of problem data, conditioning, and dimension. The ablation study demonstrates practical improvement on selected instances but does not establish the claimed O(1) conditioning; this bound is load-bearing for the single-precision stability and gradient-accuracy claims.
[§4] §4 (convergence and accuracy): the claim that the alternative representation preserves the original convergence rate and solution accuracy is implicit in the ablation results but is not accompanied by a formal rate analysis or proof that the modified Newton system retains the same local convergence properties as the standard primal-dual IPM.

minor comments (3)

[Figure 2] Figure 2 (ablation plots): axis labels and legends would benefit from explicit mention of the duality-gap range and the precise metric being plotted (e.g., relative residual or gradient error).
Notation: the mapping from the new complementarity variable to the standard slack variables is introduced without a side-by-side comparison to the classical form; a short table or equation block would improve clarity.
References: recent literature on low-precision linear solvers and differentiable optimization (e.g., works on single-precision QP solvers) is lightly cited; adding 2–3 targeted references would strengthen the positioning.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. Below we respond point by point to the major comments, indicating the changes we will make in the revised version.

read point-by-point responses

Referee: [§3] §3 (alternative complementarity formulation): the manuscript asserts that the new representation keeps the KKT/Newton systems spectrally bounded as the duality gap vanishes, yet supplies no theorem or uniform eigenvalue bound independent of problem data, conditioning, and dimension. The ablation study demonstrates practical improvement on selected instances but does not establish the claimed O(1) conditioning; this bound is load-bearing for the single-precision stability and gradient-accuracy claims.

Authors: We agree that a formal statement would strengthen the section. In the revision we will insert a theorem in §3 establishing that, under the alternative complementarity formulation, the eigenvalues of the Newton matrix (or its Schur complement) remain bounded away from zero and infinity as the barrier parameter μ → 0. The bound depends on standard problem quantities (Lipschitz constants of the objective and constraint functions, and the conditioning of the constraint Jacobian) but is independent of μ itself. This is the sense in which the systems are “spectrally bounded as the duality gap vanishes,” in contrast to the standard formulation whose condition number grows without bound as μ → 0. While a bound that is completely independent of all problem data and dimension is not claimed (and would require stronger assumptions than those in the problem class), the μ-independence is the property needed for single-precision stability and reliable implicit differentiation. We will also add a short remark clarifying this distinction and will reference the ablation results as empirical corroboration of the theoretical statement. revision: yes
Referee: [§4] §4 (convergence and accuracy): the claim that the alternative representation preserves the original convergence rate and solution accuracy is implicit in the ablation results but is not accompanied by a formal rate analysis or proof that the modified Newton system retains the same local convergence properties as the standard primal-dual IPM.

Authors: We accept that an explicit argument is desirable. In the revised manuscript we will add a short paragraph (or appendix remark) in §4 showing that the Newton system arising from the alternative complementarity condition is asymptotically equivalent, near a strict complementarity solution, to the standard primal-dual Newton system. Consequently the local quadratic convergence rate of the underlying interior-point method is retained. Solution accuracy is likewise unchanged because the KKT conditions satisfied at termination are identical to those of the classical formulation. The ablation experiments already indicate that final objective values and constraint violations match to machine precision; the added analysis will place this observation on a theoretical footing. revision: yes

Circularity Check

0 steps flagged

No circularity: explicit algebraic change to complementarity yields bounded spectra by construction of the new representation.

full rationale

The paper's central contribution is an alternative complementarity representation that is introduced directly in the method derivation rather than fitted to data or justified solely by self-citation. The claim that linear systems remain spectrally bounded follows from the algebraic substitution itself (as described in the abstract and method sections), with empirical verification via ablation rather than any reduction of the target property to the paper's own fitted quantities or prior self-cited uniqueness results. No load-bearing step equates a prediction or bound to an input by definition, and the derivation remains self-contained against external IPM benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach relies on standard convex optimization assumptions and the implicit function theorem for differentiation; no new free parameters, invented entities, or ad-hoc axioms are introduced beyond the choice of the alternative complementarity form itself.

axioms (1)

domain assumption The problem is convex and satisfies standard constraint qualifications so that the KKT system is well-defined.
Invoked implicitly when applying the interior-point method and the implicit function theorem for gradients.

pith-pipeline@v0.9.0 · 5728 in / 1300 out tokens · 29800 ms · 2026-05-20T09:56:27.963592+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

By using an alternative complementarity representation, we ensure that the underlying linear systems remain spectrally bounded — even near the solution
IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean costAlphaLog_high_calibrated_iff echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

∂v b_κ(v) + ∂v b_κ(−v) = 1 and 0 < ∂v b_κ(v) ≤ 1

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

64 extracted references · 64 canonical work pages · 2 internal anchors

[1]

An efficiently solvable quadratic program for stabilizing dynamic locomotion

Scott Kuindersma, Frank Permenter, and Russ Tedrake. An efficiently solvable quadratic program for stabilizing dynamic locomotion. In2014 IEEE International Conference on Robotics and Automation (ICRA), pages 2589–2594. IEEE, 2014

work page 2014
[2]

Autonomous precision landing of space rockets

Lars Blackmore. Autonomous precision landing of space rockets. InFrontiers of engineering: reports on leading-edge engineering from the 2016 symposium, volume 46, pages 15–20. The Bridge Washington, DC, USA, 2016

work page 2016
[3]

A constrained kalman filter for rigid body systems with frictional contact

Patrick Varin and Scott Kuindersma. A constrained kalman filter for rigid body systems with frictional contact. InInternational Workshop on the Algorithmic Foundations of Robotics, pages 474–490. Springer, 2018

work page 2018
[4]

Reynolds, Michael Szmuk, Thomas Lew, Riccardo Bonalli, Marco Pavone, and Behçet Açıkme¸ se

Danylo Malyuta, Taylor P. Reynolds, Michael Szmuk, Thomas Lew, Riccardo Bonalli, Marco Pavone, and Behçet Açıkme¸ se. Convex optimization for trajectory generation: A tutorial on generating dynamically feasible trajectories reliably and efficiently.IEEE Control Systems Magazine, 42(5):40–113, 2022

work page 2022
[5]

Convex maneuver planning for spacecraft collision avoidance, 2025

Fausto Vega, Jon Arrizabalaga, Ryan Watson, and Zachary Manchester. Convex maneuver planning for spacecraft collision avoidance, 2025

work page 2025
[6]

Optimization-based simulation of nonsmooth rigid multibody dynamics

Mihai Anitescu. Optimization-based simulation of nonsmooth rigid multibody dynamics. Mathematical Programming, 105(1):113–143, 2006

work page 2006
[7]

Differentiable collision detection for a set of convex primitives

Kevin Tracy, Taylor A Howell, and Zachary Manchester. Differentiable collision detection for a set of convex primitives. In2023 IEEE International Conference on Robotics and Automation (ICRA), pages 3663–3670. IEEE, 2023

work page 2023
[8]

A convex quasistatic time-stepping scheme for rigid multibody systems with contact and friction

Tao Pang and Russ Tedrake. A convex quasistatic time-stepping scheme for rigid multibody systems with contact and friction. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 6614–6620. IEEE, 2021

work page 2021
[9]

Martin Beckmann, C. B. McGuire, and Christopher B. Winsten.Studies in the Economics of Transportation. Yale University Press, 1956

work page 1956
[10]

A min-max solution of an inventory problem.Studies in the Mathematical Theory of Inventory and Production, pages 201–209, 1958

Herbert Scarf. A min-max solution of an inventory problem.Studies in the Mathematical Theory of Inventory and Production, pages 201–209, 1958

work page 1958
[11]

Robust convex optimization.Mathematics of opera- tions research, 23(4):769–805, 1998

Aharon Ben-Tal and Arkadi Nemirovski. Robust convex optimization.Mathematics of opera- tions research, 23(4):769–805, 1998

work page 1998
[12]

portfolio selection

Mark Rubinstein. Markowitz’s" portfolio selection": A fifty-year retrospective.The Journal of finance, 57(3):1041–1045, 2002

work page 2002
[13]

The price of robustness.Operations research, 52(1):35–53, 2004

Dimitris Bertsimas and Melvyn Sim. The price of robustness.Operations research, 52(1):35–53, 2004

work page 2004
[14]

Support-vector networks.Machine Learning, 20(3):273– 297, 1995

Corinna Cortes and Vladimir Vapnik. Support-vector networks.Machine Learning, 20(3):273– 297, 1995

work page 1995
[15]

Regression shrinkage and selection via the lasso.Journal of the Royal Statistical Society: Series B, 58(1):267–288, 1996

Robert Tibshirani. Regression shrinkage and selection via the lasso.Journal of the Royal Statistical Society: Series B, 58(1):267–288, 1996

work page 1996
[16]

Candès and Benjamin Recht

Emmanuel J. Candès and Benjamin Recht. Exact matrix completion via convex optimization. Foundations of Computational Mathematics, 9(6):717–772, 2009

work page 2009
[17]

Sparse inverse covariance estimation with the graphical lasso.Biostatistics, 9(3):432–441, 2008

Jerome Friedman, Trevor Hastie, and Robert Tibshirani. Sparse inverse covariance estimation with the graphical lasso.Biostatistics, 9(3):432–441, 2008

work page 2008
[18]

Distributed opti- mization and statistical learning via the alternating direction method of multipliers.Foundations and Trends in Machine Learning, 3(1):1–122, 2011

Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. Distributed opti- mization and statistical learning via the alternating direction method of multipliers.Foundations and Trends in Machine Learning, 3(1):1–122, 2011. 10

work page 2011
[19]

Cambridge university press, 2004

Stephen Boyd and Lieven Vandenberghe.Convex optimization. Cambridge university press, 2004

work page 2004
[20]

Donti, Brandon Amos, and J

Priya L. Donti, Brandon Amos, and J. Zico Kolter. Task-based end-to-end model learning in stochastic optimization. InAdvances in Neural Information Processing Systems, 2017

work page 2017
[21]

Melding the data-decisions pipeline: Decision- focused learning for combinatorial optimization

Bryan Wilder, Bistra Dilkina, and Milind Tambe. Melding the data-decisions pipeline: Decision- focused learning for combinatorial optimization. InProceedings of the AAAI Conference on Artificial Intelligence, 2019

work page 2019
[22]

Learning for CasADi: Data-driven models in numerical optimization

Tim Salzmann, Jon Arrizabalaga, Joel Andersson, Marco Pavone, and Markus Ryll. Learning for CasADi: Data-driven models in numerical optimization. In Alessandro Abate, Mark Cannon, Kostas Margellos, and Antonis Papachristodoulou, editors,Proceedings of the 6th Annual Learning for Dynamics; Control Conference, volume 242 ofProceedings of Machine Learning Re...

work page 2024
[23]

Real-time neural mpc: Deep learning model predictive control for quadrotors and agile robotic platforms.IEEE Robotics and Automation Letters, 8(4):2397–2404, 2023

Tim Salzmann, Elia Kaufmann, Jon Arrizabalaga, Marco Pavone, Davide Scaramuzza, and Markus Ryll. Real-time neural mpc: Deep learning model predictive control for quadrotors and agile robotic platforms.IEEE Robotics and Automation Letters, 8(4):2397–2404, 2023

work page 2023
[24]

Actor–critic model predictive control: Differentiable optimization meets reinforcement learning for agile flight

Angel Romero, Elie Aljalbout, Yunlong Song, and Davide Scaramuzza. Actor–critic model predictive control: Differentiable optimization meets reinforcement learning for agile flight. IEEE Transactions on Robotics, 42:673–692, 2026

work page 2026
[25]

Differentiable model predictive control on the gpu, 2025

Emre Adabag, Marcus Greiff, John Subosits, and Thomas Lew. Differentiable model predictive control on the gpu, 2025

work page 2025
[26]

Differenti- ating through a cone program.arXiv preprint arXiv:1904.09043, 2019

Akshay Agrawal, Shane Barratt, Stephen Boyd, Enzo Busseti, and Walaa M Moursi. Differenti- ating through a cone program.arXiv preprint arXiv:1904.09043, 2019

work page arXiv 1904
[27]

Optnet: Differentiable optimization as a layer in neural networks

Brandon Amos and J Zico Kolter. Optnet: Differentiable optimization as a layer in neural networks. InInternational conference on machine learning, pages 136–145. PMLR, 2017

work page 2017
[28]

Differentiable convex optimization layers.Advances in neural information processing systems, 32, 2019

Akshay Agrawal, Brandon Amos, Shane Barratt, Stephen Boyd, Steven Diamond, and J Zico Kolter. Differentiable convex optimization layers.Advances in neural information processing systems, 32, 2019

work page 2019
[29]

On the differentiability of the primal-dual interior-point method.arXiv preprint arXiv:2406.11749, 2024

Kevin Tracy and Zachary Manchester. On the differentiability of the primal-dual interior-point method.arXiv preprint arXiv:2406.11749, 2024

work page arXiv 2024
[30]

Cvxpy: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

Steven Diamond and Stephen Boyd. Cvxpy: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

work page 2016
[31]

SIAM, 1994

Yurii Nesterov and Arkadii Nemirovskii.Interior-Point Polynomial Algorithms in Convex Programming. SIAM, 1994

work page 1994
[32]

Wright.Primal-Dual Interior-Point Methods

Stephen J. Wright.Primal-Dual Interior-Point Methods. SIAM, 1997

work page 1997
[33]

Wright.Numerical Optimization

Jorge Nocedal and Stephen J. Wright.Numerical Optimization. Springer, 2 edition, 2006

work page 2006
[34]

Potra and Stephen J

Florian A. Potra and Stephen J. Wright. Interior-point methods.Journal of Computational and Applied Mathematics, 124(1–2):281–302, 2000

work page 2000
[35]

Stephen J. Wright. Effects of finite-precision arithmetic on interior-point methods for nonlinear programming.SIAM Journal on Optimization, 12(1):36–78, 2001

work page 2001
[36]

SIAM, 2002

Nicholas J Higham.Accuracy and stability of numerical algorithms. SIAM, 2002

work page 2002
[37]

Mixed precision training

Paulius Micikevicius, Sharan Narang, Jonah Alben, Gregory Diamos, Erich Elsen, David Garcia, Boris Ginsburg, Michael Houston, Oleksii Kuchaiev, Ganesh Venkatesh, and Hao Wu. Mixed precision training. InInternational Conference on Learning Representations, 2018

work page 2018
[38]

Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, et al

Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, et al. In-datacenter performance analysis of a tensor processing unit. InProceedings of the 44th Annual International Symposium on Computer Architecture, pages 1–12, 2017. 11

work page 2017
[39]

Higham, Mantas Mikaitis, and Srikara Pranesh

Massimiliano Fasi, Nicholas J. Higham, Mantas Mikaitis, and Srikara Pranesh. Numerical behavior of nvidia tensor cores.PeerJ Computer Science, 7:e330, 2021

work page 2021
[40]

Cvxgen: A code generator for embedded convex optimiza- tion.Optimization and Engineering, 13(1):1–27, 2012

Jacob Mattingley and Stephen Boyd. Cvxgen: A code generator for embedded convex optimiza- tion.Optimization and Engineering, 13(1):1–27, 2012

work page 2012
[41]

Krantz and Harold R

Steven G. Krantz and Harold R. Parks.The Implicit Function Theorem: History, Theory, and Applications. Birkhäuser, 2012

work page 2012
[42]

JAX: composable transformations of Python+NumPy programs, 2018

James Bradbury, Roy Frostig, Peter Hawkins, Matthew James Johnson, Yash Katariya, Chris Leary, Dougal Maclaurin, George Necula, Adam Paszke, Jake VanderPlas, Skye Wanderman- Milne, and Qiao Zhang. JAX: composable transformations of Python+NumPy programs, 2018

work page 2018
[43]

Pytorch: An imperative style, high- performance deep learning library

Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. Pytorch: An imperative style, high- perf...

work page 2019
[44]

Ill-conditioning and computational error in interior methods for nonlinear programming.SIAM Journal on Optimization, 9(1):84–111, 1998

Margaret H Wright. Ill-conditioning and computational error in interior methods for nonlinear programming.SIAM Journal on Optimization, 9(1):84–111, 1998

work page 1998
[45]

Numerical solution of saddle point problems

Michele Benzi, Gene H Golub, and Jörg Liesen. Numerical solution of saddle point problems. Acta numerica, 14:1–137, 2005

work page 2005
[46]

A survey of numerical linear algebra methods utilizing mixed-precision arithmetic.The International Journal of High Performance Computing Applications, 35(4):344–369, 2021

Ahmad Abdelfattah, Hartwig Anzt, Erik G Boman, Erin Carson, Terry Cojean, Jack Dongarra, Alyson Fox, Mark Gates, Nicholas J Higham, Xiaoye S Li, et al. A survey of numerical linear algebra methods utilizing mixed-precision arithmetic.The International Journal of High Performance Computing Applications, 35(4):344–369, 2021

work page 2021
[47]

Mixed precision algorithms in numerical linear algebra

Nicholas J Higham and Theo Mary. Mixed precision algorithms in numerical linear algebra. Acta Numerica, 31:347–414, 2022

work page 2022
[48]

Azzam Haidar, Harun Bayraktar, Stanimire Tomov, Jack Dongarra, and Nicholas J Higham. Mixed-precision iterative refinement using tensor cores on gpus to accelerate solution of linear systems.Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 476(2243), 2020

work page 2020
[49]

Cuclarabel: Gpu acceleration for a conic optimization solver.arXiv preprint arXiv:2412.19027, 2024

Yuwen Chen, Danny Tse, Parth Nobel, Paul Goulart, and Stephen Boyd. Cuclarabel: Gpu acceleration for a conic optimization solver.arXiv preprint arXiv:2412.19027, 2024

work page arXiv 2024
[50]

Log-domain interior-point methods for convex quadratic programming

Frank Permenter. Log-domain interior-point methods for convex quadratic programming. Optimization Letters, 17(7):1613–1631, 2023

work page 2023
[51]

Implicit primal-dual interior-point methods for quadratic programming.arXiv preprint arXiv:2604.00364, 2026

Jon Arrizabalaga and Zachary Manchester. Implicit primal-dual interior-point methods for quadratic programming.arXiv preprint arXiv:2604.00364, 2026

work page arXiv 2026
[52]

Complementarity by Construction: A Lie-Group Approach to Solving Quadratic Programs with Linear Complementarity Constraints

Arun L Bishop, Micah I Reich, and Zachary Manchester. Complementarity by construction: A lie-group approach to solving quadratic programs with linear complementarity constraints. arXiv preprint arXiv:2604.11991, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026
[53]

Matrix computations

Gene H Golub and Charles F Van Loan. Matrix computations. johns hopkins studies in the mathematical sciences, 1996

work page 1996
[54]

The cvxopt linear and quadratic cone program solvers.Online: http://cvxopt

Lieven Vandenberghe. The cvxopt linear and quadratic cone program solvers.Online: http://cvxopt. org/documentation/coneprog. pdf, 53, 2010

work page 2010
[55]

On the implementation of a primal-dual interior point method.SIAM Journal on optimization, 2(4):575–601, 1992

Sanjay Mehrotra. On the implementation of a primal-dual interior point method.SIAM Journal on optimization, 2(4):575–601, 1992

work page 1992
[56]

Moreau: GPU-native differentiable optimiza- tion, 2026

Shane Barratt, Parth Nobel, and Steven Diamond. Moreau: GPU-native differentiable optimiza- tion, 2026. 12

work page 2026
[57]

Minimum snap trajectory generation and control for quadrotors

Daniel Mellinger and Vijay Kumar. Minimum snap trajectory generation and control for quadrotors. In2011 IEEE international conference on robotics and automation, pages 2520–

work page
[58]

Planning dynamically feasible trajectories for quadrotors using safe flight corridors in 3-d complex environments.IEEE Robotics and Automation Letters, 2(3):1688–1695, 2017

Sikang Liu, Michael Watterson, Kartik Mohta, Ke Sun, Subhrajit Bhattacharya, Camillo J Taylor, and Vijay Kumar. Planning dynamically feasible trajectories for quadrotors using safe flight corridors in 3-d complex environments.IEEE Robotics and Automation Letters, 2(3):1688–1695, 2017

work page 2017
[59]

Faster: Fast and safe trajectory planner for navigation in unknown environments.IEEE Transactions on Robotics, 38(2):922–938, 2021

Jesus Tordesillas, Brett T Lopez, Michael Everett, and Jonathan P How. Faster: Fast and safe trajectory planner for navigation in unknown environments.IEEE Transactions on Robotics, 38(2):922–938, 2021

work page 2021
[60]

A limited memory algorithm for bound constrained optimization.SIAM Journal on scientific computing, 16(5):1190–1208, 1995

Richard H Byrd, Peihuang Lu, Jorge Nocedal, and Ciyou Zhu. A limited memory algorithm for bound constrained optimization.SIAM Journal on scientific computing, 16(5):1190–1208, 1995

work page 1995
[61]

A stochastic approximation method.The annals of mathematical statistics, pages 400–407, 1951

Herbert Robbins and Sutton Monro. A stochastic approximation method.The annals of mathematical statistics, pages 400–407, 1951

work page 1951
[62]

Adam: A Method for Stochastic Optimization

Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization.arXiv preprint arXiv:1412.6980, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[63]

Optimization methods for large-scale machine learning.SIAM review, 60(2):223–311, 2018

Léon Bottou, Frank E Curtis, and Jorge Nocedal. Optimization methods for large-scale machine learning.SIAM review, 60(2):223–311, 2018

work page 2018
[64]

Barriernet: Differentiable control barrier functions for learning of safe robot control.IEEE Transactions on Robotics, 39(3):2289–2307, 2023

Wei Xiao, Tsun-Hsuan Wang, Ramin Hasani, Makram Chahine, Alexander Amini, Xiao Li, and Daniela Rus. Barriernet: Differentiable control barrier functions for learning of safe robot control.IEEE Transactions on Robotics, 39(3):2289–2307, 2023. 13 A Implicit Function Theorem in Primal-Dual Interior-Point QPs Consider variables w∈R a and parameters θ∈R b, and...

work page 2023

[1] [1]

An efficiently solvable quadratic program for stabilizing dynamic locomotion

Scott Kuindersma, Frank Permenter, and Russ Tedrake. An efficiently solvable quadratic program for stabilizing dynamic locomotion. In2014 IEEE International Conference on Robotics and Automation (ICRA), pages 2589–2594. IEEE, 2014

work page 2014

[2] [2]

Autonomous precision landing of space rockets

Lars Blackmore. Autonomous precision landing of space rockets. InFrontiers of engineering: reports on leading-edge engineering from the 2016 symposium, volume 46, pages 15–20. The Bridge Washington, DC, USA, 2016

work page 2016

[3] [3]

A constrained kalman filter for rigid body systems with frictional contact

Patrick Varin and Scott Kuindersma. A constrained kalman filter for rigid body systems with frictional contact. InInternational Workshop on the Algorithmic Foundations of Robotics, pages 474–490. Springer, 2018

work page 2018

[4] [4]

Reynolds, Michael Szmuk, Thomas Lew, Riccardo Bonalli, Marco Pavone, and Behçet Açıkme¸ se

Danylo Malyuta, Taylor P. Reynolds, Michael Szmuk, Thomas Lew, Riccardo Bonalli, Marco Pavone, and Behçet Açıkme¸ se. Convex optimization for trajectory generation: A tutorial on generating dynamically feasible trajectories reliably and efficiently.IEEE Control Systems Magazine, 42(5):40–113, 2022

work page 2022

[5] [5]

Convex maneuver planning for spacecraft collision avoidance, 2025

Fausto Vega, Jon Arrizabalaga, Ryan Watson, and Zachary Manchester. Convex maneuver planning for spacecraft collision avoidance, 2025

work page 2025

[6] [6]

Optimization-based simulation of nonsmooth rigid multibody dynamics

Mihai Anitescu. Optimization-based simulation of nonsmooth rigid multibody dynamics. Mathematical Programming, 105(1):113–143, 2006

work page 2006

[7] [7]

Differentiable collision detection for a set of convex primitives

Kevin Tracy, Taylor A Howell, and Zachary Manchester. Differentiable collision detection for a set of convex primitives. In2023 IEEE International Conference on Robotics and Automation (ICRA), pages 3663–3670. IEEE, 2023

work page 2023

[8] [8]

A convex quasistatic time-stepping scheme for rigid multibody systems with contact and friction

Tao Pang and Russ Tedrake. A convex quasistatic time-stepping scheme for rigid multibody systems with contact and friction. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 6614–6620. IEEE, 2021

work page 2021

[9] [9]

Martin Beckmann, C. B. McGuire, and Christopher B. Winsten.Studies in the Economics of Transportation. Yale University Press, 1956

work page 1956

[10] [10]

A min-max solution of an inventory problem.Studies in the Mathematical Theory of Inventory and Production, pages 201–209, 1958

Herbert Scarf. A min-max solution of an inventory problem.Studies in the Mathematical Theory of Inventory and Production, pages 201–209, 1958

work page 1958

[11] [11]

Robust convex optimization.Mathematics of opera- tions research, 23(4):769–805, 1998

Aharon Ben-Tal and Arkadi Nemirovski. Robust convex optimization.Mathematics of opera- tions research, 23(4):769–805, 1998

work page 1998

[12] [12]

portfolio selection

Mark Rubinstein. Markowitz’s" portfolio selection": A fifty-year retrospective.The Journal of finance, 57(3):1041–1045, 2002

work page 2002

[13] [13]

The price of robustness.Operations research, 52(1):35–53, 2004

Dimitris Bertsimas and Melvyn Sim. The price of robustness.Operations research, 52(1):35–53, 2004

work page 2004

[14] [14]

Support-vector networks.Machine Learning, 20(3):273– 297, 1995

Corinna Cortes and Vladimir Vapnik. Support-vector networks.Machine Learning, 20(3):273– 297, 1995

work page 1995

[15] [15]

Regression shrinkage and selection via the lasso.Journal of the Royal Statistical Society: Series B, 58(1):267–288, 1996

Robert Tibshirani. Regression shrinkage and selection via the lasso.Journal of the Royal Statistical Society: Series B, 58(1):267–288, 1996

work page 1996

[16] [16]

Candès and Benjamin Recht

Emmanuel J. Candès and Benjamin Recht. Exact matrix completion via convex optimization. Foundations of Computational Mathematics, 9(6):717–772, 2009

work page 2009

[17] [17]

Sparse inverse covariance estimation with the graphical lasso.Biostatistics, 9(3):432–441, 2008

Jerome Friedman, Trevor Hastie, and Robert Tibshirani. Sparse inverse covariance estimation with the graphical lasso.Biostatistics, 9(3):432–441, 2008

work page 2008

[18] [18]

Distributed opti- mization and statistical learning via the alternating direction method of multipliers.Foundations and Trends in Machine Learning, 3(1):1–122, 2011

Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. Distributed opti- mization and statistical learning via the alternating direction method of multipliers.Foundations and Trends in Machine Learning, 3(1):1–122, 2011. 10

work page 2011

[19] [19]

Cambridge university press, 2004

Stephen Boyd and Lieven Vandenberghe.Convex optimization. Cambridge university press, 2004

work page 2004

[20] [20]

Donti, Brandon Amos, and J

Priya L. Donti, Brandon Amos, and J. Zico Kolter. Task-based end-to-end model learning in stochastic optimization. InAdvances in Neural Information Processing Systems, 2017

work page 2017

[21] [21]

Melding the data-decisions pipeline: Decision- focused learning for combinatorial optimization

Bryan Wilder, Bistra Dilkina, and Milind Tambe. Melding the data-decisions pipeline: Decision- focused learning for combinatorial optimization. InProceedings of the AAAI Conference on Artificial Intelligence, 2019

work page 2019

[22] [22]

Learning for CasADi: Data-driven models in numerical optimization

Tim Salzmann, Jon Arrizabalaga, Joel Andersson, Marco Pavone, and Markus Ryll. Learning for CasADi: Data-driven models in numerical optimization. In Alessandro Abate, Mark Cannon, Kostas Margellos, and Antonis Papachristodoulou, editors,Proceedings of the 6th Annual Learning for Dynamics; Control Conference, volume 242 ofProceedings of Machine Learning Re...

work page 2024

[23] [23]

Real-time neural mpc: Deep learning model predictive control for quadrotors and agile robotic platforms.IEEE Robotics and Automation Letters, 8(4):2397–2404, 2023

Tim Salzmann, Elia Kaufmann, Jon Arrizabalaga, Marco Pavone, Davide Scaramuzza, and Markus Ryll. Real-time neural mpc: Deep learning model predictive control for quadrotors and agile robotic platforms.IEEE Robotics and Automation Letters, 8(4):2397–2404, 2023

work page 2023

[24] [24]

Actor–critic model predictive control: Differentiable optimization meets reinforcement learning for agile flight

Angel Romero, Elie Aljalbout, Yunlong Song, and Davide Scaramuzza. Actor–critic model predictive control: Differentiable optimization meets reinforcement learning for agile flight. IEEE Transactions on Robotics, 42:673–692, 2026

work page 2026

[25] [25]

Differentiable model predictive control on the gpu, 2025

Emre Adabag, Marcus Greiff, John Subosits, and Thomas Lew. Differentiable model predictive control on the gpu, 2025

work page 2025

[26] [26]

Differenti- ating through a cone program.arXiv preprint arXiv:1904.09043, 2019

Akshay Agrawal, Shane Barratt, Stephen Boyd, Enzo Busseti, and Walaa M Moursi. Differenti- ating through a cone program.arXiv preprint arXiv:1904.09043, 2019

work page arXiv 1904

[27] [27]

Optnet: Differentiable optimization as a layer in neural networks

Brandon Amos and J Zico Kolter. Optnet: Differentiable optimization as a layer in neural networks. InInternational conference on machine learning, pages 136–145. PMLR, 2017

work page 2017

[28] [28]

Differentiable convex optimization layers.Advances in neural information processing systems, 32, 2019

Akshay Agrawal, Brandon Amos, Shane Barratt, Stephen Boyd, Steven Diamond, and J Zico Kolter. Differentiable convex optimization layers.Advances in neural information processing systems, 32, 2019

work page 2019

[29] [29]

On the differentiability of the primal-dual interior-point method.arXiv preprint arXiv:2406.11749, 2024

Kevin Tracy and Zachary Manchester. On the differentiability of the primal-dual interior-point method.arXiv preprint arXiv:2406.11749, 2024

work page arXiv 2024

[30] [30]

Cvxpy: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

Steven Diamond and Stephen Boyd. Cvxpy: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

work page 2016

[31] [31]

SIAM, 1994

Yurii Nesterov and Arkadii Nemirovskii.Interior-Point Polynomial Algorithms in Convex Programming. SIAM, 1994

work page 1994

[32] [32]

Wright.Primal-Dual Interior-Point Methods

Stephen J. Wright.Primal-Dual Interior-Point Methods. SIAM, 1997

work page 1997

[33] [33]

Wright.Numerical Optimization

Jorge Nocedal and Stephen J. Wright.Numerical Optimization. Springer, 2 edition, 2006

work page 2006

[34] [34]

Potra and Stephen J

Florian A. Potra and Stephen J. Wright. Interior-point methods.Journal of Computational and Applied Mathematics, 124(1–2):281–302, 2000

work page 2000

[35] [35]

Stephen J. Wright. Effects of finite-precision arithmetic on interior-point methods for nonlinear programming.SIAM Journal on Optimization, 12(1):36–78, 2001

work page 2001

[36] [36]

SIAM, 2002

Nicholas J Higham.Accuracy and stability of numerical algorithms. SIAM, 2002

work page 2002

[37] [37]

Mixed precision training

Paulius Micikevicius, Sharan Narang, Jonah Alben, Gregory Diamos, Erich Elsen, David Garcia, Boris Ginsburg, Michael Houston, Oleksii Kuchaiev, Ganesh Venkatesh, and Hao Wu. Mixed precision training. InInternational Conference on Learning Representations, 2018

work page 2018

[38] [38]

Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, et al

Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, et al. In-datacenter performance analysis of a tensor processing unit. InProceedings of the 44th Annual International Symposium on Computer Architecture, pages 1–12, 2017. 11

work page 2017

[39] [39]

Higham, Mantas Mikaitis, and Srikara Pranesh

Massimiliano Fasi, Nicholas J. Higham, Mantas Mikaitis, and Srikara Pranesh. Numerical behavior of nvidia tensor cores.PeerJ Computer Science, 7:e330, 2021

work page 2021

[40] [40]

Cvxgen: A code generator for embedded convex optimiza- tion.Optimization and Engineering, 13(1):1–27, 2012

Jacob Mattingley and Stephen Boyd. Cvxgen: A code generator for embedded convex optimiza- tion.Optimization and Engineering, 13(1):1–27, 2012

work page 2012

[41] [41]

Krantz and Harold R

Steven G. Krantz and Harold R. Parks.The Implicit Function Theorem: History, Theory, and Applications. Birkhäuser, 2012

work page 2012

[42] [42]

JAX: composable transformations of Python+NumPy programs, 2018

James Bradbury, Roy Frostig, Peter Hawkins, Matthew James Johnson, Yash Katariya, Chris Leary, Dougal Maclaurin, George Necula, Adam Paszke, Jake VanderPlas, Skye Wanderman- Milne, and Qiao Zhang. JAX: composable transformations of Python+NumPy programs, 2018

work page 2018

[43] [43]

Pytorch: An imperative style, high- performance deep learning library

Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. Pytorch: An imperative style, high- perf...

work page 2019

[44] [44]

Ill-conditioning and computational error in interior methods for nonlinear programming.SIAM Journal on Optimization, 9(1):84–111, 1998

Margaret H Wright. Ill-conditioning and computational error in interior methods for nonlinear programming.SIAM Journal on Optimization, 9(1):84–111, 1998

work page 1998

[45] [45]

Numerical solution of saddle point problems

Michele Benzi, Gene H Golub, and Jörg Liesen. Numerical solution of saddle point problems. Acta numerica, 14:1–137, 2005

work page 2005

[46] [46]

A survey of numerical linear algebra methods utilizing mixed-precision arithmetic.The International Journal of High Performance Computing Applications, 35(4):344–369, 2021

Ahmad Abdelfattah, Hartwig Anzt, Erik G Boman, Erin Carson, Terry Cojean, Jack Dongarra, Alyson Fox, Mark Gates, Nicholas J Higham, Xiaoye S Li, et al. A survey of numerical linear algebra methods utilizing mixed-precision arithmetic.The International Journal of High Performance Computing Applications, 35(4):344–369, 2021

work page 2021

[47] [47]

Mixed precision algorithms in numerical linear algebra

Nicholas J Higham and Theo Mary. Mixed precision algorithms in numerical linear algebra. Acta Numerica, 31:347–414, 2022

work page 2022

[48] [48]

Azzam Haidar, Harun Bayraktar, Stanimire Tomov, Jack Dongarra, and Nicholas J Higham. Mixed-precision iterative refinement using tensor cores on gpus to accelerate solution of linear systems.Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 476(2243), 2020

work page 2020

[49] [49]

Cuclarabel: Gpu acceleration for a conic optimization solver.arXiv preprint arXiv:2412.19027, 2024

Yuwen Chen, Danny Tse, Parth Nobel, Paul Goulart, and Stephen Boyd. Cuclarabel: Gpu acceleration for a conic optimization solver.arXiv preprint arXiv:2412.19027, 2024

work page arXiv 2024

[50] [50]

Log-domain interior-point methods for convex quadratic programming

Frank Permenter. Log-domain interior-point methods for convex quadratic programming. Optimization Letters, 17(7):1613–1631, 2023

work page 2023

[51] [51]

Implicit primal-dual interior-point methods for quadratic programming.arXiv preprint arXiv:2604.00364, 2026

Jon Arrizabalaga and Zachary Manchester. Implicit primal-dual interior-point methods for quadratic programming.arXiv preprint arXiv:2604.00364, 2026

work page arXiv 2026

[52] [52]

Complementarity by Construction: A Lie-Group Approach to Solving Quadratic Programs with Linear Complementarity Constraints

Arun L Bishop, Micah I Reich, and Zachary Manchester. Complementarity by construction: A lie-group approach to solving quadratic programs with linear complementarity constraints. arXiv preprint arXiv:2604.11991, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026

[53] [53]

Matrix computations

Gene H Golub and Charles F Van Loan. Matrix computations. johns hopkins studies in the mathematical sciences, 1996

work page 1996

[54] [54]

The cvxopt linear and quadratic cone program solvers.Online: http://cvxopt

Lieven Vandenberghe. The cvxopt linear and quadratic cone program solvers.Online: http://cvxopt. org/documentation/coneprog. pdf, 53, 2010

work page 2010

[55] [55]

On the implementation of a primal-dual interior point method.SIAM Journal on optimization, 2(4):575–601, 1992

Sanjay Mehrotra. On the implementation of a primal-dual interior point method.SIAM Journal on optimization, 2(4):575–601, 1992

work page 1992

[56] [56]

Moreau: GPU-native differentiable optimiza- tion, 2026

Shane Barratt, Parth Nobel, and Steven Diamond. Moreau: GPU-native differentiable optimiza- tion, 2026. 12

work page 2026

[57] [57]

Minimum snap trajectory generation and control for quadrotors

Daniel Mellinger and Vijay Kumar. Minimum snap trajectory generation and control for quadrotors. In2011 IEEE international conference on robotics and automation, pages 2520–

work page

[58] [58]

Planning dynamically feasible trajectories for quadrotors using safe flight corridors in 3-d complex environments.IEEE Robotics and Automation Letters, 2(3):1688–1695, 2017

Sikang Liu, Michael Watterson, Kartik Mohta, Ke Sun, Subhrajit Bhattacharya, Camillo J Taylor, and Vijay Kumar. Planning dynamically feasible trajectories for quadrotors using safe flight corridors in 3-d complex environments.IEEE Robotics and Automation Letters, 2(3):1688–1695, 2017

work page 2017

[59] [59]

Faster: Fast and safe trajectory planner for navigation in unknown environments.IEEE Transactions on Robotics, 38(2):922–938, 2021

Jesus Tordesillas, Brett T Lopez, Michael Everett, and Jonathan P How. Faster: Fast and safe trajectory planner for navigation in unknown environments.IEEE Transactions on Robotics, 38(2):922–938, 2021

work page 2021

[60] [60]

A limited memory algorithm for bound constrained optimization.SIAM Journal on scientific computing, 16(5):1190–1208, 1995

Richard H Byrd, Peihuang Lu, Jorge Nocedal, and Ciyou Zhu. A limited memory algorithm for bound constrained optimization.SIAM Journal on scientific computing, 16(5):1190–1208, 1995

work page 1995

[61] [61]

A stochastic approximation method.The annals of mathematical statistics, pages 400–407, 1951

Herbert Robbins and Sutton Monro. A stochastic approximation method.The annals of mathematical statistics, pages 400–407, 1951

work page 1951

[62] [62]

Adam: A Method for Stochastic Optimization

Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization.arXiv preprint arXiv:1412.6980, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[63] [63]

Optimization methods for large-scale machine learning.SIAM review, 60(2):223–311, 2018

Léon Bottou, Frank E Curtis, and Jorge Nocedal. Optimization methods for large-scale machine learning.SIAM review, 60(2):223–311, 2018

work page 2018

[64] [64]

Barriernet: Differentiable control barrier functions for learning of safe robot control.IEEE Transactions on Robotics, 39(3):2289–2307, 2023

Wei Xiao, Tsun-Hsuan Wang, Ramin Hasani, Makram Chahine, Alexander Amini, Xiao Li, and Daniela Rus. Barriernet: Differentiable control barrier functions for learning of safe robot control.IEEE Transactions on Robotics, 39(3):2289–2307, 2023. 13 A Implicit Function Theorem in Primal-Dual Interior-Point QPs Consider variables w∈R a and parameters θ∈R b, and...

work page 2023