arxiv: 2005.06674 · v5 · submitted 2020-05-14 · 🧮 math.OC · cs.LG · math.DS

On the Convergence of Overlapping Schwarz Decomposition for Nonlinear Optimal Control

Pith reviewed 2026-05-24 14:02 UTC · model grok-4.3

classification 🧮 math.OC · cs.LG · math.DS
keywords Schwarz decomposition · nonlinear optimal control · convergence analysis · exponential decay of sensitivity · domain decomposition · parallel optimization · quadratic programming

The pith

An overlapping Schwarz decomposition algorithm for nonlinear optimal control problems converges locally at a linear rate that improves exponentially with the size of the overlap.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that decomposing the time horizon of a nonlinear optimal control problem into overlapping subdomains and solving the subproblems in parallel leads to a convergent algorithm. Convergence is local and linear, and the rate gets better exponentially as the overlap between subdomains increases. This relies on a property called exponential decay of sensitivity that bounds how much boundary changes affect the interior solution. The same framework gives global convergence when the problem is a quadratic program, which lets the method be embedded in sequential quadratic programming solvers. Numerical tests on a quadrotor planning task and a PDE control problem confirm that the method is competitive with a centralized solver and faster than ADMM.
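
To make the decompose, solve, and exchange loop concrete, here is a minimal sketch on a toy problem. It is not the paper's implementation. It assumes an unconstrained quadratic tracking problem with a smoothness penalty as a stand-in for an OCP, so only primal boundary values are exchanged (the paper's scheme also exchanges dual information). The names `build_problem` and `overlapping_schwarz` are hypothetical.

```python
import numpy as np

def build_problem(n, gamma, seed=0):
    """Toy stand-in for an OCP (an assumption, not the paper's model): minimize
    0.5*sum((x_k - r_k)^2) + 0.5*gamma*sum((x_{k+1} - x_k)^2).
    Its optimality condition is the tridiagonal linear system A x = r."""
    rng = np.random.default_rng(seed)
    A = (1 + 2 * gamma) * np.eye(n) - gamma * (np.eye(n, k=1) + np.eye(n, k=-1))
    A[0, 0] = A[-1, -1] = 1 + gamma  # the two end stages have a single neighbor
    return A, rng.standard_normal(n)

def overlapping_schwarz(A, r, n_sub, overlap, iters):
    n = len(r)
    x = np.zeros(n)
    x_star = np.linalg.solve(A, r)  # centralized solution, used only to track error
    edges = np.linspace(0, n, n_sub + 1).astype(int)
    errors = []
    for _ in range(iters):
        x_new = np.empty(n)
        for i in range(n_sub):  # the solves are independent and could run in parallel
            lo, hi = edges[i], edges[i + 1]                     # non-overlapping core
            a, b = max(lo - overlap, 0), min(hi + overlap, n)   # expanded subdomain
            inside = np.arange(a, b)
            outside = np.setdiff1d(np.arange(n), inside)
            # Subproblem: stages outside the subdomain are frozen at the last iterate.
            rhs = r[inside] - A[np.ix_(inside, outside)] @ x[outside]
            sol = np.linalg.solve(A[np.ix_(inside, inside)], rhs)
            x_new[lo:hi] = sol[lo - a:hi - a]  # keep only the core part
        x = x_new
        errors.append(float(np.max(np.abs(x - x_star))))
    return x, errors

A, r = build_problem(n=60, gamma=1.0)
x, errors = overlapping_schwarz(A, r, n_sub=3, overlap=5, iters=10)
print(errors[:4])
```

Discarding the overlapping part of each subproblem solution is the design choice that matters here: the retained stages sit at least `overlap + 1` stages away from the frozen boundary values, so boundary errors are damped before they reach the part that is kept.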

Core claim

The overlapping Schwarz decomposition algorithm exhibits local linear convergence for nonlinear optimal control problems, with the convergence rate improving exponentially with the overlap size. This is established using the exponential decay of sensitivity result, which applies to both primal and dual solutions under uniform second-order sufficient conditions, controllability, and boundedness. Global convergence is proven for quadratic programs, enabling integration into second-order optimization algorithms.

What carries the argument

The exponential decay of sensitivity (EDS) property, which states that the effect of perturbations at the initial and terminal times of a subdomain decays exponentially away from the boundaries.
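
A small numerical illustration may help fix the idea. The sketch below assumes a toy quadratic tracking problem (not one of the paper's test problems) whose optimality condition is a tridiagonal system, and it looks only at primal sensitivity; the paper's result also covers dual variables. All names (`A`, `sensitivity`, `rho`, `ratios`) are illustrative.

```python
import numpy as np

n, gamma = 40, 1.0
# Optimality system A x = r of
#   0.5*sum((x_k - r_k)^2) + 0.5*gamma*sum((x_{k+1} - x_k)^2)
A = (1 + 2 * gamma) * np.eye(n) - gamma * (np.eye(n, k=1) + np.eye(n, k=-1))
A[0, 0] = A[-1, -1] = 1 + gamma

# Sensitivity of every stage to a unit perturbation of the data at stage 0,
# i.e. the first column of the inverse of A.
sensitivity = np.linalg.solve(A, np.eye(n)[:, 0])

# Decaying root of gamma*rho^2 - (1 + 2*gamma)*rho + gamma = 0, the
# characteristic equation of the interior rows of A.
rho = ((1 + 2 * gamma) - np.sqrt(1 + 4 * gamma)) / (2 * gamma)

# Ratio of successive sensitivities over the first ten stages.
ratios = sensitivity[1:11] / sensitivity[:10]
print(rho, ratios)
```

In this toy setting each step away from the perturbed boundary multiplies the sensitivity by approximately `rho`, which is the geometric decay pattern that the EDS property asserts for nonlinear OCPs under its assumptions.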

If this is right

  • The algorithm achieves local linear convergence on nonlinear OCPs.
  • Convergence rate improves exponentially as overlap size increases.
  • Global convergence holds for general quadratic programs.
  • The Schwarz scheme can be applied inside sequential quadratic programming methods.
  • Parallel subdomain solves yield efficiency gains over centralized solvers in tested cases.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

  • This suggests the method could scale well to problems with very long time horizons where centralized solves become intractable.
  • Similar decay properties might be exploitable in spatial domain decompositions for PDE-constrained optimization.
  • The parallel structure implies potential speedups on distributed computing architectures.

Load-bearing premise

The exponential decay of sensitivity holds for both the primal and dual solutions of the nonlinear optimal control problem under the uniform second-order sufficient condition, controllability condition, and boundedness condition.

What would settle it

A counterexample nonlinear optimal control problem where boundary perturbations cause non-exponentially decaying effects on the solution, or numerical evidence that the observed convergence rate does not improve with larger overlaps.
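
The second kind of evidence can be probed numerically. The sketch below is an assumption-laden toy check, not the paper's experiment: it uses a quadratic tracking problem with a tridiagonal optimality system, builds the matrix `T` that maps the error before one Schwarz sweep to the error after it, and reports its infinity norm as an upper bound on the per-sweep contraction in the max norm. The function name `contraction_bound` is hypothetical.

```python
import numpy as np

def contraction_bound(n, n_sub, overlap, gamma=1.0):
    """Infinity norm of the one-sweep error-propagation matrix T (e_new = T e_old)
    for a toy tridiagonal problem; an illustrative stand-in for an OCP."""
    A = (1 + 2 * gamma) * np.eye(n) - gamma * (np.eye(n, k=1) + np.eye(n, k=-1))
    A[0, 0] = A[-1, -1] = 1 + gamma
    edges = np.linspace(0, n, n_sub + 1).astype(int)
    T = np.zeros((n, n))
    for i in range(n_sub):
        lo, hi = edges[i], edges[i + 1]                     # non-overlapping core
        a, b = max(lo - overlap, 0), min(hi + overlap, n)   # expanded subdomain
        inside = np.arange(a, b)
        outside = np.setdiff1d(np.arange(n), inside)
        # Subproblem error satisfies A_ii e_inside = -A_io e_outside.
        block = -np.linalg.solve(A[np.ix_(inside, inside)],
                                 A[np.ix_(inside, outside)])
        # Only the rows belonging to the core are retained by the scheme.
        T[np.ix_(np.arange(lo, hi), outside)] = block[lo - a:hi - a, :]
    return float(np.linalg.norm(T, ord=np.inf))

bounds = [contraction_bound(n=60, n_sub=3, overlap=d) for d in range(5)]
for d, c in enumerate(bounds):
    print(f"overlap={d}: contraction bound {c:.2e}")
```

On this toy problem the bound shrinks by a roughly constant factor for each added stage of overlap. A plateau or an increase on a problem that satisfies the paper's assumptions would be the kind of contrary evidence described above.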

Figures

Figures reproduced from arXiv: 2005.06674 by Mihai Anitescu, Sen Na, Sungho Shin, Victor M. Zavala.

Figure 1: Overlapping Schwarz decomposition scheme for OCPs.
Figure 2: Schematic of restriction operation.
Figure 3: Primal-dual exponential decay of sensitivity for quadrotor problem.
Figure 5: Convergence of primal trajectory with τ̃ = 0.1. Top to bottom: iterations 1, 2, and 3; blue, red, and green markers are solutions from subproblems 1, 2, and 3, respectively; the black line is the solution trajectory.
Figure 6: Benchmark of overlapping Schwarz against Ipopt and ADMM.
Original abstract

We study the convergence properties of an overlapping Schwarz decomposition algorithm for solving nonlinear optimal control problems (OCPs). The algorithm decomposes the time domain into a set of overlapping subdomains, and solves all subproblems defined over subdomains in parallel. The convergence is attained by updating primal-dual information at the boundaries of overlapping subdomains. We show that the algorithm exhibits local linear convergence, and that the convergence rate improves exponentially with the overlap size. We also establish global convergence results for a general quadratic programming, which enables the application of the Schwarz scheme inside second-order optimization algorithms (e.g., sequential quadratic programming). The theoretical foundation of our convergence analysis is a sensitivity result of nonlinear OCPs, which we call "exponential decay of sensitivity" (EDS). Intuitively, EDS states that the impact of perturbations at domain boundaries (i.e. initial and terminal time) on the solution decays exponentially as one moves into the domain. Here, we expand a previous analysis available in the literature by showing that EDS holds for both primal and dual solutions of nonlinear OCPs, under uniform second-order sufficient condition, controllability condition, and boundedness condition. We conduct experiments with a quadrotor motion planning problem and a PDE control problem to validate our theory; and show that the approach is significantly more efficient than ADMM and as efficient as the centralized solver Ipopt.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The manuscript analyzes an overlapping Schwarz domain decomposition algorithm for nonlinear optimal control problems (OCPs). The time horizon is partitioned into overlapping subintervals whose subproblems are solved in parallel; convergence is obtained by exchanging primal-dual boundary data. Under uniform second-order sufficient conditions, controllability, and boundedness, the authors establish an exponential decay of sensitivity (EDS) property for both primal and dual solutions. This yields local linear convergence whose rate improves exponentially with overlap size. Global convergence is proved for the quadratic-programming case, permitting the scheme to be used inside SQP methods. Numerical results on a quadrotor planning problem and a PDE control problem show the method is faster than ADMM and competitive with the centralized solver Ipopt.

Significance. If the EDS property and the attendant convergence statements hold, the paper supplies a theoretically grounded, parallelizable alternative to centralized and ADMM-type solvers for large-scale nonlinear OCPs. The extension of EDS to dual variables and the global QP result are concrete strengths that directly support embedding inside second-order algorithms. The exponential dependence of the rate on overlap size supplies a clear, tunable performance guarantee that is rare in decomposition methods for optimal control.

major comments (2)
  1. [§3.3, Theorem 4.2] The local linear convergence rate is stated to improve exponentially with overlap length, yet the proof sketch only invokes the EDS constant without deriving an explicit dependence of that constant on the overlap parameter; a quantitative bound is required to substantiate the exponential claim.
  2. [§4.1, Assumption 3] The uniform boundedness condition on the primal-dual trajectories is used to close the EDS argument for both primal and dual variables; the manuscript does not indicate whether this assumption can be verified a priori or is automatically satisfied under the stated SOSC and controllability conditions.
minor comments (3)
  1. [Definition 2.1] The notation for the overlap size and the decay constant should be unified between the statement of EDS (Definition 2.1) and the convergence theorems.
  2. [Figure 3] Figure 3 (quadrotor trajectories) lacks error bars or multiple random initializations; a single run does not fully illustrate robustness of the observed speedup.
  3. [Section 5] The comparison with Ipopt reports wall-clock time but omits the number of iterations or function evaluations; adding these metrics would strengthen the efficiency claim.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading, the positive assessment of the contribution, and the recommendation for minor revision. We address each major comment below.

point-by-point responses
  1. Referee: [§3.3, Theorem 4.2] The local linear convergence rate is stated to improve exponentially with overlap length, yet the proof sketch only invokes the EDS constant without deriving an explicit dependence of that constant on the overlap parameter; a quantitative bound is required to substantiate the exponential claim.

    Authors: We agree that an explicit quantitative link strengthens the statement. The EDS property (Theorems 3.1 and 3.2) already yields a decay factor of the form C ρ^d, where d is the distance from the perturbed boundary and ρ < 1 is independent of the overlap length δ. Substituting d = δ into the fixed-point argument of Theorem 4.2 produces a contraction constant bounded by K ρ^δ for a problem-dependent K. In the revised manuscript we will add a corollary to Theorem 4.2 that records this explicit exponential dependence on δ.
    Revision: yes
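
    The substitution described in this response can be written out as a short worked inequality. This is a schematic restatement using the response's symbols K, ρ, and δ; the error measure e^{(ℓ)} and the iteration index ℓ are notation assumed here, not taken from the paper.

```latex
\[
  e^{(\ell+1)} \;\le\; K\rho^{\delta}\, e^{(\ell)}
  \quad\Longrightarrow\quad
  e^{(\ell)} \;\le\; \bigl(K\rho^{\delta}\bigr)^{\ell}\, e^{(0)},
  \qquad
  K\rho^{\delta} < 1
  \iff
  \delta > \frac{\ln K}{\ln(1/\rho)} .
\]
```

    Read this way, a bound of the form K ρ^δ guarantees contraction only once the overlap exceeds the threshold ln K / ln(1/ρ), and each additional unit of overlap beyond it multiplies the bound by ρ.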

  2. Referee: [§4.1, Assumption 3] The uniform boundedness condition on the primal-dual trajectories is used to close the EDS argument for both primal and dual variables; the manuscript does not indicate whether this assumption can be verified a priori or is automatically satisfied under the stated SOSC and controllability conditions.

    Authors: Assumption 3 is not implied by SOSC and controllability alone; counter-examples exist on unbounded domains. It is, however, satisfied under standard additional hypotheses (compact control sets, quadratic growth of the running cost, or finite-horizon turnpike-type bounds). We will revise the paragraph following Assumption 3 to list these sufficient conditions together with a short reference to the relevant turnpike literature.
    Revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation self-contained

full rationale

The paper establishes local linear convergence (with exponential improvement in overlap) and global QP convergence as consequences of the EDS property. EDS itself is proven from scratch under explicit, standard assumptions (uniform SOSC, controllability, boundedness) that do not presuppose the Schwarz convergence result. The abstract explicitly frames the contribution as an extension of a sensitivity analysis available in the prior literature rather than a self-referential definition or fitted prediction. No equations reduce the claimed predictions to inputs by construction, and no load-bearing uniqueness theorem is imported solely via self-citation. The derivation chain is therefore independent of the target claims.

Axiom & Free-Parameter Ledger

0 free parameters · 3 axioms · 0 invented entities

The central claim rests on the exponential decay of sensitivity property. The abstract does not derive this property; it states that EDS is shown to hold under three domain conditions from optimal control theory.

axioms (3)
  • domain assumption: uniform second-order sufficient condition
    Invoked as necessary for EDS to hold for primal and dual solutions of the OCP.
  • domain assumption: controllability condition
    Invoked as necessary for EDS to hold.
  • domain assumption: boundedness condition
    Invoked as necessary for EDS to hold.

pith-pipeline@v0.9.0 · 5785 in / 1296 out tokens · 31902 ms · 2026-05-24T14:02:50.175076+00:00 · methodology


Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Controllability and Observability Imply Exponential Decay of Sensitivity in Dynamic Optimization

    math.OC · 2021-01 · unverdicted · novelty 6.0

    Uniform controllability and observability imply exponential decay of sensitivity under uniform Hessian boundedness, uSOSC, and uLICQ in dynamic optimization.

  2. A Julia Framework for Graph-Structured Nonlinear Optimization

    math.OC · 2022-04 · unverdicted · novelty 4.0

    A Julia framework combines Plasmo.jl and MadNLP.jl to model and solve graph-structured nonlinear optimization problems, demonstrated on a large stochastic gas network instance with over 1.7 million variables.
