Solving Quadratic Programs with Slack Variables via ADMM without Increasing the Problem Size
Pith reviewed 2026-05-17 23:29 UTC · model grok-4.3
The pith
A modified ADMM projection step solves quadratic programs with implicit slack variables without enlarging the problem.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Altering only the projection that appears in the z-update of the standard ADMM algorithm for a quadratic program makes the resulting iterates and convergence behavior identical to those of ADMM applied to the same program after slack variables have been added explicitly.
What carries the argument
A modified projection operator inside the z-update of ADMM that implicitly enforces the effect of slack variables.
If this is right
- The new scheme produces exactly the same sequence of iterates as the slack-augmented ADMM.
- All convergence guarantees already known for the augmented formulation transfer directly.
- Problem dimension and matrix sizes remain those of the original QP.
- Numerical experiments confirm measurable speedups on benchmark instances.
Where Pith is reading between the lines
- Existing ADMM implementations can adopt the change with a single-line modification to the projection routine.
- The same projection adjustment may extend to other proximal algorithms that currently rely on explicit slack augmentation.
- Real-time applications such as model-predictive control could benefit from the reduced per-iteration cost.
Load-bearing premise
Changing only the z-update projection produces identical iterates and convergence behavior to the version of the algorithm that adds slack variables explicitly.
What would settle it
Run both the proposed method and standard ADMM on the same QP with added slacks and check whether every iterate matches to machine precision.
Figures
read the original abstract
Proximal methods such as the Alternating Direction Method of Multipliers (ADMM) are effective at solving constrained quadratic programs (QPs). To tackle infeasible QPs, slack variables are often introduced to ensure feasibility, which changes the structure of the problem, increases its size, and slows down numerical resolution. In this letter, we propose a simple ADMM scheme to tackle QPs with slack variables without increasing the size of the original problem. The only modification is a slightly different projection in the z-update, while the rest of the algorithm remains standard. We prove that the method is equivalent to applying ADMM to the QP with additional slack variables, even though slack variables are not added. Numerical experiments show speedups of the approach.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes an ADMM variant for quadratic programs (QPs) that handles infeasibility via slack variables without augmenting the problem dimension. The sole algorithmic change is a modified projection operator in the z-update; the x-update and dual update remain standard. The central claim is a proof that this yields iterates and convergence behavior identical to those of ADMM applied to the explicitly slack-augmented QP. Numerical experiments report speedups relative to the augmented formulation.
Significance. If the equivalence holds for both equality and inequality constraints and the full primal-dual trajectories match, the result would allow practitioners to obtain the benefits of slack variables (feasibility recovery) at the computational cost of the original-sized QP. This is particularly relevant for large-scale or real-time QP solvers where dimension increase is prohibitive. The parameter-free nature of the modification and the reported speedups strengthen its practical value.
major comments (2)
- [§3] §3 (Equivalence proof): The derivation of the modified z-projection must explicitly verify that the dual-ascent step produces identical multipliers to the augmented formulation. Because the slack variables are eliminated analytically, the interaction between the augmented quadratic term and the dual update must be shown to cancel exactly; otherwise the sequences diverge even if the fixed point is the same.
- [§4.2] §4.2 (Numerical validation): The experiments compare solve times and final objective values but do not report the norm of the difference between the proposed iterates and those of the slack-augmented ADMM at each iteration. Without this check, it remains unclear whether the claimed equivalence holds throughout the trajectory or only at convergence.
minor comments (2)
- [§2.3] The projection formula in the z-update should be stated as an explicit operator (e.g., Eq. (12)) rather than described only in prose, to facilitate direct implementation and verification.
- [§3] Clarify whether the equivalence extends to the case of inequality constraints with nonzero lower bounds; the current statement appears to assume the standard nonnegativity slack form.
Simulated Author's Rebuttal
We are grateful to the referee for the insightful comments on the equivalence proof and numerical experiments. Below we address each point and outline the planned revisions to the manuscript.
read point-by-point responses
-
Referee: §3 (Equivalence proof): The derivation of the modified z-projection must explicitly verify that the dual-ascent step produces identical multipliers to the augmented formulation. Because the slack variables are eliminated analytically, the interaction between the augmented quadratic term and the dual update must be shown to cancel exactly; otherwise the sequences diverge even if the fixed point is the same.
Authors: In our proof of equivalence (Theorem 1), we show that the modified projection in the z-update exactly replicates the effect of the slack variables in the augmented QP. Because the x- and dual-updates are identical to those in the augmented formulation, and the z-update is constructed to yield the same z (after analytic elimination of slacks), the dual variables are identical by construction at every iteration. The cancellation of the augmented quadratic term with the projection adjustment is implicit in the derivation. To make this explicit as suggested, we will revise Section 3 to include a dedicated paragraph verifying the dual-ascent step. revision: yes
-
Referee: §4.2 (Numerical validation): The experiments compare solve times and final objective values but do not report the norm of the difference between the proposed iterates and those of the slack-augmented ADMM at each iteration. Without this check, it remains unclear whether the claimed equivalence holds throughout the trajectory or only at convergence.
Authors: We agree that an explicit comparison of the full trajectories would strengthen the validation of the equivalence claim. In the revised manuscript, we will include plots or tables reporting the norm of the difference between the iterates (including primal and dual variables) of the proposed method and the standard slack-augmented ADMM at each iteration for the numerical examples in Section 4.2. revision: yes
Circularity Check
No circularity: direct equivalence proof via modified projection operator
full rationale
The paper derives an equivalence between standard ADMM on the original QP and ADMM on the slack-augmented QP by analytically eliminating the slack variables from the z-update projection step. This is a self-contained algebraic reduction shown against the explicit augmented formulation; the iterates, dual updates, and fixed points are shown to match by direct substitution into the standard ADMM loop rather than by fitting parameters or invoking self-citations. No load-bearing step reduces to a definition of the target result or to a prior result by the same authors. The derivation is therefore independent of the claimed outcome.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Standard convergence properties of ADMM for convex quadratic programs
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The only modification is a slightly different projection in the z-update... z = gProj[ℓ,u](Ax + μ/ρ)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
B. Houska and M. E. Villanueva, “Robust optimization for MPC,” inHandbook of Model Predictive Control. Springer International Publishing, 2018, pp. 413–443
work page 2018
-
[2]
Cerberus in the DARPA Subterranean Challenge,
M. Tranzatto, T. Miki, M. Dharmadhikari, L. Bernreiter, M. Kulkarni, F. Mascarich, O. Andersson, S. Khattak, M. Hut- ter, R. Siegwartet al., “Cerberus in the DARPA Subterranean Challenge,”Science Robotics, vol. 7, no. 66, p. eabp9742, 2022
work page 2022
-
[3]
Optimization-based control for dynamic legged robots,
P. M. Wensing, M. Posa, Y . Hu, A. Escande, N. Mansard, and A. Del Prete, “Optimization-based control for dynamic legged robots,”IEEE Transactions on Robotics, 2023
work page 2023
-
[4]
E. Adabag, M. Atal, W. Gerard, and B. Plancher, “MPCGPU: Real-time nonlinear model predictive control through precondi- tioned conjugate gradient on the GPU,” inProc. IEEE Conf. on Robotics and Automation, 2024
work page 2024
- [5]
-
[6]
Risk-averse model predictive control for racing in adverse conditions,
T. Lew, M. Greiff, F. Djeumou, M. Suminaka, M. Thompson, and J. Subosits, “Risk-averse model predictive control for racing in adverse conditions,” inProc. IEEE Conf. on Robotics and Automation, 2025
work page 2025
-
[7]
Distributed MPC strategies with application to power system automatic generation control,
A. N. Venkat, I. A. Hiskens, J. B. Rawlings, and S. J. Wright, “Distributed MPC strategies with application to power system automatic generation control,”IEEE transactions on control systems technology, vol. 16, no. 6, pp. 1192–1206, 2008
work page 2008
-
[8]
Model predictive con- trol: A review of its applications in power electronics,
S. Vazquez, J. I. Leon, L. G. Franquelo, J. Rodriguez, H. A. Young, A. Marquez, and P. Zanchetta, “Model predictive con- trol: A review of its applications in power electronics,”IEEE industrial electronics magazine, vol. 8, no. 1, pp. 16–31, 2014
work page 2014
-
[9]
S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, “Dis- tributed optimization and statistical learning via the alternating direction method of multipliers,”Foundations and Trends® in Machine Learning, vol. 3, no. 1, pp. 1–122, 2011
work page 2011
-
[10]
OSQP: an operator splitting solver for quadratic programs,
B. Stellato, G. Banjac, P. Goulart, A. Bemporad, and S. Boyd, “OSQP: an operator splitting solver for quadratic programs,” Mathematical Programming Computation, vol. 12, no. 4, pp. 637–672, 2020
work page 2020
-
[11]
Conic opti- mization via operator splitting and homogeneous self-dual em- bedding,
B. O’donoghue, E. Chu, N. Parikh, and S. Boyd, “Conic opti- mization via operator splitting and homogeneous self-dual em- bedding,”Journal of Optimization Theory and Applications, vol. 169, no. 3, pp. 1042–1068, 2016
work page 2016
-
[12]
Relaxed logarithmic barrier function based model predictive control of linear systems,
C. Feller and C. Ebenbauer, “Relaxed logarithmic barrier function based model predictive control of linear systems,”IEEE Trans- actions on Automatic Control, vol. 62, no. 3, pp. 1223–1238, 2017
work page 2017
-
[13]
Relaxed recentered log-barrier function based nonlinear model predictive control,
T. A. Oancea, Y . Jiang, and C. N. Jones, “Relaxed recentered log-barrier function based nonlinear model predictive control,” inEuropean Control Conference, 2023
work page 2023
-
[14]
N. Parikh and S. Boyd, “Proximal algorithms,”Foundations and Trends in Optimization, vol. 1, no. 3, pp. 127–239, 2014
work page 2014
-
[15]
J. Eckstein and D. P. Bertsekas, “On the Douglas-Rachford splitting method and the proximal point algorithm for maximal monotone operators,”Mathematical Programming, vol. 55, no. 1-3, pp. 293–318, 1992
work page 1992
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.