Alternating Direction Method of Multipliers with Variable Metric Indefinite Proximal Terms for Convex Optimization
Pith reviewed 2026-05-25 14:02 UTC · model grok-4.3
The pith
Variable-metric ADMM with indefinite proximal terms converges globally when the matrices meet stated conditions, and a BFGS construction satisfies them.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We consider a variable metric indefinite proximal ADMM, and give sufficient conditions on the proximal terms for the global convergence. Moreover, we propose a new indefinite proximal term based on the BFGS update which can satisfy the conditions for the global convergence.
What carries the argument
The sequence of variable-metric indefinite proximal matrices, generated by a BFGS-style update, that obeys the sufficient conditions required for the convergence analysis.
If this is right
- Global convergence of the iterates holds for the ADMM even when the proximal terms are allowed to be indefinite.
- The BFGS update supplies a practical way to generate proximal terms that obey the convergence conditions.
- The method applies directly to linearly constrained convex optimization problems.
- Each subproblem remains as easy to solve as in the standard proximal ADMM.
Where Pith is reading between the lines
- The same style of indefinite proximal construction could be tried in other splitting algorithms that rely on proximal terms.
- Numerical tests on specific convex problems could reveal whether the indefinite terms produce measurable speedups compared with semidefinite alternatives.
- The sufficient conditions might admit further weakening while still preserving convergence.
Load-bearing premise
The sequence of indefinite proximal matrices must satisfy the specific sufficient conditions on their properties that the convergence proof relies upon.
What would settle it
A convex optimization instance together with a sequence of proximal matrices obeying the stated conditions for which the generated iterates fail to converge to an optimal solution.
read the original abstract
This paper studies a proximal alternating direction method of multipliers (ADMM) with variable metric indefinite proximal terms for linearly constrained convex optimization problems. The proximal ADMM plays an important role in many application areas, since the subproblems of the method are easy to solve. Recently, it is reported that the proximal ADMM with a certain fixed indefinite proximal term is faster than that with a positive semidefinite term, and still has the global convergence property. On the other hand, Gu and Yamashita studied a variable metric semidefinite proximal ADMM whose proximal term is generated by the BFGS update. They reported that a slightly indefinite matrix also makes the algorithm work well in their numerical experiments. Motivated by this fact, we consider a variable metric indefinite proximal ADMM, and give sufficient conditions on the proximal terms for the global convergence. Moreover, we propose a new indefinite proximal term based on the BFGS update which can satisfy the conditions for the global convergence.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript studies a proximal ADMM with variable metric indefinite proximal terms for linearly constrained convex optimization. It derives sufficient conditions on the sequence of indefinite proximal matrices ensuring global convergence and proposes a BFGS-based construction claimed to satisfy those conditions.
Significance. If the sufficient conditions are correctly stated and the BFGS construction is shown to meet them, the result would extend the theory of proximal ADMM to indefinite metrics while retaining global convergence, providing a theoretical basis for observed practical speedups with slightly indefinite terms.
major comments (2)
- [§3] §3 (convergence theorem): the sufficient conditions require a uniform lower bound on the smallest eigenvalue of a linear combination of the metric and constraint matrix together with a summability condition on successive differences; the manuscript must explicitly verify that the proposed BFGS update preserves these bounds for all iterates, as standard BFGS can produce eigenvalue drift when the secant condition holds only approximately.
- [§4] §4 (BFGS construction): the claim that the new indefinite proximal term based on the BFGS update satisfies the conditions of §3 is load-bearing for the main contribution, yet the provided derivation does not include an error analysis or inductive argument confirming that the eigenvalue lower bound remains positive uniformly.
minor comments (2)
- [Introduction] The reference to the fixed indefinite proximal term result and to Gu-Yamashita should be expanded with precise citations and a short comparison table of convergence rates.
- [§2] Notation for the proximal matrix sequence {B_k} should be introduced once and used consistently; several early equations reuse B without the subscript.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive report. The comments correctly identify that the manuscript's main claims rest on the BFGS construction satisfying the convergence conditions, yet the current text does not supply an explicit inductive verification or error analysis. We will revise the paper to include this material.
read point-by-point responses
-
Referee: [§3] §3 (convergence theorem): the sufficient conditions require a uniform lower bound on the smallest eigenvalue of a linear combination of the metric and constraint matrix together with a summability condition on successive differences; the manuscript must explicitly verify that the proposed BFGS update preserves these bounds for all iterates, as standard BFGS can produce eigenvalue drift when the secant condition holds only approximately.
Authors: We agree that an explicit verification is required. In the revision we will insert a new lemma (with proof) that proceeds by induction on the iterates. The argument exploits the specific form of the indefinite BFGS update together with the fact that the secant condition is enforced only on the range of the constraint matrix; this prevents the eigenvalue drift that can occur in the classical positive-definite BFGS setting. The summability condition on successive differences will also be verified directly from the update formula. revision: yes
-
Referee: [§4] §4 (BFGS construction): the claim that the new indefinite proximal term based on the BFGS update satisfies the conditions of §3 is load-bearing for the main contribution, yet the provided derivation does not include an error analysis or inductive argument confirming that the eigenvalue lower bound remains positive uniformly.
Authors: We acknowledge the gap. The revised manuscript will contain a dedicated subsection (or appendix) that supplies the missing inductive argument and a uniform lower-bound estimate on the relevant eigenvalue. The proof will be self-contained and will not rely on external results beyond standard properties of the BFGS update under the paper's assumptions. revision: yes
Circularity Check
Convergence proof and BFGS construction are independent of each other and of prior self-citations.
full rationale
The paper states sufficient conditions on the sequence of indefinite proximal matrices and separately constructs a BFGS-based term asserted to meet those conditions. No equation reduces the claimed convergence to a fitted parameter or to a definition that presupposes the result. The citation to the authors' prior semidefinite BFGS work supplies only motivation and background; the new indefinite conditions and their verification are developed within the present manuscript without load-bearing reliance on unverified self-referential steps. The derivation therefore remains self-contained against external convex-analysis benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The problem is a linearly constrained convex optimization problem.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We present sufficient conditions on {T_k} for the global convergence of VMIP-ADMM... we propose a new indefinite proximal term based on the BFGS update which can satisfy the conditions
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Condition 2.1... (g) ∃ c ∈ (0, 0.5), T_k + 3/2 Σ_g - γ_{k-1/2} T_k^+ - 2T^- + (3/4 - 1/2 c)βBᵀB ⪰ 0
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
H. A TTOUCH , L. M. B RICENO -A RIAS , AND P. L. C OMBETTES , A parallel splitting method for coupled monotone inclusions , SIAM Journal on Control and Optimization, 48 (2010), pp. 32 46– 3270
work page 2010
-
[2]
S. B ANERT , R. I. B OT, AND E. R. C SETNEK , Fixing and extending some recent results on the ADMM algorithm, arXiv preprint arXiv:1612.05057, (2016)
-
[3]
H. H. B AUSCHKE , P. L. C OMBETTES , ET AL ., Convex analysis and monotone operator theory in Hilbert spaces, vol. 408, Springer, 2011
work page 2011
- [4]
-
[5]
A. C HAMBOLLE AND T. P OCK , A first-order primal-dual algorithm for convex problems wit h applications to imaging, Journal of mathematical imaging and vision, 40 (2011), pp. 120–145
work page 2011
-
[6]
G. C HEN AND M. T EBOULLE , A proximal-based decomposition method for convex minimiza tion problems, Mathematical Programming, 64 (1994), pp. 81–101
work page 1994
- [7]
-
[8]
P. L. C OMBETTES , Iterative construction of the resolvent of a sum of maximal monotone operators, J. Convex Anal, 16 (2009), pp. 727–748
work page 2009
-
[9]
W. D ENG AND W. Y IN, On the global and linear convergence of the generalized alte rnating di- rection method of multipliers, tech. rep., DTIC Document, 2012. [10] , On the global and linear convergence of the generalized alte rnating direction method of multipliers, Journal of Scientific Computing, 66 (2016), pp. 889–916. 18
work page 2012
-
[10]
J. D OUGLAS AND H. R ACHFORD , On the numerical solution of heat conduction problems in two and three space variables , Transactions of the American mathematical Society, (1956 ), pp. 421– 439
work page 1956
-
[11]
J. E CKSTEIN , Some saddle-function splitting methods for convex program ming, Optimization Methods and Software, 4 (1994), pp. 75–83
work page 1994
-
[12]
J. E CKSTEIN AND D. P. B ERTSEKAS , On the douglas-rachford splitting method and the proximal point algorithm for maximal monotone operators, Mathematical Programming, 55 (1992), pp. 293– 318
work page 1992
-
[13]
J. E CKSTEIN AND W. YAO, Understanding the convergence of the alternating direction method of multipliers: Theoretical and computational perspectives , Pac. J. Optim., 11 (2015), pp. 619–644
work page 2015
-
[14]
, Approximate ADMM algorithms derived from Lagrangian split ting, Computational Opti- mization and Applications, 68 (2017), pp. 363–405
work page 2017
-
[15]
, Relative-error approximate versions of Douglas–Rachford splitting and special cases of the ADMM, Mathematical Programming, 170 (2018), pp. 417–444
work page 2018
-
[16]
E. E SSER , X. Z HANG , AND T. C HAN , A general framework for a class of first order primal-dual algorithms for tv minimization , Ucla Cam Report, (2009), pp. 09–67
work page 2009
-
[17]
M. F AZEL , T. K. P ONG , D. S UN, AND P. T SENG , Hankel matrix rank minimization with applica- tions to system identification and realization , SIAM Journal on Matrix Analysis and Applications, 34 (2013), pp. 946–977
work page 2013
-
[18]
M. F ORTIN AND R. G LOWINSKI , Chapter iii on decomposition-coordination methods using a n augmented lagrangian, in Studies in Mathematics and Its Applications, vol. 15, El sevier, 1983, pp. 97–146
work page 1983
-
[19]
D. G ABAY AND B. M ERCIER , A dual algorithm for the solution of nonlinear variational p roblems via finite element approximation , Computers & Mathematics with Applications, 2 (1976), pp. 1 7– 40
work page 1976
-
[20]
B. G AO AND F. M A, Symmetric alternating direction method with indefinite pro ximal regulariza- tion for linearly constrained convex optimization, Journal of Optimization Theory and Applications, 176 (2018), pp. 178–204
work page 2018
-
[21]
R. G LOWINSKI AND A. M ARROCO , Sur l’approximation, par ´el´ements finis d’ordre un, et la r ´esolution, par p ´enalisation-dualit´e d’une classe de probl `emes de dirichlet non lin ´eaires, ESAIM: Mathematical Modelling and Numerical Analysis-Mod ´ elisation Math´ ematique et Anal- yse Num´ erique, 9 (1975), pp. 41–76
work page 1975
-
[22]
M. L. N. G ONC¸ ALVES , M. M. A LVES, AND J. G. M ELO , Pointwise and ergodic convergence rates of a variable metric proximal alternating direction method of multipliers, Journal of Optimization Theory and Applications, 177 (2018), pp. 448–478. 19
work page 2018
-
[23]
Y. G U, B. J IANG , AND D. H AN, An indefinite-proximal-based strictly contractive Peacem an- Rachford splitting method, tech. rep., 2019
work page 2019
-
[24]
Y. G U AND N. YAMASHITA , An alternating direction method of multipliers with the BFG S update for structured convex quadratic optimization, arXiv e-prints arXiv:1903.02270, (2019)
work page internal anchor Pith review Pith/arXiv arXiv 1903
- [25]
- [26]
-
[27]
B. H E, H. L IU, Z. W ANG , AND X. Y UAN, A strictly contractive Peaceman–Rachford splitting method for convex programming, SIAM Journal on Optimization, 24 (2014), pp. 1011–1040
work page 2014
-
[28]
B. H E, F. M A, AND X. Y UAN, Optimal linearized alternating direction method of multip liers for convex programming, Avaliable on http://www. optimization-online. org, (201 7)
- [29]
- [30]
-
[31]
F. J IANG , Z. W U, AND X. C AI, Generalized admm with optimal indefinite proximal term for l in- early constrained convex optimization, Journal of Industrial & Management Optimization, (2018), pp. 183–202
work page 2018
-
[32]
K. K OH, S.-J. K IM, AND S. B OYD, An interior-point method for large-scale l1-regularized logistic regression, Journal of Machine learning research, 8 (2007), pp. 1519–1 555
work page 2007
-
[33]
M. L I, D. S UN, AND K.-C. T OH, A majorized admm with indefinite proximal terms for linearly constrained convex composite optimization , SIAM Journal on Optimization, 26 (2016), pp. 922– 950
work page 2016
-
[34]
P.-L. L IONS AND B. M ERCIER , Splitting algorithms for the sum of two nonlinear operators, SIAM Journal on Numerical Analysis, 16 (1979), pp. 964–979
work page 1979
-
[35]
P. A. L OTITO , L. A. P ARENTE , AND M. S OLODOV , A class of variable metric decomposition methods for monotone variational inclusions , J. Convex Anal, 16 (2009), pp. 857–880
work page 2009
-
[36]
Y. N ESTEROV , Gradient methods for minimizing composite functions, Mathematical Programming, 140 (2013), pp. 125–161
work page 2013
-
[37]
L. I. R UDIN , S. O SHER , AND E. F ATEMI , Nonlinear total variation based noise removal algo- rithms, Physica D: nonlinear phenomena, 60 (1992), pp. 259–268
work page 1992
-
[38]
H. T REVOR , T. R OBERT , AND F. JH, The elements of statistical learning: data mining, inferen ce, and prediction, 2009. 20
work page 2009
-
[39]
P. T SENG , Approximation accuracy, gradient methods, and error bound for structured convex op- timization, Mathematical Programming, 125 (2010), pp. 263–295
work page 2010
-
[40]
P. T SENG AND S. Y UN, A coordinate gradient descent method for nonsmooth separable minimiza- tion, Mathematical Programming, 117 (2009), pp. 387–423
work page 2009
- [41]
-
[42]
W. Y IN, S. O SHER , D. G OLDFARB , AND J. D ARBON , Bregman iterative algorithms for l1- minimization with applications to compressed sensing , SIAM Journal on Imaging sciences, 1 (2008), pp. 143–168
work page 2008
-
[43]
Y UAN, The improvement with relative errors of he et al
X. Y UAN, The improvement with relative errors of he et al. ’s inexact a lternating direction method for monotone variational inequalities, Mathematical and computer modelling, 42 (2005), pp. 1225– 1236. 21
work page 2005
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.