Resolvent Moreau identities without monotonicity: theory and applications to Gabay duality, Douglas--Rachford and ADMM
Pith reviewed 2026-05-20 09:21 UTC · model grok-4.3
The pith
A duality relationship between resolvents holds for any set-valued mapping and generalizes Moreau's identity without monotonicity.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For any set-valued mapping with single-valued resolvent, a duality identity holds that relates the resolvent of the mapping to the resolvent of a complementary operator derived from it. This identity is a generalization of Moreau's identity that requires neither convexity nor monotonicity. The authors apply the identity to extend Gabay duality between Douglas-Rachford splitting and ADMM to the nonmonotone setting, derive explicit counterexamples to ADMM convergence via the easier-to-analyze Douglas-Rachford scheme, and introduce a class of convergent resolvent homotopy schemes for nonconvex problems.
What carries the argument
The resolvent Moreau identity for arbitrary set-valued mappings, which equates the resolvent of a mapping A to an expression involving the resolvent of its complementary operator and thereby carries the duality argument.
If this is right
- Gabay duality between Douglas-Rachford splitting and ADMM extends directly to nonmonotone inclusions.
- Convergence counterexamples for ADMM in open cases follow from direct analysis of the Douglas-Rachford scheme.
- A family of resolvent homotopy schemes converges for selected nonconvex-regularised least absolute deviations problems.
- The same identity supplies explicit conditions under which the new schemes remain well-defined and convergent.
Where Pith is reading between the lines
- The identity could be used to derive similar dualities for other primal-dual splitting methods beyond Douglas-Rachford and ADMM.
- The homotopy schemes might be adapted to additional nonconvex regularized inverse problems that lack strong convexity in the smooth term.
Load-bearing premise
The resolvents of the set-valued mappings must be well-defined and single-valued in the relevant domains.
What would settle it
A concrete nonmonotone set-valued mapping whose resolvent is single-valued but for which the claimed duality identity fails to hold at some point in the domain.
Figures
read the original abstract
Duality is most often defined as a relationship between convex functions. If those functions are nonconvex, classical duality breaks down. Notwithstanding, we show that another kind of duality still exists, not between the functions themselves, but between the so-called resolvent operators used to solve associated problems. In fact, this duality-like relationship holds for any set-valued mapping, and is a generalization of the Moreau's identity. We use this duality to study existing operator schemes and to design new ones. In particular, we show that the duality-like relationship Daniel Gabay illuminated between the Douglas--Rachford splitting (DR) and the Alternating Direction Method of Multipliers (ADMM) extends to nonmonotone inclusion problems. We use this relationship to provide explicit counterexamples to the convergence of ADMM in several open cases, by studying the (easier to analyse) DR scheme. Motivated by our observations, we design a class of convergent resolvent homotopy schemes and use them to solve nonconvex-regularised least absolute deviations problems. This important problem class has received little attention in the literature, since the convex component of the objective does not enjoy strong convexity.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript establishes a resolvent duality identity J_{A^{-1}}(x) = x - J_A(x) that holds for arbitrary set-valued mappings A whenever the resolvents exist at the evaluation point, without requiring monotonicity or convexity. This identity is used to generalize Gabay duality between Douglas-Rachford splitting and ADMM to nonmonotone inclusions, to construct explicit counterexamples showing nonconvergence of ADMM in previously open cases, and to design a family of convergent resolvent homotopy schemes that are applied to nonconvex-regularized least absolute deviations problems.
Significance. If the algebraic identity and its applications are correctly derived, the work is significant because it removes the monotonicity barrier that has limited operator-splitting methods to convex settings, supplies concrete counterexamples that clarify the boundary of convergence for ADMM, and introduces practical convergent schemes for an important nonconvex problem class that lacks strong convexity. The direct, parameter-free character of the core identity is a strength.
major comments (2)
- [§3.1, Theorem 3.2] §3.1, Theorem 3.2: the claim that the duality identity extends Gabay duality to nonmonotone inclusions requires explicit verification that the fixed-point sets of the DR and ADMM operators remain in bijection under the resolvent map when monotonicity is dropped; the current argument only shows the algebraic relation but does not address whether the iteration sequences are correspondingly related.
- [§4.2, Example 4.5] §4.2, Example 4.5: the counterexample for ADMM nonconvergence is constructed via the DR scheme; it is necessary to confirm that the chosen nonmonotone operator A satisfies the domain conditions under which both resolvents are single-valued everywhere, otherwise the reduction from ADMM to DR may not be valid for the reported divergence.
minor comments (2)
- [§2] The notation distinguishing single-valued resolvents from set-valued ones is introduced only in §2; repeating the single-valuedness assumption in the statements of the main applications (Theorems 3.2 and 5.1) would improve readability.
- [Figure 2] Figure 2 caption does not specify the step-size schedule used for the homotopy scheme; adding this detail would allow direct reproduction of the plotted trajectories.
Simulated Author's Rebuttal
We thank the referee for the positive evaluation and the recommendation of minor revision. The comments are constructive and we address them point by point below, making the necessary clarifications in the revised manuscript.
read point-by-point responses
-
Referee: [§3.1, Theorem 3.2]: the claim that the duality identity extends Gabay duality to nonmonotone inclusions requires explicit verification that the fixed-point sets of the DR and ADMM operators remain in bijection under the resolvent map when monotonicity is dropped; the current argument only shows the algebraic relation but does not address whether the iteration sequences are correspondingly related.
Authors: The referee correctly notes that the algebraic identity alone does not automatically guarantee the full correspondence of iteration sequences. However, because the DR and ADMM operators are defined pointwise via the resolvents, and the identity J_{A^{-1}}(x) = x - J_A(x) holds at every evaluation point where the resolvents exist, the fixed-point sets are in bijection by direct substitution: if z^* satisfies z^* = T_DR(z^*), then u^* = J_A(z^*) satisfies the fixed-point equation of the ADMM operator. The same pointwise relation maps each iterate of one scheme to the corresponding iterate of the other, so the sequences remain related throughout. We will insert a short clarifying paragraph after Theorem 3.2 to spell out this step-by-step correspondence. revision: yes
-
Referee: [§4.2, Example 4.5]: the counterexample for ADMM nonconvergence is constructed via the DR scheme; it is necessary to confirm that the chosen nonmonotone operator A satisfies the domain conditions under which both resolvents are single-valued everywhere, otherwise the reduction from ADMM to DR may not be valid for the reported divergence.
Authors: We agree that explicit domain verification is required for the reduction to be rigorous. In Example 4.5 the operator A is constructed so that both J_A and J_{A^{-1}} are single-valued and defined on all of R^2; this is verified by direct computation showing that for every x there exists a unique y satisfying x ∈ y + A(y). The same holds for A^{-1}. We will add a sentence in the example stating these domain conditions explicitly so that the validity of the ADMM–DR equivalence is immediate. revision: yes
Circularity Check
No significant circularity; identities follow directly from resolvent definitions
full rationale
The central claim is the resolvent duality identity J_{A^{-1}}(x) = x - J_A(x), which holds for any set-valued operator A whenever the resolvents exist at the relevant points. This follows immediately by algebraic rearrangement from the definition of the resolvent: if y = J_A(x) then x - y ∈ A(y), which is exactly the condition that x - y = J_{A^{-1}}(x). No monotonicity, convexity, or additional structure is required, and the paper presents this as a direct operator identity rather than a fitted or self-referential construction. Subsequent applications to Gabay duality, DR/ADMM extensions, counterexamples, and new schemes are built on this algebraic relation plus standard operator-splitting arguments; no load-bearing step reduces to a self-citation, ansatz, or renaming of a known empirical pattern. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Resolvents of set-valued mappings are well-defined operators to which the identity applies.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Theorem 4 (Generalized Moreau identity) ... JγDH(u) = u + γd + γM J^M_{γ^{-1}H}(-M^*(γ^{-1}u + d))
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We show that the duality-like relationship ... holds for any set-valued mapping
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Arag´ on Artacho and Jonathan M
Francisco J. Arag´ on Artacho and Jonathan M. Borwein. Global convergence of a non-convex Douglas–Rachford iteration.Journal of Global Optimization, 57(3):753–769, 2013
work page 2013
-
[2]
Francisco J. Arag´ on Artacho, Jonathan M. Borwein, and Matthew K. Tam. Douglas–Rachford feasibility methods for matrix completion problems.The ANZIAM Journal, 55(4):299–326, 2014
work page 2014
-
[3]
Arag´ on Artacho and Rub´ en Campoy
Francisco J. Arag´ on Artacho and Rub´ en Campoy. Solving graph coloring problems with the Douglas– Rachford algorithm.Set-Valued and Variational Analysis, 26:277–304, 2018
work page 2018
-
[4]
H.H. Bauschke and P. Combettes.Convex Analysis and Monotone Operator Theory in Hilbert Spaces. Springer, 2nd edition, 2017
work page 2017
-
[5]
Amir Beck.First-order methods in optimization. SIAM, Philadelphia, USA, 2017
work page 2017
-
[6]
Aaron Berk, Simone Brugiapaglia, and Tim Hoheisel. Lasso reloaded: A variational analysis per- spective with applications to compressed sensing.SIAM Journal on Mathematics of Data Science, 5(4):1102–1129, 2023
work page 2023
-
[7]
Jonathan M. Borwein and Matthew K. Tam. Reflection methods for inverse problems with appli- cations to protein conformation determination. InGeneralized Nash Equilibrium Problems, Bilevel Programming and MPEC. Springer, 2012
work page 2012
-
[8]
S. Boyd and L. Vandenberghe.Convex Optimization. Cambridge University Press, 2004
work page 2004
- [9]
-
[10]
Antonin Chambolle and Thomas Pock. A first–order primal–dual algorithm for convex problems with applications to imaging.Journal of Mathematical Imaging and Vision, 40:120–145, 2010
work page 2010
-
[11]
F. H. Clarke.Optimization and nonsmooth analysis, volume 5 ofClassics in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, second edition, 1990
work page 1990
-
[12]
Minh N. Dao and Hung M. Phan. Linear convergence of projection algorithms.Mathematics of Operations Research, 44(2):715–738, 2019
work page 2019
-
[13]
Steven Diamond and Stephen Boyd. CVXPY: A Python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016
work page 2016
- [14]
-
[15]
Jonathan Eckstein and Wang Yao. Understanding the convergence of the alternating direction method of multipliers: Theoretical and computational perspectives.Pacific Journal of Optimization, 11(4):619–644, 2015
work page 2015
-
[16]
Jianqing Fan and Runze Li. Variable selection via nonconcave penalized likelihood and its oracle properties.Journal of the American Statistical Association, 96(456):1348–1360, 2001
work page 2001
-
[17]
Faraway.Linear models with Python
Julian J. Faraway.Linear models with Python. Chapman & Hall, 2020
work page 2020
-
[18]
D. Gabay. Applications of the method of multipliers to variational inequalities. InStudies in Mathematics and its Applications, volume 15, chapter 9, pages 299–331. Elsevier, 1983
work page 1983
-
[19]
Pontus Giselsson and Stephen Boyd. Linear convergence and metric selection for Douglas–Rachford splitting and ADMM.IEEE Transactions on Automatic Control, 62(2):532–544, 2017. 20
work page 2017
-
[20]
Trevor Hastie, Robert Tibshirani, and Ryan Tibshirani. Best subset, forward stepwise or lasso? analysis and recommendations based on extensive comparisons.Institute of Mathematical Statistics, 35:579–592, 2020
work page 2020
-
[21]
Bingsheng He and Xiaoming Yuan. On theo(1/n) convergence rate of the Douglas–Rachford alter- nating direction method.SIAM Journal on Numerical Analysis, 50:700–709, 2012
work page 2012
-
[22]
Robert Hesse and D. Russell Luke. Nonconvex notions of regularity and convergence of fundamental algorithms for feasibility problems.SIAM Journal on Optimization, 23(4):2397–2419, 2013
work page 2013
-
[23]
The interactive geometry software Cinderella.Amer- ican Mathematical Monthly, 107:760, 2000
Ulrich Kortenkamp and J¨ urgen Richter-Gebert. The interactive geometry software Cinderella.Amer- ican Mathematical Monthly, 107:760, 2000
work page 2000
- [24]
-
[25]
Scott B. Lindstrom and Brailey Sims. Survey: sixty years of Douglas–Rachford.Journal of the Australian Mathematical Society, 110(3):333–370, 2021
work page 2021
-
[26]
P.-L. Lions and B. Mercier. Splitting algorithms for the sum of two nonlinear operators.SIAM Journal on Numerical Analysis, 16:964–979, 1979
work page 1979
-
[27]
J.J. Moreau. Proximit´ e et dualit´ e dans un espace hilbertien.Bulletin de la Soci´ et´ e Math´ ematique de France, 93:273–299, 1965
work page 1965
-
[28]
Moursi and Yuriy Zinchenko.A Note on the Equivalence of Operator Splitting Methods, pages 331–349
Walaa M. Moursi and Yuriy Zinchenko.A Note on the Equivalence of Operator Splitting Methods, pages 331–349. Springer International Publishing, 2019
work page 2019
-
[29]
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Pretten- hofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. Scikit-learn: Machine learning in Python.Journal of Machine Learning Research, 12:2825–2830, 2011
work page 2011
-
[30]
Dao, Nima Amjady, and Rakibuzzaman Shah
Tan Nhat Pham, Minh N. Dao, Nima Amjady, and Rakibuzzaman Shah. A proximal splitting algorithm for generalized dc programming with applications in signal recovery.European Journal of Operational Research, 326(1):42–53, 2025
work page 2025
-
[31]
Hung M. Phan. Linear convergence of the Douglas–Rachford method for two closed sets.Optimiza- tion, 65(2):369–385, 2016
work page 2016
-
[32]
Clarice Poon and Jingwei Liang. Trajectory of alternating direction method of multipliers and adaptive acceleration.Advances in neural information processing systems, 32, 2019
work page 2019
-
[33]
Andreas Themelis and Panagiotis Patrinos. Douglas–Rachford splitting and ADMM for nonconvex optimization: Tight convergence results.SIAM Journal on Optimization, 30(1):149–181, 2020
work page 2020
-
[34]
Andreas Themelis, Lorenzo Stella, and Panagiotis Patrinos. Douglas–Rachford splitting and admm for nonconvex optimization: accelerated and Newton-type linesearch algorithms.Computational Optimization and Applications, 82:395–440, 2022
work page 2022
-
[35]
Robert Tibshirani. Regression shrinkage and selection via the LASSO.Journal of the Royal Statis- tical Society, 58:267–288, 1996
work page 1996
-
[36]
Cun-Hui Zhang. Nearly unbiased variable selection under minimax concave penalty.The Annals of Statistics, 38(2):894–942, 2010. 21 A MCP and SCAD By Proposition 3, the well–defined and efficiently–computable expression for prox τ ϕmcp supplies a valid selection ofJ τ ∂ϕmcp [36]. We may thus obtain a closed–form selectantS γ(∂ϕmcp)−1 by combining proxτ ϕmcp...
work page 2010
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.