Spectral conjugate gradient projection methods for large-scale monotone equations without Lipschitz continuity
Pith reviewed 2026-05-20 18:06 UTC · model grok-4.3
The pith
Spectral conjugate gradient projection methods achieve global convergence for monotone equations under monotonicity alone.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors construct two spectral conjugate gradient projection methods. For the first, which generalizes the modified optimal Perry method via an eigenvalue-optimized scaling matrix, they prove global convergence under monotonicity of the equation mapping alone, without requiring Lipschitz continuity. Search directions satisfy a sufficient descent condition independent of the line search.
What carries the argument
Adaptive spectral parameter incorporated into conjugate gradient search directions, ensuring sufficient descent independently of the line search.
If this is right
- The first method applies to monotone problems where Lipschitz continuity fails to hold.
- Both methods remain derivative-free and projection-based, making them suitable for large-scale constrained systems.
- Practical effectiveness is confirmed on problems with dimensions up to 120,000 and on l1-regularized signal recovery and logistic regression.
- Global convergence holds for the first method solely from monotonicity.
Where Pith is reading between the lines
- The same spectral construction could be tested on other first-order methods that currently rely on Lipschitz assumptions.
- The line-search independence may simplify implementation in software libraries for constrained nonlinear equations.
- Extensions to stochastic or inexact variants become plausible once the descent property is decoupled from line search.
Load-bearing premise
The constructed search directions satisfy a sufficient descent condition that holds independently of the specific line search used.
What would settle it
A concrete monotone but non-Lipschitz mapping on which the first proposed method fails to converge to a solution.
Figures
read the original abstract
We introduce two derivative-free projection methods for large-scale systems of nonlinear monotone equations subject to convex constraints. Both methods incorporate an adaptive spectral parameter into established conjugate gradient frameworks: the first generalizes the modified optimal Perry method via an eigenvalue-optimized scaling matrix, and the second generalizes the Hager--Zhang-type conjugate gradient projection method via a spectral Dai--Liao parameter. The resulting search directions satisfy a sufficient descent condition independent of the line search. For the first method, we establish global convergence under monotonicity alone, without requiring Lipschitz continuity of the mapping. For the second, global convergence holds under the standard monotonicity and Lipschitz continuity assumptions. Numerical experiments on 18 test problems across dimensions up to 120{,}000, together with applications to $\ell_1$-regularized signal recovery and regularized logistic regression, confirm the practical effectiveness of the proposed approach.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents two derivative-free projection methods for large-scale systems of nonlinear monotone equations subject to convex constraints. The first generalizes the modified optimal Perry conjugate gradient method via an eigenvalue-optimized scaling matrix, while the second generalizes the Hager-Zhang method via a spectral Dai-Liao parameter. Both methods produce search directions satisfying a sufficient descent condition independent of the line search. Global convergence is established for the first method under monotonicity alone (without Lipschitz continuity) and for the second under monotonicity plus Lipschitz continuity. Numerical results on 18 test problems (dimensions up to 120,000) and applications to ℓ1-regularized signal recovery and regularized logistic regression are reported to demonstrate effectiveness.
Significance. If the global convergence result under monotonicity alone holds, the work would advance derivative-free methods for monotone equations by removing the need for a Lipschitz constant, which is often unavailable or prohibitive in large-scale settings. The line-search-independent sufficient descent property is a technically useful feature. The scale of the numerical tests and the inclusion of real applications add practical value, though the overall significance depends on resolving the well-definedness of the algorithm under the stated hypotheses.
major comments (1)
- [Assumptions and line-search procedure] Assumptions section (and line-search description): The claim of global convergence for the first method under 'monotonicity alone, without requiring Lipschitz continuity' is load-bearing for the central contribution, yet the line search (find α>0 s.t. ⟨F(x_k + α d_k), d_k⟩ ≥ σ ⟨F(x_k), d_k⟩, σ∈(0,1)) is not guaranteed to terminate. With g(α)=⟨F(x_k + α d_k), d_k⟩ and g(0)<0 by the sufficient-descent property, existence of a suitable α_k typically invokes continuity of g (hence of F) and the intermediate-value theorem. Monotonicity alone does not imply continuity of F: R^n→R^n, so the algorithm may not be well-defined on the hypothesis class stated in the abstract and assumptions. The manuscript should either add continuity of F to the assumptions or provide an alternative argument that avoids it.
minor comments (2)
- [Abstract] Abstract: the dimension '120{,}000' uses nonstandard comma formatting that appears to be a LaTeX artifact and should be rendered as 120000 or 120,000 consistently.
- [Introduction or Section 3] The manuscript would benefit from a short table comparing the two methods' spectral parameters, descent constants, and convergence assumptions for quick reference.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments on our manuscript. We appreciate the identification of the issue concerning the well-definedness of the algorithm. We respond to the major comment below.
read point-by-point responses
-
Referee: Assumptions section (and line-search description): The claim of global convergence for the first method under 'monotonicity alone, without requiring Lipschitz continuity' is load-bearing for the central contribution, yet the line search (find α>0 s.t. ⟨F(x_k + α d_k), d_k⟩ ≥ σ ⟨F(x_k), d_k⟩, σ∈(0,1)) is not guaranteed to terminate. With g(α)=⟨F(x_k + α d_k), d_k⟩ and g(0)<0 by the sufficient-descent property, existence of a suitable α_k typically invokes continuity of g (hence of F) and the intermediate-value theorem. Monotonicity alone does not imply continuity of F: R^n→R^n, so the algorithm may not be well-defined on the hypothesis class stated in the abstract and assumptions. The manuscript should either add continuity of F to the assumptions or provide an alternative argument that avoids it.
Authors: We thank the referee for highlighting this crucial point. We concur that continuity of F is necessary to ensure that g(α) is continuous, thereby allowing the application of the intermediate value theorem to confirm the existence of a suitable α > 0 satisfying the line search condition. Since monotonicity alone does not guarantee continuity, we will update the assumptions section to explicitly state that F is continuous. This revision ensures the algorithm is rigorously well-defined under the hypotheses, while the global convergence for the first method still holds under monotonicity and continuity without Lipschitz continuity. We do not see a way to avoid this assumption while maintaining the current line search framework. revision: yes
Circularity Check
No significant circularity; derivation is self-contained
full rationale
The paper constructs two new spectral CG projection methods for monotone equations, derives search directions that satisfy a sufficient descent condition independently of the line search, and proves global convergence for the first method directly from monotonicity (without Lipschitz continuity) using standard projection and descent arguments. No step reduces a claimed result to a fitted parameter, self-citation chain, or definitional tautology; the sufficient-descent property and convergence analysis are established from the problem assumptions and algorithm construction without importing uniqueness theorems or ansatzes from prior self-work. The central claims therefore remain independent of the inputs.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption The nonlinear mapping is monotone on the feasible set.
- domain assumption The feasible set is convex and closed.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
For the first method, we establish global convergence under monotonicity alone, without requiring Lipschitz continuity of the mapping.
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The resulting search directions satisfy a sufficient descent condition independent of the line search.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
A new inexact alternating directions method for monotone variational in- equalities
Bingsheng He et al. “A new inexact alternating directions method for monotone variational in- equalities”. In:Mathematical Programming92 (2002), pp. 103–118
work page 2002
-
[2]
An extragradient algorithm for monotone variational inequalities
Yu V Malitsky and VV3276035 Semenov. “An extragradient algorithm for monotone variational inequalities”. In:Cybernetics and Systems Analysis50.2 (2014), pp. 271–277
work page 2014
-
[3]
Yun-Bin Zhao and Duan Li. “Monotonicity of fixed point and normal mappings associated with variational inequality and its application”. In:SIAM Journal on Optimization11.4 (2001), pp. 962– 973
work page 2001
-
[4]
A methodology for solving chemical equilibrium sys- tems
Keith Meintjes and Alexander P Morgan. “A methodology for solving chemical equilibrium sys- tems”. In:Applied Mathematics and Computation22.4 (1987), pp. 333–361
work page 1987
-
[5]
Calculation of complex chemical equilibria
Frank J Zeleznik and Sanford Gordon. “Calculation of complex chemical equilibria”. In:Industrial & Engineering Chemistry60.6 (1968), pp. 27–57
work page 1968
-
[6]
M´ ario AT Figueiredo, Robert D Nowak, and Stephen J Wright. “Gradient projection for sparse reconstruction: Application to compressed sensing and other inverse problems”. In:IEEE Journal of selected topics in signal processing1.4 (2007), pp. 586–597. 16
work page 2007
-
[7]
A note on the spectral gradient projection method for nonlinear monotone equations with applications
Auwal Bala Abubakar, Poom Kumam, and Hassan Mohammad. “A note on the spectral gradient projection method for nonlinear monotone equations with applications”. In:Computational and Applied Mathematics39.2 (2020), p. 129
work page 2020
-
[8]
A Class of Relaxed-Inertial Derivative-Free Projection Method beyond Monotonicity with Application
Abdulkarim Hassan Ibrahim, Mohammed Alshahrani, and Suliman Al-Homidan. “A Class of Relaxed-Inertial Derivative-Free Projection Method beyond Monotonicity with Application”. In: Arabian Journal of Mathematics14.1 (Apr. 1, 2025), pp. 53–84.issn: 2193-5351.doi:10.1007/ s40065-024-00491-y
work page 2025
-
[9]
Learning understandable neural networks with nonnegative weight constraints
Jan Chorowski and Jacek M Zurada. “Learning understandable neural networks with nonnegative weight constraints”. In:IEEE transactions on neural networks and learning systems26.1 (2014), pp. 62–69
work page 2014
-
[10]
Su-Su Chen and Bo Tian. “Gramian solutions and soliton interactions for a generalized (3+ 1)- dimensional variable-coefficient Kadomtsev–Petviashvili equation in a plasma or fluid”. In:Pro- ceedings of the Royal Society A475.2228 (2019), p. 20190122
work page 2019
-
[11]
Xin-Yi Gao. “Mathematical view with observational/experimental consideration on certain (2+ 1)-dimensional waves in the cosmic/laboratory dusty plasmas”. In:Applied Mathematics Letters 91 (2019), pp. 165–172
work page 2019
-
[12]
Jorge Nocedal and Stephen J Wright.Numerical optimization. Springer, 1999
work page 1999
-
[13]
Newton-type methods with generalized distances for constrained optimization
N Alfredo Iusem and V Michael Solodov. “Newton-type methods with generalized distances for constrained optimization”. In:Optimization41.3 (1997), pp. 257–278
work page 1997
-
[14]
Function minimization by conjugate gradients
Reeves Fletcher and Colin M Reeves. “Function minimization by conjugate gradients”. In:The computer journal7.2 (1964), pp. 149–154
work page 1964
-
[15]
The conjugate gradient method in extremal problems
Boris Teodorovich Polyak. “The conjugate gradient method in extremal problems”. In:USSR Computational Mathematics and Mathematical Physics9.4 (1969), pp. 94–112
work page 1969
-
[16]
Magnus Rudolph Hestenes, Eduard Stiefel, et al.Methods of conjugate gradients for solving linear systems. Vol. 49. 1. NBS Washington, DC, 1952
work page 1952
-
[17]
A nonlinear conjugate gradient method with a strong global convergence property
Yu-Hong Dai and Yaxiang Yuan. “A nonlinear conjugate gradient method with a strong global convergence property”. In:SIAM Journal on optimization10.1 (1999), pp. 177–182
work page 1999
-
[18]
A PRP type method for systems of monotone equations
Wanyou Cheng. “A PRP type method for systems of monotone equations”. In:Mathematical and Computer Modelling50.1-2 (2009), pp. 15–20
work page 2009
-
[19]
A class of derivative-free methods for large-scale nonlinear monotone equations
Qingna Li and Dong-Hui Li. “A class of derivative-free methods for large-scale nonlinear monotone equations”. In:IMA Journal of Numerical Analysis31.4 (2011), pp. 1625–1635
work page 2011
-
[20]
A projection method for convex constrained monotone nonlinear equations with applications
JK Liu and SJ Li. “A projection method for convex constrained monotone nonlinear equations with applications”. In:Computers & Mathematics with Applications70.10 (2015), pp. 2442–2453
work page 2015
-
[21]
Yunhai Xiao and Hong Zhu. “A conjugate gradient method to solve convex constrained monotone equations with applications in compressive sensing”. In:Journal of Mathematical Analysis and Applications405.1 (2013), pp. 310–319
work page 2013
-
[22]
Spectral gradient projection method for solving nonlinear monotone equations
Li Zhang and Weijun Zhou. “Spectral gradient projection method for solving nonlinear monotone equations”. In:Journal of Computational and Applied Mathematics196.2 (2006), pp. 478–484
work page 2006
-
[23]
Two-point step size gradient methods
Jonathan Barzilai and Jonathan M Borwein. “Two-point step size gradient methods”. In:IMA journal of numerical analysis8.1 (1988), pp. 141–148
work page 1988
-
[24]
Nonmonotone spectral projected gradient methods on convex sets
Ernesto G Birgin, Jos´ eMario Mart´ ınez, and Marcos Raydan. “Nonmonotone spectral projected gradient methods on convex sets”. In:SIAM Journal on Optimization10.4 (2000), pp. 1196–1211. doi:10.1137/S1052623497330963
-
[25]
Auwal Bala Abubakar et al. “New hybrid three-term spectral-conjugate gradient method for find- ing solutions of nonlinear monotone operator equations with applications”. In:Mathematics and Computers in Simulation201 (2022), pp. 670–683.doi:10.1016/j.matcom.2021.07.005
-
[26]
A globally convergent inexact Newton method for sys- tems of monotone equations
Michael V Solodov and Benav F Svaiter. “A globally convergent inexact Newton method for sys- tems of monotone equations”. In:Reformulation: Nonsmooth, piecewise smooth, semismooth and smoothing methods(1999), pp. 355–369
work page 1999
-
[27]
Spectral gradient projection method for monotone nonlinear equations with convex constraints
Zhensheng Yu et al. “Spectral gradient projection method for monotone nonlinear equations with convex constraints”. In:Applied numerical mathematics59.10 (2009), pp. 2416–2423. 17
work page 2009
-
[28]
XY Wang, SJ Li, and XP Kou. “A self-adaptive three-term conjugate gradient method for monotone nonlinear equations with convex constraints”. In:Calcolo53 (2016), pp. 133–145
work page 2016
-
[29]
Peiting Gao and Chuanjiang He. “An efficient three-term conjugate gradient method for nonlinear monotone equations with convex constraints”. In:Calcolo55.4 (2018), p. 53
work page 2018
-
[30]
Li Zheng, Lei Yang, and Yong Liang. “A modified spectral gradient projection method for solving non-linear monotone equations with convex constraints and its application”. In:IEEE Access8 (2020), pp. 92677–92686
work page 2020
-
[31]
Some descent three-term conjugate gradient methods and their global convergence
Li Zhang, Weijun Zhou, and Donghui Li. “Some descent three-term conjugate gradient methods and their global convergence”. In:Optimisation Methods and Software22.4 (2007), pp. 697–711
work page 2007
-
[32]
On three-term conjugate gradient algorithms for unconstrained optimization
Neculai Andrei. “On three-term conjugate gradient algorithms for unconstrained optimization”. In:Applied Mathematics and Computation219.11 (2013), pp. 6316–6327
work page 2013
-
[33]
Yasushi Narushima, Hiroshi Yabe, and John A Ford. “A three-term conjugate gradient method with sufficient descent property for unconstrained optimization”. In:SIAM Journal on Optimization21.1 (2011), pp. 212–230
work page 2011
-
[34]
A conjugate gradient projection method for solving equations with convex constraints
Li Zheng, Lei Yang, and Yong Liang. “A conjugate gradient projection method for solving equations with convex constraints”. In:Journal of Computational and Applied Mathematics375 (2020), p. 112781
work page 2020
-
[35]
Abdulkarim Hassan Ibrahim, Mohammed Alshahrani, and Suliman Al-Homidan. “Two classes of spectral three-term derivative-free method for solving nonlinear equations with application”. In: Numerical Algorithms(2023), pp. 1–21
work page 2023
-
[36]
Pengjie Liu et al. “A three-term CGPM-based algorithm without Lipschitz continuity for con- strained nonlinear monotone equations with applications”. In:Applied Numerical Mathematics 175 (2022), pp. 98–107
work page 2022
-
[37]
A Robbins-Monro type algorithm for computing global min- imizer of generalized conic functions
Pengjie Liu. “A three-term derivative-free projection method with BFGS-like update and its accel- erated variant”. In:Optimization(2025). Published online 07 Jul 2025.doi:10.1080/02331934. 2025.2526727
-
[38]
Mohammed Yusuf Waziri, Yusuf Muhammad Kufena, and Abubakar Sani Halilu. “Double direction three-term spectral conjugate gradient method for solving symmetric nonlinear equations”. In: Results in Control and Optimization6 (2022), p. 100072.doi:10.1016/j.rico.2021.100072
-
[39]
Descent Perry conjugate gradient methods for systems of monotone nonlinear equations
Mohammed Yusuf Waziri, Kabiru Ahmed Hungu, and Jamilu Sabi’u. “Descent Perry conjugate gradient methods for systems of monotone nonlinear equations”. In:Numerical Algorithms85 (2020), pp. 763–785.doi:10.1007/s11075-019-00836-1
-
[40]
Jamilu Sabi’u et al. “Modified optimal Perry conjugate gradient method for solving system of monotone equations with applications”. In:Applied Numerical Mathematics184 (2023), pp. 431– 445
work page 2023
-
[41]
A new conjugate gradient method with guaranteed descent and an efficient line search
William W Hager and Hongchao Zhang. “A new conjugate gradient method with guaranteed descent and an efficient line search”. In:SIAM Journal on optimization16.1 (2005), pp. 170–192
work page 2005
-
[42]
A modified conjugate gradient algorithm
Avinoam Perry. “A modified conjugate gradient algorithm”. In:Operations Research26.6 (1978), pp. 1073–1078
work page 1978
-
[43]
Another improved Wei–Yao–Liu nonlinear conjugate gradient method with sufficient descent property
Zhifeng Dai and Fenghua Wen. “Another improved Wei–Yao–Liu nonlinear conjugate gradient method with sufficient descent property”. In:Applied Mathematics and Computation218.14 (2012), pp. 7421–7430
work page 2012
-
[44]
Two optimal Hager-Zhang conju- gate gradient methods for solving monotone nonlinear equations
Jamilu Sabi’u, Abdullah Shah, and Mohammed Yusuf Waziri. “Two optimal Hager-Zhang conju- gate gradient methods for solving monotone nonlinear equations”. In:Applied Numerical Mathe- matics153 (2020), pp. 217–233
work page 2020
-
[45]
William La Cruz, Jos´ e Mart´ ınez, and Marcos Raydan. “Spectral residual method without gradient information for solving large-scale nonlinear systems of equations”. In:Mathematics of computation 75.255 (2006), pp. 1429–1448
work page 2006
-
[46]
Taiyong Song and Zexian Liu. “An efficient inertial subspace minimization CG algorithm with convergence rate analysis for constrained nonlinear monotone equations”. In:Journal of Compu- tational and Applied Mathematics446 (2024), p. 115873. 18
work page 2024
-
[47]
Abdulkarim Hassan Ibrahim and Suliman Al-Homidan. “Two-step inertial derivative-free projec- tion method for solving nonlinear equations with application”. In:Journal of Computational and Applied Mathematics(2024), p. 116071
work page 2024
-
[48]
Two descent Dai-Yuan conjugate gradient methods for systems of monotone nonlinear equations
Mohammed Yusuf Waziri and Kabiru Ahmed. “Two descent Dai-Yuan conjugate gradient methods for systems of monotone nonlinear equations”. In:Journal of Scientific Computing90 (2022), pp. 1– 53
work page 2022
-
[49]
Limited memory BFGS method for nonlinear monotone equations
Weijun Zhou and Donghui Li. “Limited memory BFGS method for nonlinear monotone equations”. In:Journal of Computational Mathematics(2007), pp. 89–96
work page 2007
-
[50]
Scaled three-term derivative-free methods for solving large-scale nonlinear monotone equations
Qun Li and Bing Zheng. “Scaled three-term derivative-free methods for solving large-scale nonlinear monotone equations”. In:Numerical Algorithms87.3 (2021), pp. 1343–1367
work page 2021
-
[51]
Benchmarking optimization software with performance profiles
Elizabeth D Dolan and Jorge J Mor´ e. “Benchmarking optimization software with performance profiles”. In:Mathematical programming91 (2002), pp. 201–213
work page 2002
-
[52]
Signal recovery by proximal forward-backward split- ting
Patrick L Combettes and Val´ erie R Wajs. “Signal recovery by proximal forward-backward split- ting”. In:Multiscale modeling & simulation4.4 (2005), pp. 1168–1200
work page 2005
- [53]
-
[54]
Non-smooth equations based method for L1-norm problems with applications to compressed sensing
Yunhai Xiao, Qiuyu Wang, and Qingjie Hu. “Non-smooth equations based method for L1-norm problems with applications to compressed sensing”. In:Nonlinear Analysis: Theory, Methods & Applications74.11 (2011), pp. 3570–3577
work page 2011
-
[55]
LIBSVM: A library for support vector machines
Chih-Chung Chang and Chih-Jen Lin. “LIBSVM: A library for support vector machines”. In: ACM Transactions on Intelligent Systems and Technology2.3 (2011), 27:1–27:27.doi:10.1145/ 1961189.1961199
-
[56]
Jinbao Jian et al. “A family of inertial derivative-free projection methods for constrained nonlinear pseudo-monotone equations with applications”. In:Computational and Applied Mathematics41 (2022), p. 309.doi:10.1007/s40314-022-02019-6. 19
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.