Extending Linear Convergence of the Proximal Point Algorithm: The Quasar-Convex Case
Pith reviewed 2026-05-18 18:53 UTC · model grok-4.3
The pith
The proximal point algorithm generates a minimizing sequence with O(ε^{-1}) rate for quasar-convex functions and linear convergence under strong quasar-convexity.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
This work investigates the properties of the proximity operator for quasar-convex functions and establishes the convergence of the proximal point algorithm to a global minimizer with a particular focus on its convergence rate. In particular, we demonstrate: (i) the generated sequence is minimizing and achieves an O(ε^{-1}) complexity rate for quasar-convex functions; (ii) under strong quasar-convexity, the sequence converges linearly and attains an O(ln(ε^{-1})) complexity rate. These results extend known convergence rates from the (strongly) convex to the (strongly) quasar-convex setting.
What carries the argument
The proximity operator of a quasar-convex function, which satisfies the monotonicity-type inequalities required to extend the standard convex convergence arguments of the proximal point method.
If this is right
- The proximal point sequence converges to a global minimizer for any quasar-convex objective.
- An O(ε^{-1}) iteration bound holds for reaching an ε-minimizer when the objective is quasar-convex.
- Under strong quasar-convexity the iterates converge linearly to the minimizer.
- The iteration complexity improves to O(ln(ε^{-1})) when strong quasar-convexity holds.
- The same rates apply to the special case of star-convex functions, some of which appear novel even in that setting.
Where Pith is reading between the lines
- Quasar-convexity offers a practical modeling assumption for certain non-convex problems that still admit global minimization via proximal methods.
- Similar rate extensions may hold for other proximal-type algorithms when applied to quasar-convex objectives.
- The numerical experiments indicate that the predicted rates are observable on concrete quasar-convex instances.
- The framework could guide analysis of first-order methods on loss landscapes that satisfy quasar-convexity but violate convexity.
Load-bearing premise
The proximity operator of a quasar-convex function satisfies the key inequalities needed to carry over the standard convex convergence arguments.
What would settle it
A concrete quasar-convex function for which the proximal point iterates either fail to approach the global minimizer or require more than O(ε^{-1}) steps to reach ε accuracy.
Figures
read the original abstract
This work investigates the properties of the proximity operator for quasar-convex functions and establishes the convergence of the proximal point algorithm to a global minimizer with a particular focus on its convergence rate. In particular, we demonstrate: (i) the generated sequence is mi\-ni\-mi\-zing and achieves an $\mathcal{O}(\varepsilon^{-1})$ complexity rate for quasar-convex functions; (ii) under strong quasar-convexity, the sequence converges linearly and attains an $\mathcal{O}(\ln(\varepsilon^{-1}))$ complexity rate. These results extend known convergence rates from the (strongly) convex to the (strongly) quasar-convex setting. To the best of our knowledge, some findings are novel even for the special case of (strongly) star-convex functions. Numerical experiments corroborate our theoretical results.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper investigates properties of the proximity operator for quasar-convex functions and establishes convergence rates for the proximal point algorithm. It shows that the generated sequence is minimizing and achieves an O(ε^{-1}) complexity rate for quasar-convex functions; under strong quasar-convexity the sequence converges linearly with O(ln(ε^{-1})) complexity. These extend known convex rates to the quasar-convex setting, with some results novel even for star-convex functions. Numerical experiments are included to corroborate the theory.
Significance. If the derivations hold, the work meaningfully extends the applicability of proximal-point methods to a wider class of non-convex problems that includes star-convex functions. Credit is due for the explicit construction of the quasar-convex analogue of the subdifferential inequality (Section 3) and the uniform treatment that avoids circularity or post-hoc parameter fitting, yielding clean, falsifiable rates.
minor comments (3)
- [Abstract] Abstract: the phrase 'some findings are novel even for the special case of (strongly) star-convex functions' is left unspecified; a brief parenthetical identifying the novel statements would improve readability.
- [Section 3] Section 3: the key proximal inequality is derived cleanly, but a short remark comparing the quasar parameter μ=1 case directly to the classical convex subdifferential inequality would help readers verify the reduction.
- Notation: the symbol for the quasar-convexity parameter is introduced without an explicit forward reference to its range; adding 'μ ∈ (0,1]' at first use would prevent minor confusion.
Simulated Author's Rebuttal
We thank the referee for their positive summary of our work on the proximal point algorithm for quasar-convex functions and for recommending minor revision. The assessment that the results extend convex rates and include novel aspects even for star-convex functions is appreciated. As the report lists no specific major comments, we have no point-by-point rebuttals to provide.
Circularity Check
No significant circularity detected
full rationale
The paper establishes the required proximal inequalities for quasar-convex functions directly from the function definition in Section 3, then carries over the standard convex distance-decrease argument for the proximal point algorithm without any reduction to fitted inputs, self-citations, or ansatzes. The O(ε^{-1}) and linear rates follow from these explicit derivations rather than by construction from prior results or data fits. The analysis for the star-convex special case is handled uniformly within the same framework, rendering the derivation self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Quasar-convex functions admit a proximity operator whose iterates remain minimizing and satisfy the stated complexity bounds.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
h(x*) − h(x) ≤ (1/κβ)⟨x* − z, x − x*⟩ − (γ/2)‖x* − x‖² (Proposition 15)
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
O(ε⁻¹) complexity for quasar-convex, O(ln ε⁻¹) for strongly quasar-convex (Theorems 18, 20, Propositions 23, 25)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 2 Pith papers
-
Quasar-Convex Optimization: Fundamental Properties and High-Order Proximal-Point Methods
Quasar-convex functions admit high-order proximal algorithms with linear convergence for p=2 and superlinear for p>2 under suitable conditions.
-
Robust Learning Meets Quasar-Convex Optimization: Inexact High-Order Proximal-Point Methods
Robust learning problems are formulated as quasar-convex optimization, and HiPPA is proposed as an inexact high-order proximal method with global and superlinear convergence guarantees.
Reference graph
Works this paper leans on
-
[1]
M. Ahookhosh, A. Iusem, A. Kabgani, F. Lara , Asymptotic conver- gence analysis of high-order proximal-point methods beyond sublinear rates, arXiv: 2505.20484v1 , (2025)
-
[2]
N.T. An, N.M. Nam , Convergence analysis of a proximal point algorithm for minimizing differences of functions, Optimization, 66, 129–147, (2017)
work page 2017
-
[3]
K.J. Arrow, B.H. Chenery, B.S. Minhas, R.M. Solow, Capital-labor substitution and economic efficiency, Review Econ. Stat. , 43(3), 225–250, (1961)
work page 1961
-
[4]
H. Attouch, J. Bolte, B.F. Svaiter , Convergence of descent me- thods for semi-algebraic and tame problems: proximal algorithms, forward– backward splitting, and regularized Gauss–Seidel methods. Math. Program. 137, 91–129, (2013)
work page 2013
-
[5]
Asymptotic Cones and Functions in Opti- mization and Variational Inequalities
A. Auslender, M. Teboulle. “Asymptotic Cones and Functions in Opti- mization and Variational Inequalities”. Springer-Verlag, New York, (2003)
work page 2003
-
[6]
M. Avriel, W.E. Diewert, S. Schaible, I. Zang . “Generalized Con- cavity”. SIAM, Classics in Applied Mathematics, Philadelphia, (2010)
work page 2010
-
[7]
S. Banert, R.I. Bot ¸, A general double-proximal gradient algorithm for d.c. programming, Math. Programm., 178, 301–326, (2019)
work page 2019
-
[8]
Convex Analysis and Monotone Ope- rators Theory in Hilbert Spaces
H.H. Bauschke, P.L. Combettes. “Convex Analysis and Monotone Ope- rators Theory in Hilbert Spaces”. CMS Books in Mathematics . Springer- Verlag, second edition, (2017)
work page 2017
-
[9]
Generalized Convexity and Optimization
A. Cambini, L. Martein . “Generalized Convexity and Optimization”. Springer-Verlag, Berlin-Heidelberg, (2009)
work page 2009
-
[10]
Z. Chen, Y. Lu, H. Wang, Y. Liu, T. Li, Quantum Langevin dynamics for optimization, Commun. Math. Phys. , 406, 52, (2025)
work page 2025
-
[11]
Optimization and Nonsmooth Analysis
F.H. Clarke . “Optimization and Nonsmooth Analysis”. SIAM, New York, (1990). 23
work page 1990
- [12]
-
[13]
P.L. Combettes, T. Pennanen, Proximal methods for cohypomonotone operators, SIAM J. Control Optim. , 43, 731–742, (2004)
work page 2004
- [14]
-
[15]
S.-M. Grad, F. Lara, Solving mixed variational inequalities beyond con- vexity. J. Optim. Theory Appl. , 190, 565–580, (2021)
work page 2021
-
[16]
S.-M. Grad, F. Lara , An extension of the proximal point algorithm beyond convexity, J. Global Optim. , 82, 313–329, (2022)
work page 2022
-
[17]
S.-M. Grad, F. Lara, R.T. Marcavillaca , Relaxed-inertial proximal point type algorithms for quasiconvex minimization, J. Global Optim. , 85, 615–635, (2023)
work page 2023
-
[18]
S.-M. Grad, F. Lara, R.T. Marcavillaca, Strongly quasiconvex func- tions: what we know (so far), J. Optim. Theory Appl., 205, paper 38, (2025)
work page 2025
-
[19]
G¨uler, On the convergence of the proximal point algorithm for convex minimization, SIAM J
O. G¨uler, On the convergence of the proximal point algorithm for convex minimization, SIAM J. Control Optim. , 29(2), 403–419, (1991)
work page 1991
-
[20]
S. Guminov, A. Gasnikov, I. Kuruzov , Accelerated methods for weakly-quasi-convex optimization problems, Comput. Manag. Sci. 20, Arti- cle number: 36, (2023)
work page 2023
-
[21]
Handbook of Generalized Convexity and Generalized Monotonicity
N. Hadjisavvas, S. Komlosi, S. Schaible . “Handbook of Generalized Convexity and Generalized Monotonicity”. Springer-Verlag, Boston, (2005)
work page 2005
-
[22]
N. Hadjisavvas, F. Lara, R.T. Marcavillaca, P.T. Vuong , Heavy ball and Nesterov accelerations with Hessian-driven damping for nonconvex optimization, arXiv: 2506.15632 , (2025)
- [23]
-
[24]
J. Hermant, J.-F. Aujol, C. Dossal, A. Rondepierre , Study of the behaviour of Nesterov accelerated gradient in a non convex setting: the strongly quasar convex case, hal-04589853, (2024)
work page 2024
- [25]
- [26]
-
[27]
Newton-type methods for optimization and variational problems
A.F. Izmailov, M.V. Solodov. “Newton-type methods for optimization and variational problems”. I Cham: Springer. (2014)
work page 2014
-
[28]
A. Kabgani, F. Lara , Strong subdifferentials: theory and applications in nonconvex optimization, J. Global Optim. , 84, 349–368, (2022)
work page 2022
- [29]
-
[30]
Lara, On strongly quasiconvex functions: existence results and proxi- mal point algorithms, J
F. Lara, On strongly quasiconvex functions: existence results and proxi- mal point algorithms, J. Optim. Theory Appl. , 192, 891–911, (2022)
work page 2022
- [31]
- [32]
-
[33]
The structure of American economy, 1919-1939: an em- pirical application of equilibrium analysis
W.W. Leontief. “The structure of American economy, 1919-1939: an em- pirical application of equilibrium analysis”. Cambridge, MA: Harvard Uni- versity Press, (1941)
work page 1919
-
[34]
A. Lowy, J. Ullman, S.J. Wright , How to make the gradients small privately: improved rates for differentially private non-convex optimization, ICML-24, article number 1335, 32904–32923, (2024)
work page 2024
-
[35]
Z. Lu, Iterative reweighted minimization methods forℓp regularized uncon- strained nonlinear programming, Math. Programm., 147, 277–307, (2014)
work page 2014
-
[36]
Martinet, Regularisation d’inequations variationelles par approxima- tions successives, Rev
B. Martinet, Regularisation d’inequations variationelles par approxima- tions successives, Rev. Francaise Inf. Rech. Oper., 154–159, (1970)
work page 1970
-
[37]
Martinet, Determination approch´ ee d’un point fixe d’une application pseudo-contractante, C.R
B. Martinet, Determination approch´ ee d’un point fixe d’une application pseudo-contractante, C.R. Acad. Sci. Paris , 274, 163–165, (1972)
work page 1972
-
[38]
A. Mas-Colell, M.D. Whinston, J.R. Green . “Microeconomic The- ory”. Oxford: Oxford University Press, (1995)
work page 1995
-
[39]
D. McFadden, Constant elasticity of substitution production functions, The Review of Economic Studies, 30, no. 2, pp. 73–83, (1963)
work page 1963
- [40]
-
[41]
I. Necoara, Y. Nesterov, F. Glineur, Linear convergence of first order methods for non-strongly convex optimization, Math. Programm., 175, 69– 107, (2019)
work page 2019
-
[42]
Y. Nesterov, B.T. Polyak, Cubic regularization of newton method and its global performance, Math. Programm., 108, 177–205, (2006). 25
work page 2006
- [43]
-
[44]
T. Pennanen , Local convergence of the proximal point algorithm and multiplier methods without monotonicity, Math. Oper. Res. , 27, 170–191, (2002)
work page 2002
-
[45]
B.T. Polyak , Existence theorems and convergence of minimizing se- quences in extremum problems with restrictions, Soviet Math. , 7, 72–75, (1966)
work page 1966
-
[46]
Rockafellar, Monotone operators and proximal point algorithms, SIAM J
R.T. Rockafellar, Monotone operators and proximal point algorithms, SIAM J. Control Optim. , 14, 877–898, (1976)
work page 1976
-
[47]
W.Y. Sun, R.J.B. Sampaio, M.A.B. Candido , Proximal point algo- rithm for minimization of DC function,J. Comp. Math., 21, 452–462, (2003)
work page 2003
-
[48]
M. Ulbrich . “Semismooth Newton methods for variational inequalities and constrained optimization problems in function spaces”. MOS-SIAM Se- ries on Optimization. (2011)
work page 2011
-
[49]
A.A. Vladimirov, Ju.E. Nesterov, Ju.N. Chekanov , O ravnomerno kvazivypuklyh funkcionalah [On uniformly quasiconvex functionals], Vestn. Mosk. un-ta, vycis. mat. i kibern. , 4, 18–27, (1978)
work page 1978
- [50]
-
[51]
F. Wen, L. Chu, R.C. Qiu , A nonsmooth version of Newton’s method, Math. Programm., 58(1), 353–367, (1993)
work page 1993
-
[52]
F. Wen, L. Chu, R.C. Qiu , A survey on nonconvex regularization-based sparse and low-rank recovery in signal processing, statistics, and machine learning, IEEE Access, 6, 69883–69906, (2018)
work page 2018
-
[53]
Y. Xu, A. Zeevi, Towards optimal problem dependent generalization error bounds in statistical learning theory, Math. Oper. Res., 50, 40–67, (2025). 26
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.