Gradient projection and conditional gradient methods for constrained nonconvex minimization

Andrey Tremba; Boris Polyak; Maxim Balashov

arxiv: 1906.11580 · v1 · pith:EQ4EYDFEnew · submitted 2019-06-27 · 🧮 math.OC

Gradient projection and conditional gradient methods for constrained nonconvex minimization

Maxim Balashov , Boris Polyak , Andrey Tremba This is my paper

Pith reviewed 2026-05-25 14:57 UTC · model grok-4.3

classification 🧮 math.OC

keywords gradient projectionconditional gradientmanifold optimizationnonconvex minimizationlinear convergenceLojasiewicz conditionFrank-Wolfe method

0 comments

The pith

Gradient projection on manifolds achieves global linear convergence when the objective meets the Lezanski-Polyak-Lojasiewicz condition.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops versions of the gradient projection algorithm suited to minimizing smooth functions on smooth manifolds such as spheres. It shows that this algorithm converges globally to a stationary point at a linear rate once the objective satisfies the Lezanski-Polyak-Lojasiewicz condition on the manifold. The authors also study the conditional gradient (Frank-Wolfe) algorithm and give conditions under which its full-step version converges globally with a linear rate. These guarantees apply under minimal natural assumptions on the problem data and supply convergence rates for two classical first-order methods on nonconvex constrained problems.

Core claim

Under the Lezanski-Polyak-Lojasiewicz condition on the manifold the gradient projection algorithm converges globally to a stationary point with linear rate; certain additional conditions likewise guarantee global linear convergence for the full-step conditional gradient method.

What carries the argument

The Lezanski-Polyak-Lojasiewicz condition on a manifold, which supplies a lower bound relating the gradient norm to the suboptimality gap.

If this is right

Gradient projection becomes a globally convergent method with an explicit linear rate for manifold-constrained problems obeying the condition.
The conditional gradient method can achieve global linear convergence without explicit projections under the stated conditions.
Problems such as optimization over the sphere gain reliable first-order solvers once the objective meets the manifold LPŁ inequality.
Linear rates allow practical stopping criteria based on observed function-value decrease.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same LPŁ condition could be checked numerically on concrete manifold problems to predict whether linear convergence will be observed in practice.
Rate comparisons between gradient projection and conditional gradient on identical instances would clarify when one method is preferable.
Extensions to other retraction-based methods on manifolds might follow if the LPŁ inequality can be verified after retraction.

Load-bearing premise

The objective function satisfies the Lezanski-Polyak-Lojasiewicz condition on the manifold.

What would settle it

A concrete smooth function on the unit sphere for which the gradient projection iterates fail to approach a stationary point at a linear rate while all other stated assumptions hold.

Figures

Figures reproduced from arXiv: 1906.11580 by Andrey Tremba, Boris Polyak, Maxim Balashov.

**Figure 2.** Figure 2: Stationary point (0, 0) is not a solution. 5. Appendix Proposition 1. Suppose that the set Q ⊂ R n is proximally smooth with constant R > 0, the function f has the Lipschitz continuous gradient f 0 with constant L1 > 0 and the point x0 is a local minimum in the problem minQ f. Then −f 0 (x0) ∈ N(Q, x0). Proof. If f 0 (x0) = 0 then −f 0 (x0) = 0 ∈ N(Q, x0) by the definition of normal cone. Assume that f 0 (… view at source ↗

**Figure 3.** Figure 3: Angle ϕ. Hence f(x) − f(x0) ≤ −kf 0 (x0)kε(p, q) + L1 2 ε 2 ≤ −kf 0 (x0)kεC + L1 2 ε 2 < 0 for sufficiently small ε > 0. A contradiction. Acknowledgements The work was supported by Russian Science Foundation (Project 16-11-10015). References [1] P.-A. Absil, R. Mahony, R. Sepulchre, Optimization Algorithms on Matrix Manifolds, Princeton University Press, Princeton and Oxford, 2008. [2] M. V. Balashov, Uni… view at source ↗

read the original abstract

Minimization of a smooth function on a sphere or, more generally, on a smooth manifold, is the simplest non-convex optimization problem. It has a lot of applications. Our goal is to propose a version of the gradient projection algorithm for its solution and to obtain results that guarantee convergence of the algorithm under some minimal natural assumptions. We use the Lezanski-Polyak-Lojasiewicz condition on a manifold to prove the global linear convergence of the algorithm. Another method well fitted for the problem is the conditional gradient (Frank-Wolfe) algorithm. We examine some conditions which guarantee global convergence of full-step version of the method with linear rate.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Extends gradient projection and full-step Frank-Wolfe to manifolds with linear rates under the manifold LPL condition, but the incremental novelty over prior Polyak-style results is not obvious from the abstract.

read the letter

The paper shows global linear convergence for gradient projection on a smooth manifold when the objective meets the manifold version of the Lezanski-Polyak-Lojasiewicz condition. It gives a similar analysis for the full-step conditional gradient method under additional conditions that guarantee the linear rate. Both results follow directly from the assumption without extra fitting or hidden steps, which keeps the argument clean. The proofs appear to adapt the classical Euclidean arguments to the manifold geometry in a standard way. The stress-test note confirms no internal contradictions or unjustified limit swaps in the setup. The main limitation is that everything rests on the LPL condition holding on the manifold. That assumption is strong, and the paper does not claim it holds for arbitrary smooth functions, so the linear rate is conditional by design. The abstract also gives no explicit comparison table or citation breakdown against earlier manifold or nonconvex work, which makes it hard to judge how much is genuinely new versus a re-derivation. This is a paper for people already working on nonconvex optimization with manifold constraints, such as sphere or equality-constrained problems. A reader who uses these methods and accepts the LPL condition will get concrete rates they can cite. The thinking is coherent on its own terms, so the manuscript is worth sending to a referee who can check the full derivations and the literature placement. I would not desk-reject it.

Referee Report

0 major / 2 minor

Summary. The manuscript proposes a gradient-projection algorithm for minimizing a smooth function on a smooth manifold (e.g., the sphere) and proves its global linear convergence under the Lezanski-Polyak-Lojasiewicz condition adapted to the manifold. It additionally examines conditions guaranteeing global linear convergence of the full-step conditional-gradient (Frank-Wolfe) method.

Significance. If the stated proofs hold, the work supplies explicit linear-rate guarantees for two standard first-order methods on manifolds under a single, clearly identified geometric assumption (the manifold LPL inequality). This conditional structure is a strength: the paper does not claim the rates for arbitrary smooth objectives. The result is a modest but useful extension of classical LPL-based analysis to the constrained manifold setting.

minor comments (2)

[Abstract] Abstract: the spelling 'Lezanski-Polyak-Lojasiewicz' should be aligned with the conventional 'Łojasiewicz-Polyak-Łojasiewicz' (or LPL) throughout the manuscript for consistency with the literature.
[Introduction / Main results] The manuscript asserts that 'proofs exist' for the linear rates; the full derivations, error-bound constants, and handling of the tangent-space projection should be cross-checked against the manifold geometry in the main text to ensure the constants are explicit.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the careful reading and positive assessment of the manuscript, including the recognition of its conditional linear-rate guarantees under the manifold LPL inequality. The recommendation for minor revision is appreciated.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper establishes convergence of gradient-projection and conditional-gradient methods on manifolds under the explicit hypothesis that the objective satisfies the Lezanski-Polyak-Lojasiewicz inequality on the manifold. Linear rates are derived directly from this stated assumption; the proofs are conditional by design and do not reduce any claimed rate to a fitted parameter, a renamed input, or a self-citation chain. No enumerated circularity pattern appears in the derivation architecture.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The abstract invokes standard domain assumptions from smooth optimization; no free parameters, invented entities, or ad-hoc axioms are mentioned.

axioms (2)

domain assumption The objective function is smooth on the manifold.
Required for gradients to exist and for the projection step to be well-defined.
domain assumption The constraint set is a smooth manifold.
Stated explicitly as the setting for both algorithms.

pith-pipeline@v0.9.0 · 5633 in / 1191 out tokens · 37522 ms · 2026-05-25T14:57:35.851353+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We use the Lezanski-Polyak-Lojasiewicz condition on a manifold to prove the global linear convergence of the algorithm.
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Theorem 1 … for any function f … with the Lipschitz continuous gradient and for any proximally smooth set Q … converges to the set of stationary points

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

37 extracted references · 37 canonical work pages · 1 internal anchor

[1]

Absil, R

P.-A. Absil, R. Mahony, R. Sepulchre, Optimization Algorithms on Matrix Manifolds, Prince- ton University Press, Princeton and Oxford, 2008

work page 2008
[2]

M. V. Balashov, Uniformly convex subsets of the Hilbert space with modulus of convexity of the second order, Journal of Math. Anal. Appl., 377:2 (2011), 754–761

work page 2011
[3]

M. V. Balashov, Maximization of a function with Lipschitz continuous gradient, Journal of Mathematical Sciences, 209:1, (2015), 12–18

work page 2015
[4]

M. V. Balashov, About the gradient projection algorithm for a strongly convex function and a proximally smooth set, Journal of Convex Analysis, 24:2, (2017), 493–500

work page 2017
[5]

M. V. Balashov, E. S. Polovinkin, M-strongly convex subsets and their generating sets, Sbornik: Mathematics, 191:1 (2000), 25–60

work page 2000
[6]

M. V. Balashov, G. E. Ivanov, Weakly convex and proximally smooth sets in Banach spaces, Izv. RAN. Ser. Mat., 73:3 (2009), 23–66

work page 2009
[7]

Bertsekas, Mathematical Programming, Athena-Publishing, 2013

D. Bertsekas, Mathematical Programming, Athena-Publishing, 2013

work page 2013
[8]

S. Boyd, L. Vanderberghe, Convex Optimization, Cambridge University Press, 2004

work page 2004
[9]

F. H. Clarke, Yu. S. Ledyaev, R. J. Stern, P. R. Wolenski, Nonsmooth Analysis and Control Theory, Springer-Verlag New-York Inc., 1998

work page 1998
[10]

F. H. Clarke, R. J. Stern, P. R. Wolenski, Proximal smoothness and lower– C2 property, J. Convex Anal., 2:1-2 (1995), 117–144

work page 1995
[11]

A. R. Conn, N. I. M. Gould, Ph. L. Toint, Trust-Region Methods, SIAM, Philadelphia, 2000

work page 2000
[12]

Fraikin, Yu

C. Fraikin, Yu. Nesterov, P. Van Dooren, A gradient-type algorithm optimizing the coupling between matrices, Linear Algebra and its Applications, 429 (2008), 1229–1242

work page 2008
[13]

O. P. Ferreira, A. N. Iusem, and S. Z. N´ emeth, Concepts and techniques of optimization on the sphere, TOP, 22:3 (2014), 1148–1170

work page 2014
[14]

R. M. Freund, P. Grigas, New analysis and results for the Frank-Wolfe method, Mathem. Progr., 155:1-2 (2016), 199–230. GRADIENT METHODS 23

work page 2016
[15]

Frank, Ph

M. Frank, Ph. Wolfe, Algorithm for quadratic programming, Nav. Res. Log. Quart., 3:1-2 (1956), 95–110

work page 1956
[16]

W. W. Hager, Minimizing a quadratic over a sphere, SIAM J. Optim. and Contr., 12:1 (2001), 188–208

work page 2001
[17]

B. Gao, X. Liu, X. Chen, Ya. Yuan, On the Lojasiewicz exponent of the quadratic sphere constrained optimization problem, (2016), arXiv:1611.08781v2

work page arXiv 2016
[18]

A. A. Goldstein, Convex programming in Hilbert space, Bull. Amer. Math. Soc., 70:5 (1964), 709-710

work page 1964
[19]

Karimi, J

H. Karimi, J. Nutini, M. Schmidt, Linear convergence of gradient and proximal-gradient methods under the Polyak-Lojasiewicz condition. In: Frasconi P., Landwehr N., Manco G., Vreeken J. (eds) Machine Learning and Knowledge Discovery in Databases. Lecture Notes in Computer Science, vol 9851. Springer, 2016

work page 2016
[20]

L. V. Kantorovich, Functional analysis and applied mathematics, Russ. Math. Surveys, 3:6(28) (1948), 89–185

work page 1948
[21]

A. N. Kolmogorov, S. V. Fomin, Elements of the Theory of Functions and Functional Analysis, Dover Publications, 1999

work page 1999
[22]

E. S. Levitin, B. T. Polyak, Constrained minimization methods, USSR Comp. Math. and Math. Phys., 6:5 (1966), 787–823

work page 1966
[23]

Leˇ zanski, ¨Uber das Minimumproblem von Funktionalen in Banachschen R¨ aumen, Bull

T. Leˇ zanski, ¨Uber das Minimumproblem von Funktionalen in Banachschen R¨ aumen, Bull. Acad. Pol. Sci., S´ er. Sci. Math. Astron. Phys., 10 (1962), 107–110

work page 1962
[24]

Leˇ zanski,¨Uber das Minimumproblem f¨ ur Funktionale in Banachschen R¨ aumen, Mathe- matische Annalen, 152 (1963), 271–274

T. Leˇ zanski,¨Uber das Minimumproblem f¨ ur Funktionale in Banachschen R¨ aumen, Mathe- matische Annalen, 152 (1963), 271–274

work page 1963
[25]

D. G. Luenberger, The gradient projection methods along geodesics, Management Science, 18:11 (1972), 620–631

work page 1972
[26]

Convergence Rate of Frank-Wolfe for Non-Convex Objectives

S. Lacoste-Julien, Convergence rate of Frank-Wolfe for non-convex objectives, (2016), arXiv:1607.00345

work page internal anchor Pith review Pith/arXiv arXiv 2016
[27]

Lojasiewicz, Une propriet e topologique des sous-ensembles analytiques reels, Les Equations aux Derivees Partielles, CNRS, Paris, (1963), 87–89

S. Lojasiewicz, Une propriet e topologique des sous-ensembles analytiques reels, Les Equations aux Derivees Partielles, CNRS, Paris, (1963), 87–89

work page 1963
[28]

da Cruz Neto, L

J.X. da Cruz Neto, L. L. De Lima, P. R. Oliveira, Geodesic algorithms on Riemannian manifolds, Balkan J. of Geom. and its Appl., 3:2 (1998), 89–100

work page 1998
[29]

Nesterov, Lectures on Convex Optimization, Springer, 2018

Yu. Nesterov, Lectures on Convex Optimization, Springer, 2018

work page 2018
[30]

Nesterov, A

Yu. Nesterov, A. Nemirovski Interior-point Polynomial Algorithms in Convex Programming, SIAM, 1994

work page 1994
[31]

B. T. Polyak, Introduction to Optimization. New York, Optimization Software, 1987

work page 1987
[32]

B. T. Polyak, Local programming, USSR Computational Mathematics and Mathematical Phys. 41:9 (2001), 1259–1266

work page 2001
[33]

B. T. Polyak, Gradient Methods for the minimization of functionals, USSR Comp. Math. and Math. Phys., 3:4 (1963), 643–653

work page 1963
[34]

B. T. Polyak, Existence theorems and convergence of minimizing sequences for extremal problems with constraints, Soviet Math. Dokl., 1966, 7, 72–75

work page 1966
[35]

Udri¸ ste, Convex Functions and Optimization Methods on Riemannian Manifolds, Mathe- matics and Its Applications series, vol

C. Udri¸ ste, Convex Functions and Optimization Methods on Riemannian Manifolds, Mathe- matics and Its Applications series, vol. 297, Springer, 1994

work page 1994
[36]

Vial, Strong and weak convexity of sets and functions, Mathematics of Operations Research, 8:2 (1983), 231–259

J.-Ph. Vial, Strong and weak convexity of sets and functions, Mathematics of Operations Research, 8:2 (1983), 231–259

work page 1983
[37]

Weber, S

M. Weber, S. Sra, Riemannian Frank-Wolfe with application to the geometric mean of positive deﬁnite matrices, (2018), arXiv:1710.10770v2 V. A. Trapeznikov Institute of Control Sciences of Russian Academy of Sciences, 65 Profsoyuznaya street, Moscow 117997, Russia. E-mail address: balashov73@mail.ru, boris@ipu.ru, atremba@ipu.ru

work page arXiv 2018

[1] [1]

Absil, R

P.-A. Absil, R. Mahony, R. Sepulchre, Optimization Algorithms on Matrix Manifolds, Prince- ton University Press, Princeton and Oxford, 2008

work page 2008

[2] [2]

M. V. Balashov, Uniformly convex subsets of the Hilbert space with modulus of convexity of the second order, Journal of Math. Anal. Appl., 377:2 (2011), 754–761

work page 2011

[3] [3]

M. V. Balashov, Maximization of a function with Lipschitz continuous gradient, Journal of Mathematical Sciences, 209:1, (2015), 12–18

work page 2015

[4] [4]

M. V. Balashov, About the gradient projection algorithm for a strongly convex function and a proximally smooth set, Journal of Convex Analysis, 24:2, (2017), 493–500

work page 2017

[5] [5]

M. V. Balashov, E. S. Polovinkin, M-strongly convex subsets and their generating sets, Sbornik: Mathematics, 191:1 (2000), 25–60

work page 2000

[6] [6]

M. V. Balashov, G. E. Ivanov, Weakly convex and proximally smooth sets in Banach spaces, Izv. RAN. Ser. Mat., 73:3 (2009), 23–66

work page 2009

[7] [7]

Bertsekas, Mathematical Programming, Athena-Publishing, 2013

D. Bertsekas, Mathematical Programming, Athena-Publishing, 2013

work page 2013

[8] [8]

S. Boyd, L. Vanderberghe, Convex Optimization, Cambridge University Press, 2004

work page 2004

[9] [9]

F. H. Clarke, Yu. S. Ledyaev, R. J. Stern, P. R. Wolenski, Nonsmooth Analysis and Control Theory, Springer-Verlag New-York Inc., 1998

work page 1998

[10] [10]

F. H. Clarke, R. J. Stern, P. R. Wolenski, Proximal smoothness and lower– C2 property, J. Convex Anal., 2:1-2 (1995), 117–144

work page 1995

[11] [11]

A. R. Conn, N. I. M. Gould, Ph. L. Toint, Trust-Region Methods, SIAM, Philadelphia, 2000

work page 2000

[12] [12]

Fraikin, Yu

C. Fraikin, Yu. Nesterov, P. Van Dooren, A gradient-type algorithm optimizing the coupling between matrices, Linear Algebra and its Applications, 429 (2008), 1229–1242

work page 2008

[13] [13]

O. P. Ferreira, A. N. Iusem, and S. Z. N´ emeth, Concepts and techniques of optimization on the sphere, TOP, 22:3 (2014), 1148–1170

work page 2014

[14] [14]

R. M. Freund, P. Grigas, New analysis and results for the Frank-Wolfe method, Mathem. Progr., 155:1-2 (2016), 199–230. GRADIENT METHODS 23

work page 2016

[15] [15]

Frank, Ph

M. Frank, Ph. Wolfe, Algorithm for quadratic programming, Nav. Res. Log. Quart., 3:1-2 (1956), 95–110

work page 1956

[16] [16]

W. W. Hager, Minimizing a quadratic over a sphere, SIAM J. Optim. and Contr., 12:1 (2001), 188–208

work page 2001

[17] [17]

B. Gao, X. Liu, X. Chen, Ya. Yuan, On the Lojasiewicz exponent of the quadratic sphere constrained optimization problem, (2016), arXiv:1611.08781v2

work page arXiv 2016

[18] [18]

A. A. Goldstein, Convex programming in Hilbert space, Bull. Amer. Math. Soc., 70:5 (1964), 709-710

work page 1964

[19] [19]

Karimi, J

H. Karimi, J. Nutini, M. Schmidt, Linear convergence of gradient and proximal-gradient methods under the Polyak-Lojasiewicz condition. In: Frasconi P., Landwehr N., Manco G., Vreeken J. (eds) Machine Learning and Knowledge Discovery in Databases. Lecture Notes in Computer Science, vol 9851. Springer, 2016

work page 2016

[20] [20]

L. V. Kantorovich, Functional analysis and applied mathematics, Russ. Math. Surveys, 3:6(28) (1948), 89–185

work page 1948

[21] [21]

A. N. Kolmogorov, S. V. Fomin, Elements of the Theory of Functions and Functional Analysis, Dover Publications, 1999

work page 1999

[22] [22]

E. S. Levitin, B. T. Polyak, Constrained minimization methods, USSR Comp. Math. and Math. Phys., 6:5 (1966), 787–823

work page 1966

[23] [23]

Leˇ zanski, ¨Uber das Minimumproblem von Funktionalen in Banachschen R¨ aumen, Bull

T. Leˇ zanski, ¨Uber das Minimumproblem von Funktionalen in Banachschen R¨ aumen, Bull. Acad. Pol. Sci., S´ er. Sci. Math. Astron. Phys., 10 (1962), 107–110

work page 1962

[24] [24]

Leˇ zanski,¨Uber das Minimumproblem f¨ ur Funktionale in Banachschen R¨ aumen, Mathe- matische Annalen, 152 (1963), 271–274

T. Leˇ zanski,¨Uber das Minimumproblem f¨ ur Funktionale in Banachschen R¨ aumen, Mathe- matische Annalen, 152 (1963), 271–274

work page 1963

[25] [25]

D. G. Luenberger, The gradient projection methods along geodesics, Management Science, 18:11 (1972), 620–631

work page 1972

[26] [26]

Convergence Rate of Frank-Wolfe for Non-Convex Objectives

S. Lacoste-Julien, Convergence rate of Frank-Wolfe for non-convex objectives, (2016), arXiv:1607.00345

work page internal anchor Pith review Pith/arXiv arXiv 2016

[27] [27]

Lojasiewicz, Une propriet e topologique des sous-ensembles analytiques reels, Les Equations aux Derivees Partielles, CNRS, Paris, (1963), 87–89

S. Lojasiewicz, Une propriet e topologique des sous-ensembles analytiques reels, Les Equations aux Derivees Partielles, CNRS, Paris, (1963), 87–89

work page 1963

[28] [28]

da Cruz Neto, L

J.X. da Cruz Neto, L. L. De Lima, P. R. Oliveira, Geodesic algorithms on Riemannian manifolds, Balkan J. of Geom. and its Appl., 3:2 (1998), 89–100

work page 1998

[29] [29]

Nesterov, Lectures on Convex Optimization, Springer, 2018

Yu. Nesterov, Lectures on Convex Optimization, Springer, 2018

work page 2018

[30] [30]

Nesterov, A

Yu. Nesterov, A. Nemirovski Interior-point Polynomial Algorithms in Convex Programming, SIAM, 1994

work page 1994

[31] [31]

B. T. Polyak, Introduction to Optimization. New York, Optimization Software, 1987

work page 1987

[32] [32]

B. T. Polyak, Local programming, USSR Computational Mathematics and Mathematical Phys. 41:9 (2001), 1259–1266

work page 2001

[33] [33]

B. T. Polyak, Gradient Methods for the minimization of functionals, USSR Comp. Math. and Math. Phys., 3:4 (1963), 643–653

work page 1963

[34] [34]

B. T. Polyak, Existence theorems and convergence of minimizing sequences for extremal problems with constraints, Soviet Math. Dokl., 1966, 7, 72–75

work page 1966

[35] [35]

Udri¸ ste, Convex Functions and Optimization Methods on Riemannian Manifolds, Mathe- matics and Its Applications series, vol

C. Udri¸ ste, Convex Functions and Optimization Methods on Riemannian Manifolds, Mathe- matics and Its Applications series, vol. 297, Springer, 1994

work page 1994

[36] [36]

Vial, Strong and weak convexity of sets and functions, Mathematics of Operations Research, 8:2 (1983), 231–259

J.-Ph. Vial, Strong and weak convexity of sets and functions, Mathematics of Operations Research, 8:2 (1983), 231–259

work page 1983

[37] [37]

Weber, S

M. Weber, S. Sra, Riemannian Frank-Wolfe with application to the geometric mean of positive deﬁnite matrices, (2018), arXiv:1710.10770v2 V. A. Trapeznikov Institute of Control Sciences of Russian Academy of Sciences, 65 Profsoyuznaya street, Moscow 117997, Russia. E-mail address: balashov73@mail.ru, boris@ipu.ru, atremba@ipu.ru

work page arXiv 2018