Weak convergence of the stochastic proximal point method in metric spaces

Nicholas Pischke

arxiv: 2605.20805 · v1 · pith:ZTSH7UPWnew · submitted 2026-05-20 · 🧮 math.OC

Weak convergence of the stochastic proximal point method in metric spaces

Nicholas Pischke This is my paper

Pith reviewed 2026-05-21 04:02 UTC · model grok-4.3

classification 🧮 math.OC

keywords weak convergencestochastic proximal pointHadamard spacesconvex optimizationmetric spacesquasi-Fejer monotonicityintegral functionalsnonpositive curvature

0 comments

The pith

Stochastic proximal point method converges weakly almost surely in Hadamard spaces

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that a stochastic proximal point method for minimizing a convex integral function produces iterates that converge weakly almost surely in complete geodesic metric spaces of nonpositive curvature. This holds under a mild growth condition on the objective that generalizes Lipschitz continuity and is used both to define the iterates and to prove convergence of their average values. The argument combines an existing result on stochastic processes satisfying a stochastic form of quasi-Fejér monotonicity with a new step showing almost sure convergence of mean function values to the infimum. A reader would care because the result applies in general nonlinear geometries without local compactness or stronger regularity assumptions that limited earlier work.

Core claim

The paper proves the almost sure weak convergence of a stochastic proximal point method for minimizing a convex integral function in complete geodesic metric spaces of nonpositive curvature. The method is formulated using a mild growth condition on the function that generalizes Lipschitz continuity. The proof relies on a weak almost sure convergence theorem for stochastic processes in these spaces that satisfy a stochastic variant of quasi-Fejér monotonicity together with a new argument establishing almost sure convergence of the mean function values to the minimal value.

What carries the argument

The stochastic proximal point iteration in Hadamard spaces under a mild growth condition on the objective, carried by stochastic quasi-Fejér monotonicity of the process.

If this is right

The iterates converge weakly almost surely to a minimizer of the convex integral function.
The mean values of the objective function converge almost surely to the minimal value.
The result applies directly in general Hadamard spaces without requiring local compactness.
The technique extends prior convergence theorems for stochastic processes in these spaces.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same mild growth condition and quasi-Fejér arguments may apply to other stochastic first-order methods in non-Euclidean metric spaces.
The framework could support optimization tasks on data structures naturally modeled by hyperbolic or tree-like geometries.
Additional assumptions on the function might yield explicit convergence rates under the same geometric setting.

Load-bearing premise

The objective function satisfies a mild growth condition that generalizes Lipschitz continuity to allow formulation of the method and to secure convergence of mean function values.

What would settle it

A concrete convex integral function on a non-locally compact Hadamard space that meets the mild growth condition but for which the generated sequence fails to converge weakly almost surely.

read the original abstract

We prove the almost sure weak convergence of a stochastic proximal point method for minimizing a convex integral function in the general nonlinear context of complete geodesic metric spaces of nonpositive curvature (so-called Hadamard spaces), solving a problem of M. Ba\v{c}\'ak. This method, formulated in the context of a mild growth condition on the function which generalizes Lipschitz continuity, was previously only considered in the context of strong metric regularity conditions or in the context of locally compact spaces. The proof is a combination of a weak almost sure convergence theorem for stochastic processes in Hadamard spaces which confine to a stochastic variant of quasi-Fej\'er monotonicity, due to previous work of the author, together with a new argument for proving the almost sure convergence of the mean function values of the process towards the minimal value.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper relaxes the assumptions for almost sure weak convergence of the stochastic proximal point method to a mild growth condition in general Hadamard spaces and claims to solve Bačák's open problem.

read the letter

This paper establishes almost sure weak convergence for the stochastic proximal point method when minimizing a convex integral function in complete geodesic metric spaces of nonpositive curvature. It does so under a mild growth condition rather than the stronger assumptions used before. The new element is an argument showing that the mean function values converge almost surely to the infimum. This combines with a prior result on stochastic quasi-Fejer monotonicity from the same author to get the overall convergence. The approach solves the specific open problem posed by Bačák. What works well is the relaxation of assumptions to something that generalizes Lipschitz continuity. This broadens the applicability to more general Hadamard spaces, including those that are not locally compact. A soft spot is the dependence on the earlier quasi-Fejer theorem. That makes the current paper less self-contained, and any gaps in the prior work would carry over. The stress-test note about whether the growth condition suffices for the mean-value step without tacit compactness is worth checking. If the proof relies on something not implied by the stated condition, the claimed relaxation would not fully hold. The math appears formally grounded based on the structure described, though without the full details it's hard to spot every step. This paper is aimed at researchers in optimization over metric spaces, especially those interested in stochastic methods and convergence in nonlinear settings. A reader familiar with proximal point algorithms and Hadamard space geometry would get the most out of it. It deserves a serious referee because it targets an open problem and introduces a new technical argument for the mean convergence. I recommend sending this to peer review. Referees should focus on validating the new mean-value argument against the growth condition in non-locally compact cases.

Referee Report

2 major / 2 minor

Summary. The paper proves almost sure weak convergence of the stochastic proximal point method for minimizing a convex integral functional over complete geodesic metric spaces of nonpositive curvature (Hadamard spaces). The argument combines a prior stochastic quasi-Fejér monotonicity theorem with a new mean-value step showing that E[f(X_n)] converges almost surely to the infimum of the objective, under a mild growth condition on the integrand that generalizes Lipschitz continuity. This is presented as resolving an open question of Bačák by removing the need for local compactness or strong metric regularity.

Significance. If the new mean-value argument is valid without tacit compactness assumptions, the result would meaningfully extend stochastic proximal methods to general Hadamard spaces. The clean modular structure—invoking the author's earlier quasi-Fejér theorem and adding a targeted convergence-of-means step—offers a reusable template for other stochastic algorithms in nonlinear metric settings. The mild growth condition is a natural and verifiable relaxation of prior hypotheses.

major comments (2)

[Proof of the mean-value convergence (new argument combining quasi-Fejér with expectation control)] The new argument for almost sure convergence of the mean values E[f(X_n)] to inf f (described in the abstract and proof outline) invokes the mild growth condition both to define the stochastic proximal mapping and to control the mean-value step. In non-locally compact Hadamard spaces this step risks failure if recession directions of the integrand are not uniformly controlled; the manuscript should explicitly verify that no hidden local-compactness or uniform-integrability property is used, or supply a counter-example space where the growth condition alone is insufficient.
[Invocation of the prior stochastic quasi-Fejér theorem] The central weak-convergence claim depends on the stochastic quasi-Fejér theorem from the author's prior work. The manuscript should include a self-contained verification that all hypotheses of that theorem (e.g., the specific form of the stochastic perturbation and the growth condition) are satisfied by the proximal-point iteration defined here.

minor comments (2)

[Abstract and introduction] A direct citation to the specific open problem posed by Bačák (including the reference) would help readers locate the exact statement being solved.
[Preliminaries] Notation for the stochastic proximal mapping and the mild growth condition should be introduced with a displayed definition before its first use in the main theorems.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment below and indicate planned revisions to strengthen the presentation.

read point-by-point responses

Referee: [Proof of the mean-value convergence (new argument combining quasi-Fejér with expectation control)] The new argument for almost sure convergence of the mean values E[f(X_n)] to inf f (described in the abstract and proof outline) invokes the mild growth condition both to define the stochastic proximal mapping and to control the mean-value step. In non-locally compact Hadamard spaces this step risks failure if recession directions of the integrand are not uniformly controlled; the manuscript should explicitly verify that no hidden local-compactness or uniform-integrability property is used, or supply a counter-example space where the growth condition alone is insufficient.

Authors: We appreciate the referee's concern about potential tacit assumptions in non-locally compact settings. The mild growth condition is formulated precisely to control the expectations and recession behavior of the integrand via convexity, ensuring the mean-value step proceeds without local compactness or additional uniform-integrability hypotheses. In the revised manuscript we will insert a short clarifying paragraph (or lemma) immediately after the statement of the growth condition that explicitly confirms the argument uses only the given hypotheses and the geodesic properties of Hadamard spaces. We therefore see no need for a counter-example. revision: yes
Referee: [Invocation of the prior stochastic quasi-Fejér theorem] The central weak-convergence claim depends on the stochastic quasi-Fejér theorem from the author's prior work. The manuscript should include a self-contained verification that all hypotheses of that theorem (e.g., the specific form of the stochastic perturbation and the growth condition) are satisfied by the proximal-point iteration defined here.

Authors: We agree that a self-contained verification improves readability and rigor. In the revised version we will add a dedicated subsection (or short appendix) that systematically checks every hypothesis of the stochastic quasi-Fejér monotonicity theorem against the stochastic proximal-point iteration, including the precise form of the perturbation and the compatibility of the growth condition. revision: yes

Circularity Check

1 steps flagged

Central weak convergence theorem rests on author's prior stochastic quasi-Fejér result

specific steps

self citation load bearing [Abstract]
"The proof is a combination of a weak almost sure convergence theorem for stochastic processes in Hadamard spaces which confine to a stochastic variant of quasi-Fejér monotonicity, due to previous work of the author, together with a new argument for proving the almost sure convergence of the mean function values of the process towards the minimal value."

The central almost sure weak convergence result is obtained by invoking the author's earlier theorem on stochastic quasi-Fejér monotonicity; that prior result is not re-proved or externally benchmarked here, so the new paper's main theorem reduces in part to self-citation rather than standing fully on independent external verification.

full rationale

The paper's proof explicitly combines a new argument for almost sure convergence of mean function values with a weak almost sure convergence theorem for stochastic quasi-Fejér processes taken from the author's previous work. This self-citation is load-bearing for the main claim but is not the sole content; the new mean-value step provides independent material. No other circular patterns (self-definitional fits, ansatz smuggling, or renaming) appear in the provided derivation outline. The result therefore receives a moderate circularity score rather than a high one.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The proof rests on standard properties of Hadamard spaces and a domain-specific growth condition; no free parameters or new entities are introduced.

axioms (2)

standard math Hadamard spaces are complete geodesic metric spaces with nonpositive curvature
Defines the ambient space in which the proximal point iteration and weak convergence are studied.
domain assumption The convex integral function satisfies a mild growth condition generalizing Lipschitz continuity
Required to formulate the stochastic proximal mapping and to prove convergence of the mean function values.

pith-pipeline@v0.9.0 · 5658 in / 1289 out tokens · 35894 ms · 2026-05-21T04:02:46.279232+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Lemma 3.1 (stochastic quasi-Fejér monotonicity) and Lemma 3.4 (stochastic recursive inequality for mean values) together with growth condition (A2)
IndisputableMonolith/Foundation/AlexanderDuality.lean alexander_duality_circle_linking unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

weak convergence in separable Hadamard spaces under (A1)+(A2)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

41 extracted references · 41 canonical work pages · 2 internal anchors

[1]

Alber, A.N

Ya.I. Alber, A.N. Iusem, and M.V. Solodov. On the projected subgradient method for nonsmooth convex optimization in a Hilbert space.Mathematical Programming, 81:23–35, 1998

work page 1998
[2]

Aleksandrov

A.D. Aleksandrov. A theorem on triangles in a metric space and some of its applications.Trudy Matem- aticheskogo Instituta imeni V.A. Steklova, 38:5–23, 1951

work page 1951
[3]

Alexander, V

S. Alexander, V. Kapovitch, and A. Petrunin.Alexandrov Geometry: Foundations, volume 236 ofGraduate Studies in Mathematics. American Mathematical Society, Providence, RI, 2024

work page 2024
[4]

Asi and J.C

H. Asi and J.C. Duchi. Stochastic (approximate) proximal point methods: Convergence, optimality, and adaptivity.SIAM Journal on Optimization, 29(3):2257–2290, 2019

work page 2019
[5]

Aubin and H

J.-P. Aubin and H. Frankowska.Set-Valued Analysis. Springer, New York, 2009

work page 2009
[6]

Baˇ c´ ak

M. Baˇ c´ ak. The proximal point algorithm in metric spaces.Israel Journal of Mathematics, 194(2):689–701, 2013

work page 2013
[7]

Baˇ c´ ak

M. Baˇ c´ ak. Computing medians and means in Hadamard spaces.SIAM Journal of Optimization, 24(3):1542– 1566, 2014

work page 2014
[8]

Baˇ c´ ak.Convex analysis and optimization in Hadamard spaces, volume 22 ofDe Gruyter Series in Nonlinear Analysis and Applications

M. Baˇ c´ ak.Convex analysis and optimization in Hadamard spaces, volume 22 ofDe Gruyter Series in Nonlinear Analysis and Applications. Walter de Gruyter GmbH, Berlin/Boston, 2014

work page 2014
[9]

Baˇ c´ ak

M. Baˇ c´ ak. A variational approach to stochastic minimization of convex functionals.Pure and Applied Functional Analysis, 3(2):287–295, 2018

work page 2018
[10]

Baˇ c´ ak

M. Baˇ c´ ak. Old and new challenges in Hadamard spaces.Japanese Journal of Mathematics, 18(2):115–168, 2023

work page 2023
[11]

Baˇ c´ ak, I

M. Baˇ c´ ak, I. Searston, and B. Sims. Alternating projections in CAT(0) spaces.Journal of Mathematical Analysis and Applications, 385:599–607, 2012

work page 2012
[12]

Bertsekas

D.P. Bertsekas. Incremental proximal methods for large scale convex optimization.Mathematical Program- ming. Series B, 129:163–195, 2011

work page 2011
[13]

Bertsekas

D.P. Bertsekas. Incremental gradient, subgradient, and proximal methods for convex optimization: A sur- vey. In S. Sra, S. Nowozin, and S.J. Wright, editors,Optimization for Machine Learning, Neural Information Processing Series, pages 85–120. The MIT Press, Cambridge, Massachusetts, 2012

work page 2012
[14]

Billera, S.P

L.J. Billera, S.P. Holmes, and K. Vogtmann. Geometry of the space of phylogenetic trees.Advances in Applied Mathematics, 27(4):733–767, 2001

work page 2001
[15]

Br´ ezis and P.L

H. Br´ ezis and P.L. Lions. Produits infinis de resolvantes.Israel Journal of Mathematics, 29(4):329–345, 1978

work page 1978
[16]

Bridson and A

M.R. Bridson and A. Haefliger.Metric Spaces of Non-Positive Curvature, volume 319 ofGrundlehren der mathematischen Wissenschaften. Springer Berlin, Heidelberg, 1999

work page 1999
[17]

Bruhat and J

F. Bruhat and J. Tits. Groupes r´ eductifs sur un corps local. I. Donn´ ees radicielles valu´ ees.Publications Math´ ematiques de l’Institut des Hautes ´Etudes Scientifiques, 41:5–251, 1972

work page 1972
[18]

Castaing and M

C. Castaing and M. Valadier.Convex Analysis and Measurable Multifunctions, volume 580 ofLecture Notes in Mathematics. Springer Berlin, Heidelberg, 1977

work page 1977
[19]

Combettes and J.C

P.L. Combettes and J.C. Pesquet. Stochastic quasi-Fej´ er block-coordinate fixed point iterations with ran- dom sweeping.SIAM Journal on Optimization, 25(2):1221–1248, 2015

work page 2015
[20]

Dhompongsa, W.A

S. Dhompongsa, W.A. Kirk, and B. Sims. Fixed points of uniformly Lipschitzian mappings.Nonlinear Analysis. Theory, Methods & Applications, 65:762–772, 2006

work page 2006
[21]

Goodwin, A.S

A. Goodwin, A.S. Lewis, G. L´ opez-Acedo, and A. Nicolae. Stochastic and incremental subgradient methods for convex optimization on Hadamard spaces.Mathematical Programming, 2026. To appear

work page 2026
[22]

M. Gromov. Hyperbolic groups. In S.M. Gersten, editor,Essays in group theory, volume 8 ofMathematical Sciences Research Institute Publications, pages 75–263. Springer, New York, 1987

work page 1987
[23]

O. G¨ uler. On the convergence of the proximal point algorithm for convex minimization.SIAM Journal on Control and Optimization, 29:403—-419, 1991

work page 1991
[24]

J. Jost. Equilibrium maps between metric spaces.Calculus of Variations and Partial Differential Equations, 2:173–204, 1994

work page 1994
[25]

J. Jost. Convex functionals and generalized harmonic maps into spaces of nonpositive curvature.Commen- tarii Mathematici Helvetici, 70:659–673, 1995

work page 1995
[26]

Kirk and B

W.A. Kirk and B. Panyanak. A concept of convergence in geodesic spaces.Nonlinear Analysis. Theory, Methods & Applications, 68:3689–3696, 2008

work page 2008
[27]

Klenke.Probability Theory: A Comprehensive Course

A. Klenke.Probability Theory: A Comprehensive Course. Universitext. Springer Cham, 3rd edition, 2020. 12 N. PISCHKE

work page 2020
[28]

Martinet

B. Martinet. R´ egularisation din´ equations variationnelles par approximations successives.Revue fran¸ caise d’informatique et de recherche op´ erationnelle, 4:154–159, 1970

work page 1970
[29]

U. Mayer. Gradient flows on nonpositively curved metric spaces and harmonic maps.Communications in Analysis and Geometry, 6:199–253, 1998

work page 1998
[30]

Nemirovski, A

A. Nemirovski, A. Juditsky, G. Lan, and A. Shapiro. Robust stochastic approximation approach to sto- chastic programming.SIAM Journal of Optimization, 19:1574–1609, 2009

work page 2009
[31]

M. Neri, N. Pischke, and T. Powell. An abstract effective convergence theorem for stochastic processes, with applications to stochastic approximation, 2026. Preprint,https://arxiv.org/abs/2504.12922

work page internal anchor Pith review Pith/arXiv arXiv 2026
[32]

Neri and T

M. Neri and T. Powell. On quantitative convergence for stochastic processes: Crossings, fluctuations and martingales.Transactions of the American Mathematical Society, Series B, 12:974–1019, 2025

work page 2025
[33]

Neri and T

M. Neri and T. Powell. A quantitative Robbins-Siegmund theorem.The Annals of Applied Probability, 36(1):636–651, 2026

work page 2026
[34]

B.J. Pettis. On integration in vector spaces.Transactions of the American Mathematical Society, 44:277– 304, 1938

work page 1938
[35]

N. Pischke. On Busemann subgradient methods for stochastic minimization in Hadamard spaces, 2026. Preprint,https://arxiv.org/abs/2602.08127

work page arXiv 2026
[36]

Convergence guarantees for stochastic algorithms solving non-unique problems in metric spaces

N. Pischke and T. Powell. Convergence guarantees for stochastic algorithms solving non-unique problems in metric spaces, 2026. Preprint,https://arxiv.org/abs/2605.06129

work page internal anchor Pith review Pith/arXiv arXiv 2026
[37]

Rockafellar

R.T. Rockafellar. Convex integral functionals and duality. In E.H. Zarantonello, editor,Contributions to Nonlinear Functional Analysis, pages 215–236. Academic Press, New York, 1971

work page 1971
[38]

Rockafellar

R.T. Rockafellar. Monotone operators and the proximal point algorithm.SIAM Journal of Control and Optimization, 14:877–898, 1976

work page 1976
[39]

Ryu and S

E.K. Ryu and S. Boyd. Stochastic Proximal Iteration: A Non-Asymptotic Improvement upon Stochastic Gradient Descent. working draft, accessed 2026,https://ernestryu.com/papers/spi.pdf

work page 2026
[40]

Williams.Probability with martingales

D. Williams.Probability with martingales. Cambridge University Press, 1991

work page 1991
[41]

Zhang and S

H. Zhang and S. Sra. First-order methods for geodesically convex optimization. In V. Feldman, A. Rakhlin, and O. Shamir, editors,Proceedings of the 29th Annual Conference on Learning Theory (COLT), volume 49 ofProceedings of Machine Learning Research, pages 1617–1638. PMLR, 2016

work page 2016

[1] [1]

Alber, A.N

Ya.I. Alber, A.N. Iusem, and M.V. Solodov. On the projected subgradient method for nonsmooth convex optimization in a Hilbert space.Mathematical Programming, 81:23–35, 1998

work page 1998

[2] [2]

Aleksandrov

A.D. Aleksandrov. A theorem on triangles in a metric space and some of its applications.Trudy Matem- aticheskogo Instituta imeni V.A. Steklova, 38:5–23, 1951

work page 1951

[3] [3]

Alexander, V

S. Alexander, V. Kapovitch, and A. Petrunin.Alexandrov Geometry: Foundations, volume 236 ofGraduate Studies in Mathematics. American Mathematical Society, Providence, RI, 2024

work page 2024

[4] [4]

Asi and J.C

H. Asi and J.C. Duchi. Stochastic (approximate) proximal point methods: Convergence, optimality, and adaptivity.SIAM Journal on Optimization, 29(3):2257–2290, 2019

work page 2019

[5] [5]

Aubin and H

J.-P. Aubin and H. Frankowska.Set-Valued Analysis. Springer, New York, 2009

work page 2009

[6] [6]

Baˇ c´ ak

M. Baˇ c´ ak. The proximal point algorithm in metric spaces.Israel Journal of Mathematics, 194(2):689–701, 2013

work page 2013

[7] [7]

Baˇ c´ ak

M. Baˇ c´ ak. Computing medians and means in Hadamard spaces.SIAM Journal of Optimization, 24(3):1542– 1566, 2014

work page 2014

[8] [8]

Baˇ c´ ak.Convex analysis and optimization in Hadamard spaces, volume 22 ofDe Gruyter Series in Nonlinear Analysis and Applications

M. Baˇ c´ ak.Convex analysis and optimization in Hadamard spaces, volume 22 ofDe Gruyter Series in Nonlinear Analysis and Applications. Walter de Gruyter GmbH, Berlin/Boston, 2014

work page 2014

[9] [9]

Baˇ c´ ak

M. Baˇ c´ ak. A variational approach to stochastic minimization of convex functionals.Pure and Applied Functional Analysis, 3(2):287–295, 2018

work page 2018

[10] [10]

Baˇ c´ ak

M. Baˇ c´ ak. Old and new challenges in Hadamard spaces.Japanese Journal of Mathematics, 18(2):115–168, 2023

work page 2023

[11] [11]

Baˇ c´ ak, I

M. Baˇ c´ ak, I. Searston, and B. Sims. Alternating projections in CAT(0) spaces.Journal of Mathematical Analysis and Applications, 385:599–607, 2012

work page 2012

[12] [12]

Bertsekas

D.P. Bertsekas. Incremental proximal methods for large scale convex optimization.Mathematical Program- ming. Series B, 129:163–195, 2011

work page 2011

[13] [13]

Bertsekas

D.P. Bertsekas. Incremental gradient, subgradient, and proximal methods for convex optimization: A sur- vey. In S. Sra, S. Nowozin, and S.J. Wright, editors,Optimization for Machine Learning, Neural Information Processing Series, pages 85–120. The MIT Press, Cambridge, Massachusetts, 2012

work page 2012

[14] [14]

Billera, S.P

L.J. Billera, S.P. Holmes, and K. Vogtmann. Geometry of the space of phylogenetic trees.Advances in Applied Mathematics, 27(4):733–767, 2001

work page 2001

[15] [15]

Br´ ezis and P.L

H. Br´ ezis and P.L. Lions. Produits infinis de resolvantes.Israel Journal of Mathematics, 29(4):329–345, 1978

work page 1978

[16] [16]

Bridson and A

M.R. Bridson and A. Haefliger.Metric Spaces of Non-Positive Curvature, volume 319 ofGrundlehren der mathematischen Wissenschaften. Springer Berlin, Heidelberg, 1999

work page 1999

[17] [17]

Bruhat and J

F. Bruhat and J. Tits. Groupes r´ eductifs sur un corps local. I. Donn´ ees radicielles valu´ ees.Publications Math´ ematiques de l’Institut des Hautes ´Etudes Scientifiques, 41:5–251, 1972

work page 1972

[18] [18]

Castaing and M

C. Castaing and M. Valadier.Convex Analysis and Measurable Multifunctions, volume 580 ofLecture Notes in Mathematics. Springer Berlin, Heidelberg, 1977

work page 1977

[19] [19]

Combettes and J.C

P.L. Combettes and J.C. Pesquet. Stochastic quasi-Fej´ er block-coordinate fixed point iterations with ran- dom sweeping.SIAM Journal on Optimization, 25(2):1221–1248, 2015

work page 2015

[20] [20]

Dhompongsa, W.A

S. Dhompongsa, W.A. Kirk, and B. Sims. Fixed points of uniformly Lipschitzian mappings.Nonlinear Analysis. Theory, Methods & Applications, 65:762–772, 2006

work page 2006

[21] [21]

Goodwin, A.S

A. Goodwin, A.S. Lewis, G. L´ opez-Acedo, and A. Nicolae. Stochastic and incremental subgradient methods for convex optimization on Hadamard spaces.Mathematical Programming, 2026. To appear

work page 2026

[22] [22]

M. Gromov. Hyperbolic groups. In S.M. Gersten, editor,Essays in group theory, volume 8 ofMathematical Sciences Research Institute Publications, pages 75–263. Springer, New York, 1987

work page 1987

[23] [23]

O. G¨ uler. On the convergence of the proximal point algorithm for convex minimization.SIAM Journal on Control and Optimization, 29:403—-419, 1991

work page 1991

[24] [24]

J. Jost. Equilibrium maps between metric spaces.Calculus of Variations and Partial Differential Equations, 2:173–204, 1994

work page 1994

[25] [25]

J. Jost. Convex functionals and generalized harmonic maps into spaces of nonpositive curvature.Commen- tarii Mathematici Helvetici, 70:659–673, 1995

work page 1995

[26] [26]

Kirk and B

W.A. Kirk and B. Panyanak. A concept of convergence in geodesic spaces.Nonlinear Analysis. Theory, Methods & Applications, 68:3689–3696, 2008

work page 2008

[27] [27]

Klenke.Probability Theory: A Comprehensive Course

A. Klenke.Probability Theory: A Comprehensive Course. Universitext. Springer Cham, 3rd edition, 2020. 12 N. PISCHKE

work page 2020

[28] [28]

Martinet

B. Martinet. R´ egularisation din´ equations variationnelles par approximations successives.Revue fran¸ caise d’informatique et de recherche op´ erationnelle, 4:154–159, 1970

work page 1970

[29] [29]

U. Mayer. Gradient flows on nonpositively curved metric spaces and harmonic maps.Communications in Analysis and Geometry, 6:199–253, 1998

work page 1998

[30] [30]

Nemirovski, A

A. Nemirovski, A. Juditsky, G. Lan, and A. Shapiro. Robust stochastic approximation approach to sto- chastic programming.SIAM Journal of Optimization, 19:1574–1609, 2009

work page 2009

[31] [31]

M. Neri, N. Pischke, and T. Powell. An abstract effective convergence theorem for stochastic processes, with applications to stochastic approximation, 2026. Preprint,https://arxiv.org/abs/2504.12922

work page internal anchor Pith review Pith/arXiv arXiv 2026

[32] [32]

Neri and T

M. Neri and T. Powell. On quantitative convergence for stochastic processes: Crossings, fluctuations and martingales.Transactions of the American Mathematical Society, Series B, 12:974–1019, 2025

work page 2025

[33] [33]

Neri and T

M. Neri and T. Powell. A quantitative Robbins-Siegmund theorem.The Annals of Applied Probability, 36(1):636–651, 2026

work page 2026

[34] [34]

B.J. Pettis. On integration in vector spaces.Transactions of the American Mathematical Society, 44:277– 304, 1938

work page 1938

[35] [35]

N. Pischke. On Busemann subgradient methods for stochastic minimization in Hadamard spaces, 2026. Preprint,https://arxiv.org/abs/2602.08127

work page arXiv 2026

[36] [36]

Convergence guarantees for stochastic algorithms solving non-unique problems in metric spaces

N. Pischke and T. Powell. Convergence guarantees for stochastic algorithms solving non-unique problems in metric spaces, 2026. Preprint,https://arxiv.org/abs/2605.06129

work page internal anchor Pith review Pith/arXiv arXiv 2026

[37] [37]

Rockafellar

R.T. Rockafellar. Convex integral functionals and duality. In E.H. Zarantonello, editor,Contributions to Nonlinear Functional Analysis, pages 215–236. Academic Press, New York, 1971

work page 1971

[38] [38]

Rockafellar

R.T. Rockafellar. Monotone operators and the proximal point algorithm.SIAM Journal of Control and Optimization, 14:877–898, 1976

work page 1976

[39] [39]

Ryu and S

E.K. Ryu and S. Boyd. Stochastic Proximal Iteration: A Non-Asymptotic Improvement upon Stochastic Gradient Descent. working draft, accessed 2026,https://ernestryu.com/papers/spi.pdf

work page 2026

[40] [40]

Williams.Probability with martingales

D. Williams.Probability with martingales. Cambridge University Press, 1991

work page 1991

[41] [41]

Zhang and S

H. Zhang and S. Sra. First-order methods for geodesically convex optimization. In V. Feldman, A. Rakhlin, and O. Shamir, editors,Proceedings of the 29th Annual Conference on Learning Theory (COLT), volume 49 ofProceedings of Machine Learning Research, pages 1617–1638. PMLR, 2016

work page 2016