Weak convergence of the stochastic proximal point method in metric spaces
Pith reviewed 2026-05-21 04:02 UTC · model grok-4.3
The pith
Stochastic proximal point method converges weakly almost surely in Hadamard spaces
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper proves the almost sure weak convergence of a stochastic proximal point method for minimizing a convex integral function in complete geodesic metric spaces of nonpositive curvature. The method is formulated using a mild growth condition on the function that generalizes Lipschitz continuity. The proof relies on a weak almost sure convergence theorem for stochastic processes in these spaces that satisfy a stochastic variant of quasi-Fejér monotonicity together with a new argument establishing almost sure convergence of the mean function values to the minimal value.
What carries the argument
The stochastic proximal point iteration in Hadamard spaces under a mild growth condition on the objective, carried by stochastic quasi-Fejér monotonicity of the process.
If this is right
- The iterates converge weakly almost surely to a minimizer of the convex integral function.
- The mean values of the objective function converge almost surely to the minimal value.
- The result applies directly in general Hadamard spaces without requiring local compactness.
- The technique extends prior convergence theorems for stochastic processes in these spaces.
Where Pith is reading between the lines
- The same mild growth condition and quasi-Fejér arguments may apply to other stochastic first-order methods in non-Euclidean metric spaces.
- The framework could support optimization tasks on data structures naturally modeled by hyperbolic or tree-like geometries.
- Additional assumptions on the function might yield explicit convergence rates under the same geometric setting.
Load-bearing premise
The objective function satisfies a mild growth condition that generalizes Lipschitz continuity to allow formulation of the method and to secure convergence of mean function values.
What would settle it
A concrete convex integral function on a non-locally compact Hadamard space that meets the mild growth condition but for which the generated sequence fails to converge weakly almost surely.
read the original abstract
We prove the almost sure weak convergence of a stochastic proximal point method for minimizing a convex integral function in the general nonlinear context of complete geodesic metric spaces of nonpositive curvature (so-called Hadamard spaces), solving a problem of M. Ba\v{c}\'ak. This method, formulated in the context of a mild growth condition on the function which generalizes Lipschitz continuity, was previously only considered in the context of strong metric regularity conditions or in the context of locally compact spaces. The proof is a combination of a weak almost sure convergence theorem for stochastic processes in Hadamard spaces which confine to a stochastic variant of quasi-Fej\'er monotonicity, due to previous work of the author, together with a new argument for proving the almost sure convergence of the mean function values of the process towards the minimal value.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proves almost sure weak convergence of the stochastic proximal point method for minimizing a convex integral functional over complete geodesic metric spaces of nonpositive curvature (Hadamard spaces). The argument combines a prior stochastic quasi-Fejér monotonicity theorem with a new mean-value step showing that E[f(X_n)] converges almost surely to the infimum of the objective, under a mild growth condition on the integrand that generalizes Lipschitz continuity. This is presented as resolving an open question of Bačák by removing the need for local compactness or strong metric regularity.
Significance. If the new mean-value argument is valid without tacit compactness assumptions, the result would meaningfully extend stochastic proximal methods to general Hadamard spaces. The clean modular structure—invoking the author's earlier quasi-Fejér theorem and adding a targeted convergence-of-means step—offers a reusable template for other stochastic algorithms in nonlinear metric settings. The mild growth condition is a natural and verifiable relaxation of prior hypotheses.
major comments (2)
- [Proof of the mean-value convergence (new argument combining quasi-Fejér with expectation control)] The new argument for almost sure convergence of the mean values E[f(X_n)] to inf f (described in the abstract and proof outline) invokes the mild growth condition both to define the stochastic proximal mapping and to control the mean-value step. In non-locally compact Hadamard spaces this step risks failure if recession directions of the integrand are not uniformly controlled; the manuscript should explicitly verify that no hidden local-compactness or uniform-integrability property is used, or supply a counter-example space where the growth condition alone is insufficient.
- [Invocation of the prior stochastic quasi-Fejér theorem] The central weak-convergence claim depends on the stochastic quasi-Fejér theorem from the author's prior work. The manuscript should include a self-contained verification that all hypotheses of that theorem (e.g., the specific form of the stochastic perturbation and the growth condition) are satisfied by the proximal-point iteration defined here.
minor comments (2)
- [Abstract and introduction] A direct citation to the specific open problem posed by Bačák (including the reference) would help readers locate the exact statement being solved.
- [Preliminaries] Notation for the stochastic proximal mapping and the mild growth condition should be introduced with a displayed definition before its first use in the main theorems.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment below and indicate planned revisions to strengthen the presentation.
read point-by-point responses
-
Referee: [Proof of the mean-value convergence (new argument combining quasi-Fejér with expectation control)] The new argument for almost sure convergence of the mean values E[f(X_n)] to inf f (described in the abstract and proof outline) invokes the mild growth condition both to define the stochastic proximal mapping and to control the mean-value step. In non-locally compact Hadamard spaces this step risks failure if recession directions of the integrand are not uniformly controlled; the manuscript should explicitly verify that no hidden local-compactness or uniform-integrability property is used, or supply a counter-example space where the growth condition alone is insufficient.
Authors: We appreciate the referee's concern about potential tacit assumptions in non-locally compact settings. The mild growth condition is formulated precisely to control the expectations and recession behavior of the integrand via convexity, ensuring the mean-value step proceeds without local compactness or additional uniform-integrability hypotheses. In the revised manuscript we will insert a short clarifying paragraph (or lemma) immediately after the statement of the growth condition that explicitly confirms the argument uses only the given hypotheses and the geodesic properties of Hadamard spaces. We therefore see no need for a counter-example. revision: yes
-
Referee: [Invocation of the prior stochastic quasi-Fejér theorem] The central weak-convergence claim depends on the stochastic quasi-Fejér theorem from the author's prior work. The manuscript should include a self-contained verification that all hypotheses of that theorem (e.g., the specific form of the stochastic perturbation and the growth condition) are satisfied by the proximal-point iteration defined here.
Authors: We agree that a self-contained verification improves readability and rigor. In the revised version we will add a dedicated subsection (or short appendix) that systematically checks every hypothesis of the stochastic quasi-Fejér monotonicity theorem against the stochastic proximal-point iteration, including the precise form of the perturbation and the compatibility of the growth condition. revision: yes
Circularity Check
Central weak convergence theorem rests on author's prior stochastic quasi-Fejér result
specific steps
-
self citation load bearing
[Abstract]
"The proof is a combination of a weak almost sure convergence theorem for stochastic processes in Hadamard spaces which confine to a stochastic variant of quasi-Fejér monotonicity, due to previous work of the author, together with a new argument for proving the almost sure convergence of the mean function values of the process towards the minimal value."
The central almost sure weak convergence result is obtained by invoking the author's earlier theorem on stochastic quasi-Fejér monotonicity; that prior result is not re-proved or externally benchmarked here, so the new paper's main theorem reduces in part to self-citation rather than standing fully on independent external verification.
full rationale
The paper's proof explicitly combines a new argument for almost sure convergence of mean function values with a weak almost sure convergence theorem for stochastic quasi-Fejér processes taken from the author's previous work. This self-citation is load-bearing for the main claim but is not the sole content; the new mean-value step provides independent material. No other circular patterns (self-definitional fits, ansatz smuggling, or renaming) appear in the provided derivation outline. The result therefore receives a moderate circularity score rather than a high one.
Axiom & Free-Parameter Ledger
axioms (2)
- standard math Hadamard spaces are complete geodesic metric spaces with nonpositive curvature
- domain assumption The convex integral function satisfies a mild growth condition generalizing Lipschitz continuity
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Lemma 3.1 (stochastic quasi-Fejér monotonicity) and Lemma 3.4 (stochastic recursive inequality for mean values) together with growth condition (A2)
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
weak convergence in separable Hadamard spaces under (A1)+(A2)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Ya.I. Alber, A.N. Iusem, and M.V. Solodov. On the projected subgradient method for nonsmooth convex optimization in a Hilbert space.Mathematical Programming, 81:23–35, 1998
work page 1998
-
[2]
A.D. Aleksandrov. A theorem on triangles in a metric space and some of its applications.Trudy Matem- aticheskogo Instituta imeni V.A. Steklova, 38:5–23, 1951
work page 1951
-
[3]
S. Alexander, V. Kapovitch, and A. Petrunin.Alexandrov Geometry: Foundations, volume 236 ofGraduate Studies in Mathematics. American Mathematical Society, Providence, RI, 2024
work page 2024
-
[4]
H. Asi and J.C. Duchi. Stochastic (approximate) proximal point methods: Convergence, optimality, and adaptivity.SIAM Journal on Optimization, 29(3):2257–2290, 2019
work page 2019
-
[5]
J.-P. Aubin and H. Frankowska.Set-Valued Analysis. Springer, New York, 2009
work page 2009
- [6]
- [7]
-
[8]
M. Baˇ c´ ak.Convex analysis and optimization in Hadamard spaces, volume 22 ofDe Gruyter Series in Nonlinear Analysis and Applications. Walter de Gruyter GmbH, Berlin/Boston, 2014
work page 2014
- [9]
- [10]
-
[11]
M. Baˇ c´ ak, I. Searston, and B. Sims. Alternating projections in CAT(0) spaces.Journal of Mathematical Analysis and Applications, 385:599–607, 2012
work page 2012
- [12]
-
[13]
D.P. Bertsekas. Incremental gradient, subgradient, and proximal methods for convex optimization: A sur- vey. In S. Sra, S. Nowozin, and S.J. Wright, editors,Optimization for Machine Learning, Neural Information Processing Series, pages 85–120. The MIT Press, Cambridge, Massachusetts, 2012
work page 2012
-
[14]
L.J. Billera, S.P. Holmes, and K. Vogtmann. Geometry of the space of phylogenetic trees.Advances in Applied Mathematics, 27(4):733–767, 2001
work page 2001
-
[15]
H. Br´ ezis and P.L. Lions. Produits infinis de resolvantes.Israel Journal of Mathematics, 29(4):329–345, 1978
work page 1978
-
[16]
M.R. Bridson and A. Haefliger.Metric Spaces of Non-Positive Curvature, volume 319 ofGrundlehren der mathematischen Wissenschaften. Springer Berlin, Heidelberg, 1999
work page 1999
-
[17]
F. Bruhat and J. Tits. Groupes r´ eductifs sur un corps local. I. Donn´ ees radicielles valu´ ees.Publications Math´ ematiques de l’Institut des Hautes ´Etudes Scientifiques, 41:5–251, 1972
work page 1972
-
[18]
C. Castaing and M. Valadier.Convex Analysis and Measurable Multifunctions, volume 580 ofLecture Notes in Mathematics. Springer Berlin, Heidelberg, 1977
work page 1977
-
[19]
P.L. Combettes and J.C. Pesquet. Stochastic quasi-Fej´ er block-coordinate fixed point iterations with ran- dom sweeping.SIAM Journal on Optimization, 25(2):1221–1248, 2015
work page 2015
-
[20]
S. Dhompongsa, W.A. Kirk, and B. Sims. Fixed points of uniformly Lipschitzian mappings.Nonlinear Analysis. Theory, Methods & Applications, 65:762–772, 2006
work page 2006
-
[21]
A. Goodwin, A.S. Lewis, G. L´ opez-Acedo, and A. Nicolae. Stochastic and incremental subgradient methods for convex optimization on Hadamard spaces.Mathematical Programming, 2026. To appear
work page 2026
-
[22]
M. Gromov. Hyperbolic groups. In S.M. Gersten, editor,Essays in group theory, volume 8 ofMathematical Sciences Research Institute Publications, pages 75–263. Springer, New York, 1987
work page 1987
-
[23]
O. G¨ uler. On the convergence of the proximal point algorithm for convex minimization.SIAM Journal on Control and Optimization, 29:403—-419, 1991
work page 1991
-
[24]
J. Jost. Equilibrium maps between metric spaces.Calculus of Variations and Partial Differential Equations, 2:173–204, 1994
work page 1994
-
[25]
J. Jost. Convex functionals and generalized harmonic maps into spaces of nonpositive curvature.Commen- tarii Mathematici Helvetici, 70:659–673, 1995
work page 1995
-
[26]
W.A. Kirk and B. Panyanak. A concept of convergence in geodesic spaces.Nonlinear Analysis. Theory, Methods & Applications, 68:3689–3696, 2008
work page 2008
-
[27]
Klenke.Probability Theory: A Comprehensive Course
A. Klenke.Probability Theory: A Comprehensive Course. Universitext. Springer Cham, 3rd edition, 2020. 12 N. PISCHKE
work page 2020
- [28]
-
[29]
U. Mayer. Gradient flows on nonpositively curved metric spaces and harmonic maps.Communications in Analysis and Geometry, 6:199–253, 1998
work page 1998
-
[30]
A. Nemirovski, A. Juditsky, G. Lan, and A. Shapiro. Robust stochastic approximation approach to sto- chastic programming.SIAM Journal of Optimization, 19:1574–1609, 2009
work page 2009
-
[31]
M. Neri, N. Pischke, and T. Powell. An abstract effective convergence theorem for stochastic processes, with applications to stochastic approximation, 2026. Preprint,https://arxiv.org/abs/2504.12922
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[32]
M. Neri and T. Powell. On quantitative convergence for stochastic processes: Crossings, fluctuations and martingales.Transactions of the American Mathematical Society, Series B, 12:974–1019, 2025
work page 2025
-
[33]
M. Neri and T. Powell. A quantitative Robbins-Siegmund theorem.The Annals of Applied Probability, 36(1):636–651, 2026
work page 2026
-
[34]
B.J. Pettis. On integration in vector spaces.Transactions of the American Mathematical Society, 44:277– 304, 1938
work page 1938
- [35]
-
[36]
Convergence guarantees for stochastic algorithms solving non-unique problems in metric spaces
N. Pischke and T. Powell. Convergence guarantees for stochastic algorithms solving non-unique problems in metric spaces, 2026. Preprint,https://arxiv.org/abs/2605.06129
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[37]
R.T. Rockafellar. Convex integral functionals and duality. In E.H. Zarantonello, editor,Contributions to Nonlinear Functional Analysis, pages 215–236. Academic Press, New York, 1971
work page 1971
-
[38]
R.T. Rockafellar. Monotone operators and the proximal point algorithm.SIAM Journal of Control and Optimization, 14:877–898, 1976
work page 1976
- [39]
-
[40]
Williams.Probability with martingales
D. Williams.Probability with martingales. Cambridge University Press, 1991
work page 1991
-
[41]
H. Zhang and S. Sra. First-order methods for geodesically convex optimization. In V. Feldman, A. Rakhlin, and O. Shamir, editors,Proceedings of the 29th Annual Conference on Learning Theory (COLT), volume 49 ofProceedings of Machine Learning Research, pages 1617–1638. PMLR, 2016
work page 2016
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.