pith. sign in

arxiv: 2605.20805 · v1 · pith:ZTSH7UPWnew · submitted 2026-05-20 · 🧮 math.OC

Weak convergence of the stochastic proximal point method in metric spaces

Pith reviewed 2026-05-21 04:02 UTC · model grok-4.3

classification 🧮 math.OC
keywords weak convergencestochastic proximal pointHadamard spacesconvex optimizationmetric spacesquasi-Fejer monotonicityintegral functionalsnonpositive curvature
0
0 comments X

The pith

Stochastic proximal point method converges weakly almost surely in Hadamard spaces

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that a stochastic proximal point method for minimizing a convex integral function produces iterates that converge weakly almost surely in complete geodesic metric spaces of nonpositive curvature. This holds under a mild growth condition on the objective that generalizes Lipschitz continuity and is used both to define the iterates and to prove convergence of their average values. The argument combines an existing result on stochastic processes satisfying a stochastic form of quasi-Fejér monotonicity with a new step showing almost sure convergence of mean function values to the infimum. A reader would care because the result applies in general nonlinear geometries without local compactness or stronger regularity assumptions that limited earlier work.

Core claim

The paper proves the almost sure weak convergence of a stochastic proximal point method for minimizing a convex integral function in complete geodesic metric spaces of nonpositive curvature. The method is formulated using a mild growth condition on the function that generalizes Lipschitz continuity. The proof relies on a weak almost sure convergence theorem for stochastic processes in these spaces that satisfy a stochastic variant of quasi-Fejér monotonicity together with a new argument establishing almost sure convergence of the mean function values to the minimal value.

What carries the argument

The stochastic proximal point iteration in Hadamard spaces under a mild growth condition on the objective, carried by stochastic quasi-Fejér monotonicity of the process.

If this is right

  • The iterates converge weakly almost surely to a minimizer of the convex integral function.
  • The mean values of the objective function converge almost surely to the minimal value.
  • The result applies directly in general Hadamard spaces without requiring local compactness.
  • The technique extends prior convergence theorems for stochastic processes in these spaces.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same mild growth condition and quasi-Fejér arguments may apply to other stochastic first-order methods in non-Euclidean metric spaces.
  • The framework could support optimization tasks on data structures naturally modeled by hyperbolic or tree-like geometries.
  • Additional assumptions on the function might yield explicit convergence rates under the same geometric setting.

Load-bearing premise

The objective function satisfies a mild growth condition that generalizes Lipschitz continuity to allow formulation of the method and to secure convergence of mean function values.

What would settle it

A concrete convex integral function on a non-locally compact Hadamard space that meets the mild growth condition but for which the generated sequence fails to converge weakly almost surely.

read the original abstract

We prove the almost sure weak convergence of a stochastic proximal point method for minimizing a convex integral function in the general nonlinear context of complete geodesic metric spaces of nonpositive curvature (so-called Hadamard spaces), solving a problem of M. Ba\v{c}\'ak. This method, formulated in the context of a mild growth condition on the function which generalizes Lipschitz continuity, was previously only considered in the context of strong metric regularity conditions or in the context of locally compact spaces. The proof is a combination of a weak almost sure convergence theorem for stochastic processes in Hadamard spaces which confine to a stochastic variant of quasi-Fej\'er monotonicity, due to previous work of the author, together with a new argument for proving the almost sure convergence of the mean function values of the process towards the minimal value.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proves almost sure weak convergence of the stochastic proximal point method for minimizing a convex integral functional over complete geodesic metric spaces of nonpositive curvature (Hadamard spaces). The argument combines a prior stochastic quasi-Fejér monotonicity theorem with a new mean-value step showing that E[f(X_n)] converges almost surely to the infimum of the objective, under a mild growth condition on the integrand that generalizes Lipschitz continuity. This is presented as resolving an open question of Bačák by removing the need for local compactness or strong metric regularity.

Significance. If the new mean-value argument is valid without tacit compactness assumptions, the result would meaningfully extend stochastic proximal methods to general Hadamard spaces. The clean modular structure—invoking the author's earlier quasi-Fejér theorem and adding a targeted convergence-of-means step—offers a reusable template for other stochastic algorithms in nonlinear metric settings. The mild growth condition is a natural and verifiable relaxation of prior hypotheses.

major comments (2)
  1. [Proof of the mean-value convergence (new argument combining quasi-Fejér with expectation control)] The new argument for almost sure convergence of the mean values E[f(X_n)] to inf f (described in the abstract and proof outline) invokes the mild growth condition both to define the stochastic proximal mapping and to control the mean-value step. In non-locally compact Hadamard spaces this step risks failure if recession directions of the integrand are not uniformly controlled; the manuscript should explicitly verify that no hidden local-compactness or uniform-integrability property is used, or supply a counter-example space where the growth condition alone is insufficient.
  2. [Invocation of the prior stochastic quasi-Fejér theorem] The central weak-convergence claim depends on the stochastic quasi-Fejér theorem from the author's prior work. The manuscript should include a self-contained verification that all hypotheses of that theorem (e.g., the specific form of the stochastic perturbation and the growth condition) are satisfied by the proximal-point iteration defined here.
minor comments (2)
  1. [Abstract and introduction] A direct citation to the specific open problem posed by Bačák (including the reference) would help readers locate the exact statement being solved.
  2. [Preliminaries] Notation for the stochastic proximal mapping and the mild growth condition should be introduced with a displayed definition before its first use in the main theorems.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment below and indicate planned revisions to strengthen the presentation.

read point-by-point responses
  1. Referee: [Proof of the mean-value convergence (new argument combining quasi-Fejér with expectation control)] The new argument for almost sure convergence of the mean values E[f(X_n)] to inf f (described in the abstract and proof outline) invokes the mild growth condition both to define the stochastic proximal mapping and to control the mean-value step. In non-locally compact Hadamard spaces this step risks failure if recession directions of the integrand are not uniformly controlled; the manuscript should explicitly verify that no hidden local-compactness or uniform-integrability property is used, or supply a counter-example space where the growth condition alone is insufficient.

    Authors: We appreciate the referee's concern about potential tacit assumptions in non-locally compact settings. The mild growth condition is formulated precisely to control the expectations and recession behavior of the integrand via convexity, ensuring the mean-value step proceeds without local compactness or additional uniform-integrability hypotheses. In the revised manuscript we will insert a short clarifying paragraph (or lemma) immediately after the statement of the growth condition that explicitly confirms the argument uses only the given hypotheses and the geodesic properties of Hadamard spaces. We therefore see no need for a counter-example. revision: yes

  2. Referee: [Invocation of the prior stochastic quasi-Fejér theorem] The central weak-convergence claim depends on the stochastic quasi-Fejér theorem from the author's prior work. The manuscript should include a self-contained verification that all hypotheses of that theorem (e.g., the specific form of the stochastic perturbation and the growth condition) are satisfied by the proximal-point iteration defined here.

    Authors: We agree that a self-contained verification improves readability and rigor. In the revised version we will add a dedicated subsection (or short appendix) that systematically checks every hypothesis of the stochastic quasi-Fejér monotonicity theorem against the stochastic proximal-point iteration, including the precise form of the perturbation and the compatibility of the growth condition. revision: yes

Circularity Check

1 steps flagged

Central weak convergence theorem rests on author's prior stochastic quasi-Fejér result

specific steps
  1. self citation load bearing [Abstract]
    "The proof is a combination of a weak almost sure convergence theorem for stochastic processes in Hadamard spaces which confine to a stochastic variant of quasi-Fejér monotonicity, due to previous work of the author, together with a new argument for proving the almost sure convergence of the mean function values of the process towards the minimal value."

    The central almost sure weak convergence result is obtained by invoking the author's earlier theorem on stochastic quasi-Fejér monotonicity; that prior result is not re-proved or externally benchmarked here, so the new paper's main theorem reduces in part to self-citation rather than standing fully on independent external verification.

full rationale

The paper's proof explicitly combines a new argument for almost sure convergence of mean function values with a weak almost sure convergence theorem for stochastic quasi-Fejér processes taken from the author's previous work. This self-citation is load-bearing for the main claim but is not the sole content; the new mean-value step provides independent material. No other circular patterns (self-definitional fits, ansatz smuggling, or renaming) appear in the provided derivation outline. The result therefore receives a moderate circularity score rather than a high one.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The proof rests on standard properties of Hadamard spaces and a domain-specific growth condition; no free parameters or new entities are introduced.

axioms (2)
  • standard math Hadamard spaces are complete geodesic metric spaces with nonpositive curvature
    Defines the ambient space in which the proximal point iteration and weak convergence are studied.
  • domain assumption The convex integral function satisfies a mild growth condition generalizing Lipschitz continuity
    Required to formulate the stochastic proximal mapping and to prove convergence of the mean function values.

pith-pipeline@v0.9.0 · 5658 in / 1289 out tokens · 35894 ms · 2026-05-21T04:02:46.279232+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

41 extracted references · 41 canonical work pages · 2 internal anchors

  1. [1]

    Alber, A.N

    Ya.I. Alber, A.N. Iusem, and M.V. Solodov. On the projected subgradient method for nonsmooth convex optimization in a Hilbert space.Mathematical Programming, 81:23–35, 1998

  2. [2]

    Aleksandrov

    A.D. Aleksandrov. A theorem on triangles in a metric space and some of its applications.Trudy Matem- aticheskogo Instituta imeni V.A. Steklova, 38:5–23, 1951

  3. [3]

    Alexander, V

    S. Alexander, V. Kapovitch, and A. Petrunin.Alexandrov Geometry: Foundations, volume 236 ofGraduate Studies in Mathematics. American Mathematical Society, Providence, RI, 2024

  4. [4]

    Asi and J.C

    H. Asi and J.C. Duchi. Stochastic (approximate) proximal point methods: Convergence, optimality, and adaptivity.SIAM Journal on Optimization, 29(3):2257–2290, 2019

  5. [5]

    Aubin and H

    J.-P. Aubin and H. Frankowska.Set-Valued Analysis. Springer, New York, 2009

  6. [6]

    Baˇ c´ ak

    M. Baˇ c´ ak. The proximal point algorithm in metric spaces.Israel Journal of Mathematics, 194(2):689–701, 2013

  7. [7]

    Baˇ c´ ak

    M. Baˇ c´ ak. Computing medians and means in Hadamard spaces.SIAM Journal of Optimization, 24(3):1542– 1566, 2014

  8. [8]

    Baˇ c´ ak.Convex analysis and optimization in Hadamard spaces, volume 22 ofDe Gruyter Series in Nonlinear Analysis and Applications

    M. Baˇ c´ ak.Convex analysis and optimization in Hadamard spaces, volume 22 ofDe Gruyter Series in Nonlinear Analysis and Applications. Walter de Gruyter GmbH, Berlin/Boston, 2014

  9. [9]

    Baˇ c´ ak

    M. Baˇ c´ ak. A variational approach to stochastic minimization of convex functionals.Pure and Applied Functional Analysis, 3(2):287–295, 2018

  10. [10]

    Baˇ c´ ak

    M. Baˇ c´ ak. Old and new challenges in Hadamard spaces.Japanese Journal of Mathematics, 18(2):115–168, 2023

  11. [11]

    Baˇ c´ ak, I

    M. Baˇ c´ ak, I. Searston, and B. Sims. Alternating projections in CAT(0) spaces.Journal of Mathematical Analysis and Applications, 385:599–607, 2012

  12. [12]

    Bertsekas

    D.P. Bertsekas. Incremental proximal methods for large scale convex optimization.Mathematical Program- ming. Series B, 129:163–195, 2011

  13. [13]

    Bertsekas

    D.P. Bertsekas. Incremental gradient, subgradient, and proximal methods for convex optimization: A sur- vey. In S. Sra, S. Nowozin, and S.J. Wright, editors,Optimization for Machine Learning, Neural Information Processing Series, pages 85–120. The MIT Press, Cambridge, Massachusetts, 2012

  14. [14]

    Billera, S.P

    L.J. Billera, S.P. Holmes, and K. Vogtmann. Geometry of the space of phylogenetic trees.Advances in Applied Mathematics, 27(4):733–767, 2001

  15. [15]

    Br´ ezis and P.L

    H. Br´ ezis and P.L. Lions. Produits infinis de resolvantes.Israel Journal of Mathematics, 29(4):329–345, 1978

  16. [16]

    Bridson and A

    M.R. Bridson and A. Haefliger.Metric Spaces of Non-Positive Curvature, volume 319 ofGrundlehren der mathematischen Wissenschaften. Springer Berlin, Heidelberg, 1999

  17. [17]

    Bruhat and J

    F. Bruhat and J. Tits. Groupes r´ eductifs sur un corps local. I. Donn´ ees radicielles valu´ ees.Publications Math´ ematiques de l’Institut des Hautes ´Etudes Scientifiques, 41:5–251, 1972

  18. [18]

    Castaing and M

    C. Castaing and M. Valadier.Convex Analysis and Measurable Multifunctions, volume 580 ofLecture Notes in Mathematics. Springer Berlin, Heidelberg, 1977

  19. [19]

    Combettes and J.C

    P.L. Combettes and J.C. Pesquet. Stochastic quasi-Fej´ er block-coordinate fixed point iterations with ran- dom sweeping.SIAM Journal on Optimization, 25(2):1221–1248, 2015

  20. [20]

    Dhompongsa, W.A

    S. Dhompongsa, W.A. Kirk, and B. Sims. Fixed points of uniformly Lipschitzian mappings.Nonlinear Analysis. Theory, Methods & Applications, 65:762–772, 2006

  21. [21]

    Goodwin, A.S

    A. Goodwin, A.S. Lewis, G. L´ opez-Acedo, and A. Nicolae. Stochastic and incremental subgradient methods for convex optimization on Hadamard spaces.Mathematical Programming, 2026. To appear

  22. [22]

    M. Gromov. Hyperbolic groups. In S.M. Gersten, editor,Essays in group theory, volume 8 ofMathematical Sciences Research Institute Publications, pages 75–263. Springer, New York, 1987

  23. [23]

    O. G¨ uler. On the convergence of the proximal point algorithm for convex minimization.SIAM Journal on Control and Optimization, 29:403—-419, 1991

  24. [24]

    J. Jost. Equilibrium maps between metric spaces.Calculus of Variations and Partial Differential Equations, 2:173–204, 1994

  25. [25]

    J. Jost. Convex functionals and generalized harmonic maps into spaces of nonpositive curvature.Commen- tarii Mathematici Helvetici, 70:659–673, 1995

  26. [26]

    Kirk and B

    W.A. Kirk and B. Panyanak. A concept of convergence in geodesic spaces.Nonlinear Analysis. Theory, Methods & Applications, 68:3689–3696, 2008

  27. [27]

    Klenke.Probability Theory: A Comprehensive Course

    A. Klenke.Probability Theory: A Comprehensive Course. Universitext. Springer Cham, 3rd edition, 2020. 12 N. PISCHKE

  28. [28]

    Martinet

    B. Martinet. R´ egularisation din´ equations variationnelles par approximations successives.Revue fran¸ caise d’informatique et de recherche op´ erationnelle, 4:154–159, 1970

  29. [29]

    U. Mayer. Gradient flows on nonpositively curved metric spaces and harmonic maps.Communications in Analysis and Geometry, 6:199–253, 1998

  30. [30]

    Nemirovski, A

    A. Nemirovski, A. Juditsky, G. Lan, and A. Shapiro. Robust stochastic approximation approach to sto- chastic programming.SIAM Journal of Optimization, 19:1574–1609, 2009

  31. [31]

    M. Neri, N. Pischke, and T. Powell. An abstract effective convergence theorem for stochastic processes, with applications to stochastic approximation, 2026. Preprint,https://arxiv.org/abs/2504.12922

  32. [32]

    Neri and T

    M. Neri and T. Powell. On quantitative convergence for stochastic processes: Crossings, fluctuations and martingales.Transactions of the American Mathematical Society, Series B, 12:974–1019, 2025

  33. [33]

    Neri and T

    M. Neri and T. Powell. A quantitative Robbins-Siegmund theorem.The Annals of Applied Probability, 36(1):636–651, 2026

  34. [34]

    B.J. Pettis. On integration in vector spaces.Transactions of the American Mathematical Society, 44:277– 304, 1938

  35. [35]

    N. Pischke. On Busemann subgradient methods for stochastic minimization in Hadamard spaces, 2026. Preprint,https://arxiv.org/abs/2602.08127

  36. [36]

    Convergence guarantees for stochastic algorithms solving non-unique problems in metric spaces

    N. Pischke and T. Powell. Convergence guarantees for stochastic algorithms solving non-unique problems in metric spaces, 2026. Preprint,https://arxiv.org/abs/2605.06129

  37. [37]

    Rockafellar

    R.T. Rockafellar. Convex integral functionals and duality. In E.H. Zarantonello, editor,Contributions to Nonlinear Functional Analysis, pages 215–236. Academic Press, New York, 1971

  38. [38]

    Rockafellar

    R.T. Rockafellar. Monotone operators and the proximal point algorithm.SIAM Journal of Control and Optimization, 14:877–898, 1976

  39. [39]

    Ryu and S

    E.K. Ryu and S. Boyd. Stochastic Proximal Iteration: A Non-Asymptotic Improvement upon Stochastic Gradient Descent. working draft, accessed 2026,https://ernestryu.com/papers/spi.pdf

  40. [40]

    Williams.Probability with martingales

    D. Williams.Probability with martingales. Cambridge University Press, 1991

  41. [41]

    Zhang and S

    H. Zhang and S. Sra. First-order methods for geodesically convex optimization. In V. Feldman, A. Rakhlin, and O. Shamir, editors,Proceedings of the 29th Annual Conference on Learning Theory (COLT), volume 49 ofProceedings of Machine Learning Research, pages 1617–1638. PMLR, 2016