On the diameter of subgradient sequences in o-minimal structures
Pith reviewed 2026-05-18 00:09 UTC · model grok-4.3
The pith
Subgradient sequences for locally Lipschitz o-minimal functions have diameters controlled by function value variation plus double sums of step sizes.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We show that the diameter of any subgradient sequence is related to the variation in function values, with error terms dominated by a double summation of step sizes. Consequently, we prove that bounded subgradient sequences converge if the step sizes are of order 1/k. The proof uses Lipschitz L-regular stratifications in o-minimal structures to analyze subgradient sequences via their projections onto different strata.
What carries the argument
Lipschitz L-regular stratifications that decompose the domain and allow projection of the sequence onto individual strata for local analysis.
If this is right
- Bounded subgradient sequences converge when the step sizes decrease as order 1/k.
- The diameter of the sequence remains controlled by the net change in function values along the sequence.
- Error contributions to the diameter bound are governed by double summations of the step sizes.
Where Pith is reading between the lines
- The diameter bound may supply a template for proving convergence of related first-order methods under the same definability hypothesis.
- Relaxing the polynomially bounded condition while retaining o-minimality could be tested by constructing explicit examples outside the current setting.
Load-bearing premise
The functions are locally Lipschitz and definable in a polynomially bounded o-minimal structure so that the required stratifications exist.
What would settle it
A locally Lipschitz function definable in a polynomially bounded o-minimal structure together with a bounded subgradient sequence using steps of size 1/k that fails to converge would refute the convergence claim.
Figures
read the original abstract
We study subgradient sequences of locally Lipschitz functions definable in a polynomially bounded o-minimal structure. We show that the diameter of any subgradient sequence is related to the variation in function values, with error terms dominated by a double summation of step sizes. Consequently, we prove that bounded subgradient sequences converge if the step sizes are of order $1/k$. The proof uses Lipschitz $L$-regular stratifications in o-minimal structures to analyze subgradient sequences via their projections onto different strata.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript studies subgradient sequences for locally Lipschitz functions definable in polynomially bounded o-minimal structures. It claims that the diameter of any such sequence is controlled by the variation in function values plus error terms dominated by a double summation of step sizes. The proof proceeds by invoking Lipschitz L-regular stratifications to project the sequence onto strata and transfer the bound; as a consequence, bounded sequences are shown to converge when the step sizes are of order 1/k.
Significance. If the diameter bound and its transfer via projections hold, the work supplies a geometric convergence tool for nonsmooth subgradient methods in the definable setting, extending classical results beyond convex or smooth cases. The explicit reliance on o-minimal stratifications is a methodological strength that could support further results on tame optimization.
major comments (2)
- [Proof of the main theorem (stratification decomposition)] The projection argument (abstract and proof of the main diameter theorem): the original g_k lies in ∂f(x_k), but after projection onto a stratum S the sequence satisfies a subgradient relation only for the restriction f|S. It is not shown that the normal components are absorbed into the double-sum error term without additional constants that may accumulate across the finite number of strata transitions; this step is load-bearing for the claimed diameter bound.
- [Derivation of the diameter bound] Error-term domination (statement following the diameter relation): the claim that projection-induced errors are dominated by the double summation of step sizes requires an explicit estimate showing uniformity with respect to the Lipschitz constants of the strata and the number of transitions; without this, the subsequent convergence statement for 1/k steps may fail to hold uniformly.
minor comments (2)
- [Abstract] The abstract and introduction could state the precise polynomial boundedness assumption on the o-minimal structure earlier, as it is used to guarantee the existence of the Lipschitz L-regular stratification.
- [Main theorem statement] Notation for the double summation of step sizes is introduced late; defining it explicitly in the statement of the main theorem would improve readability.
Simulated Author's Rebuttal
We thank the referee for their thorough review and insightful comments on our manuscript. We address each major comment below and have made revisions to strengthen the proof as suggested.
read point-by-point responses
-
Referee: [Proof of the main theorem (stratification decomposition)] The projection argument (abstract and proof of the main diameter theorem): the original g_k lies in ∂f(x_k), but after projection onto a stratum S the sequence satisfies a subgradient relation only for the restriction f|S. It is not shown that the normal components are absorbed into the double-sum error term without additional constants that may accumulate across the finite number of strata transitions; this step is load-bearing for the claimed diameter bound.
Authors: We appreciate this observation. The proof relies on the properties of Lipschitz L-regular stratifications in polynomially bounded o-minimal structures, where the finite stratification ensures only finitely many transitions occur. The normal components are controlled by the local Lipschitz constant of f, and due to the definability, these constants are uniform across the strata. We have added a new lemma in the revised manuscript that explicitly bounds the accumulation of these constants by a factor depending only on the number of strata and the global Lipschitz constant, which is then absorbed into the double-sum error term without affecting the overall bound. revision: yes
-
Referee: [Derivation of the diameter bound] Error-term domination (statement following the diameter relation): the claim that projection-induced errors are dominated by the double summation of step sizes requires an explicit estimate showing uniformity with respect to the Lipschitz constants of the strata and the number of transitions; without this, the subsequent convergence statement for 1/k steps may fail to hold uniformly.
Authors: The referee correctly identifies the need for uniformity in the error estimate. In the original manuscript, the domination is argued using the o-minimal tameness, but we agree that an explicit estimate enhances clarity. We have included in the revision a detailed calculation showing that the projection errors are bounded by K * sum_{i=1}^k sum_{j=1}^i alpha_i alpha_j, where K is a constant depending on the maximum Lipschitz constant over the finite strata and the maximum number of transitions (which is bounded by the number of strata). This ensures the convergence for step sizes of order 1/k holds uniformly. revision: yes
Circularity Check
No circularity: derivation relies on external o-minimal stratifications
full rationale
The paper derives the diameter bound for subgradient sequences and the consequent convergence for 1/k step sizes by invoking Lipschitz L-regular stratifications of definable functions in polynomially bounded o-minimal structures. These stratifications and their projection properties are standard external results from o-minimal geometry, not defined, fitted, or assumed within the paper. The analysis decomposes the sequence across strata, relates diameter to function-value variation plus a double sum of step sizes, and transfers the subgradient relation via the restriction to each stratum; none of these steps reduce by the paper's own equations to a self-definition, a fitted input renamed as prediction, or a load-bearing self-citation. The central claims therefore remain independent of the paper's inputs and are supported by external mathematical facts.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption The functions under study are locally Lipschitz and definable in a polynomially bounded o-minimal structure.
- domain assumption Lipschitz L-regular stratifications exist and can be used to analyze the sequences via projections onto strata.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We show that the diameter of any subgradient sequence is related to the variation in function values, with error terms dominated by a double summation of step sizes.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 1 Pith paper
-
On convergence rates of subgradient descent on semialgebraic functions
Under Lipschitz stratification assumptions that hold automatically for semialgebraic functions, constant-step subgradient descent achieves explicit rates that improve with fewer strata and recover smooth-case rates up...
Reference graph
Works this paper leans on
-
[1]
Proximal random reshuffling under local lipschitz continuity.arXiv preprint arXiv:2408.07182,
[JLL24] Cedric Josz, Lexiao Lai, and Xiaopeng Li. Proximal random reshuffling under local lipschitz continuity.arXiv preprint arXiv:2408.07182,
- [2]
-
[3]
Sur les trajectoires du gradient d’une fonction analytique.Seminari di geometria, 1983:115–117,
[Ło82] Stanislaw Łojasiewicz. Sur les trajectoires du gradient d’une fonction analytique.Seminari di geometria, 1983:115–117,
work page 1983
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.