pith. sign in

arxiv: 2605.29839 · v1 · pith:HL7UIJKBnew · submitted 2026-05-28 · 🧮 math.ST · cs.IT· math.IT· physics.data-an· stat.ML· stat.TH

The Topological Stability Index: A Variance-Based Measure for Persistence Barcodes

Pith reviewed 2026-06-29 00:24 UTC · model grok-4.3

classification 🧮 math.ST cs.ITmath.ITphysics.data-anstat.MLstat.TH
keywords Topological Stability IndexPersistence BarcodesRényi EntropyCollision ProbabilityVariance MeasureTopological Data AnalysisNormalized IndexLifetime Dispersion
0
0 comments X

The pith

The normalized Topological Stability Index on persistence barcodes is an affine function of the collision probability and thus a monotone reparametrization of Rényi entropy of order two.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper defines the Topological Stability Index as a variance measure on the lifetimes in a persistence barcode, capturing absolute dispersion rather than only relative proportions. A scale-invariant version called cvTSI is shown to equal an affine transform of the sum of squared normalized lifetimes. This identity directly connects the variance-based index to the Rényi entropy of order two. The authors also introduce a first-moment companion called TSigI and derive scaling, translation-invariance, and update rules for the measures. Experiments indicate that the index responds to magnitude variations and stochastic fluctuations where entropy summaries stay flat.

Core claim

The normalized cvTSI is an affine function of the collision probability sum p_i squared, and therefore a monotone reparametrization of the Rényi entropy of order two, establishing an explicit algebraic bridge between variance-based and entropy-based summaries for persistence barcodes.

What carries the argument

cvTSI, the normalized coefficient of variation of persistence lifetimes, which satisfies an algebraic identity with the collision probability sum p_i squared.

If this is right

  • TSI remains sensitive to heterogeneous feature scales while persistent entropy does not.
  • Explicit update formulas allow efficient recomputation when bars are inserted or deleted.
  • TSigI supplies the typical lifetime scale as a complement to the dispersion measured by TSI.
  • The index detects stochastic fluctuations in time series that leave entropy unchanged.
  • Translation invariance under lifetime shifts follows directly from the variance definition.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The algebraic link suggests that existing Rényi-entropy algorithms could be reused to compute variance summaries on barcodes without separate code paths.
  • Hybrid descriptors that combine cvTSI with higher-order Rényi entropies might separate scale effects from distribution shape more cleanly than either family alone.
  • Because the relation is exact, any theoretical guarantee proved for one summary transfers immediately to the other after the affine reparametrization.

Load-bearing premise

Normalizing the lifetimes to form a probability distribution does not erase the absolute-scale information that the original variance measure is meant to retain.

What would settle it

A set of bar lengths for which the computed cvTSI differs numerically from one minus the sum of squared normalized lengths.

Figures

Figures reproduced from arXiv: 2605.29839 by Ioannis Diamantis, Joris Kirchner.

Figure 1
Figure 1. Figure 1: Two persistence barcodes with identical total persistence but different distri￾butions of lifetimes. The top barcode exhibits moderate variability, with one long bar and several shorter ones, whereas the bottom barcode shows a highly uneven distribution, concentrating persistence in a small number of longer bars while the remaining bars are significantly shorter. This highlights that the TSI measures dispe… view at source ↗
Figure 2
Figure 2. Figure 2: Extremal configurations for the TSI under fixed total persistence L and number of bars n. A barcode with equal bar lengths ℓi = L/n minimizes the TSI (left), while a barcode in which one bar has length L and the remaining n−1 bars have length zero (shown schematically) maximizes the TSI (right). 2.2. Behaviour under Scaling and Translation. We now examine how TSI behaves under simple transformations of the… view at source ↗
Figure 3
Figure 3. Figure 3: Effect of uniform scaling and death shifts on a barcode. Uniform scaling mul￾tiplies all lifetimes by the same factor and causes the TSI to scale quadratically, whereas shifting all death times by a constant leaves the TSI unchanged. 2.3. Addition and Removal of Bars. We now describe how the TSI changes under insertion or deletion of a bar. To do this, we need the following lemma. Lemma 2.2. Let B be a bar… view at source ↗
Figure 4
Figure 4. Figure 4: Effect of inserting a new bar. If the new bar length is close to the mean lifetime, the TSI may decrease; if it is sufficiently far from the mean, the TSI increases. This threshold behavior is quantified by the variance barrier in the preceding corollary. This sensitivity to the position of the inserted bar has an important consequence for stability. As a consequence, if ℓ → 0, TSI(B ∪ {[b, b + ℓ)}) → n B … view at source ↗
Figure 5
Figure 5. Figure 5: Non-continuity of the TSI under insertion of arbitrarily short bars. As the added bar length ℓ tends to zero, the Wasserstein distance to the original barcode vanishes, while the TSI converges to a value that in general differs from the original one. Lemma 2.4 (Bounds relative to the empty diagram). Let B = {[bi , di)} n B i=1 be a barcode. Then, for every 2 ≤ p ≤ ∞, one has (10) TSI(B) ≤ 4(n B) −2/p dWp (… view at source ↗
Figure 6
Figure 6. Figure 6: Two barcodes with the same normalized lifetime distribution (1/3, 2/3) and therefore the same persistent entropy, but different total scale. The right barcode is obtained by uniformly scaling the lifetimes of the left by a factor of two. Persistent entropy is invariant under such scaling, whereas the TSI scales quadratically. since x log x → 0 as x → 0 +. Thus, persistent entropy provides a scale-invariant… view at source ↗
Figure 7
Figure 7. Figure 7: Extremal behavior of the normalized TSI and R´enyi entropy of order two. Uniform lifetime distributions minimize cvTSI and maximize H2, whereas maximally con￾centrated lifetime distributions maximize cvTSI and minimize H2. Proposition 3.5. Let p = (p1, . . . , pnB ) be a probability vector of the form pi = 1 nB + εi , X i εi = 0, with ∥ε∥ sufficiently small. Then E(B) = log(n B) − n B − 1 2nB cvTSI(B) + O(… view at source ↗
Figure 8
Figure 8. Figure 8: Two disjoint circles with radii 1 and 1 4 . (A) Point cloud sampled with increasing density. (B) Persistence diagram showing two dominant H1 features corresponding to the two loops. (C) Evolution of TSI, persistent entropy, and cvTSI as a function of sample size. Intertwined circles: We next consider two intersecting circles, as illustrated in [PITH_FULL_IMAGE:figures/full_fig_p025_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Two intertwined circles of equal radius. (A) Point cloud. (B) Persistence diagram showing multiple H1 features arising from the overlapping structure. (C) Evolution of TSI, persistent entropy, and cvTSI as a function of sample size [PITH_FULL_IMAGE:figures/full_fig_p025_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Random sampling on intertwined circles. (A) A random realization of the sampled point cloud. (B) Corresponding persistence diagram, illustrating variability due to stochastic sampling. (C) Monte Carlo estimates (mean and variability) of TSI, persistent entropy, cvTSI and cvTSI n as functions of sample size based on 100 simulations per point. To conclude, this experiment highlights a fundamental distinctio… view at source ↗
Figure 11
Figure 11. Figure 11: Random sampling on intertwined circles using the Alpha complex. (A) Per￾sistence diagram corresponding to the point cloud of [PITH_FULL_IMAGE:figures/full_fig_p027_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Effect of noise on TSI, TSigI, entropy, and cvTSI. 4.3. Time series: geometric Brownian motion. We now consider synthetic time series generated by geometric Brownian motion (GBM). Using Takens embedding (the interested readers is referred to [19, 20]), we reconstruct a point cloud from each time series and compute the associated persistence diagrams (for an illustration see Figures 13 and 14). (a) Realiza… view at source ↗
Figure 13
Figure 13. Figure 13: (A) A simulated example of geometric Brownian motion with drift and volatil￾ity (µ, σ) = (0, 0.01) (B) 3 dimensional Takens embedding of the curve with delay parameter τ = 3. (C) Corresponding persistence diagram of Alpha filtration [PITH_FULL_IMAGE:figures/full_fig_p028_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: TSI, persistent entropy, and cvTSI as functions of volatility (A) and drift (B). Each value is averaged over 100 Monte Carlo simulations. The persistence diagrams of GBM exhibit a large number of short-lived features, reflecting the absence of strong underlying topological structure. This is consistent with the stochastic and memoryless nature of the process. To assess sensitivity to model parameters, we … view at source ↗
read the original abstract

We introduce the \emph{Topological Stability Index} (TSI), a variance-based scalar measure for persistence barcodes that quantifies the dispersion of persistence lifetimes. Unlike persistent entropy, which depends only on normalized weights, the TSI captures absolute variability and is sensitive to heterogeneous feature scales. We establish fundamental properties of the TSI, including its scaling behavior, invariance under lifetime translation and explicit update formulas under insertion and deletion of bars. We also consider a complementary first-moment-type quantity, the Topological Signal Index (TSigI), which captures the typical scale of persistence lifetimes and provides additional interpretability alongside the TSI. We further introduce a normalized version, $cv\text{TSI}$, which is scale invariant and admits an explicit algebraic relation to the R\'enyi entropy of order two. In particular, $cv\text{TSI}$ is an affine function of the collision probability $\sum_i p_i^2$, and therefore a monotone reparametrization of the R\'enyi entropy, providing a direct link between variance-based and entropy-based summaries in topological data analysis. Numerical experiments on synthetic data and stochastic time series demonstrate that the TSI captures structural variability complementary to entropy: it is relatively insensitive to deterministic trends, while responding strongly to stochastic fluctuations and variations in persistence magnitude.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 2 minor

Summary. The paper introduces the Topological Stability Index (TSI) as a variance-based scalar summary of persistence lifetimes in barcodes, claiming to establish its scaling behavior, translation invariance, and explicit update formulas under bar insertion/deletion. It also defines a complementary first-moment index (TSigI) and a normalized scale-invariant variant cvTSI, asserting that cvTSI is an affine function of the collision probability ∑p_i² and hence a monotone reparametrization of the Rényi entropy of order two. Numerical experiments on unspecified synthetic data and stochastic time series are presented to argue that TSI captures structural variability complementary to persistent entropy.

Significance. The explicit algebraic identity relating cvTSI to the collision probability (and thus to Rényi-2 entropy) is a direct, parameter-free link between variance-based and entropy-based topological summaries; this is a genuine strength if the supporting derivations are supplied. The claimed invariance and update formulas, if rigorously established, would make TSI practically useful for tracking topological changes in streaming or time-series data where absolute scale matters. The normalization step for cvTSI is intentional and does not undermine the entropy relation, contrary to the stress-test concern.

minor comments (2)
  1. The abstract asserts that fundamental properties, scaling behavior, invariance, and explicit update formulas are established, yet the provided text gives no indication of where the derivations appear (e.g., which section or proposition).
  2. Experimental claims rest on 'synthetic data and stochastic time series' whose generation procedures, barcode extraction methods, and quantitative comparison metrics are not described in the visible text.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of the manuscript, the accurate summary of its contributions, and the recommendation of minor revision. The referee correctly identifies the algebraic link between cvTSI and the Rényi-2 collision probability as a key strength, along with the invariance and update formulas. No major comments appear in the report.

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper's central algebraic claim—that cvTSI equals an affine transformation of the collision probability ∑p_i²—follows immediately from the definition of cvTSI as the squared coefficient of variation of the lifetime vector after normalization (var(l)/mean(l)² = n∑p_i² − 1). This is an explicit identity derived from the variance formula and the change of variables p_i = l_i/S; it does not rely on fitting, self-citation, or any prior result that itself depends on the target relation. No other load-bearing steps in the provided abstract or description reduce by construction to their inputs. The derivation remains self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The construction applies standard definitions of variance and normalized probabilities to barcode lifetimes; no free parameters, ad-hoc axioms, or new entities are introduced in the abstract.

axioms (1)
  • domain assumption Persistence lifetimes admit a discrete probability distribution after normalization to unit sum.
    Required for both the variance definition of TSI and the algebraic relation to collision probability.

pith-pipeline@v0.9.1-grok · 5774 in / 1263 out tokens · 29040 ms · 2026-06-29T00:24:57.700795+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

20 extracted references · 13 canonical work pages

  1. [1]

    A Survey of Vectorization Methods in Topological Data Analysis

    Dashti Ali et al. “A Survey of Vectorization Methods in Topological Data Analysis”. In:IEEE Trans- actions on Pattern Analysis and Machine Intelligence45.12 (Dec. 2023), pp. 14069–14080.issn: 1939-3539.doi:10.1109/TPAMI.2023.3308391.url:https://ieeexplore.ieee.org/abstract/ document/10235748

  2. [2]

    DTM-Based Filtrations

    Hirokazu Anai et al. “DTM-Based Filtrations”. en. In:Topological Data Analysis. Ed. by Nils A. Baas et al. Cham: Springer International Publishing, 2020, pp. 33–66.isbn: 978-3-030-43408-3.doi: 10.1007/978-3-030-43408-3_2

  3. [3]

    Persistent entropy for separating topo- logical features from noise in vietoris-rips complexes

    Nieves Atienza, Rocio Gonzalez-Diaz, and Matteo Rucco. “Persistent entropy for separating topo- logical features from noise in vietoris-rips complexes”. en. In:Journal of Intelligent Information Systems52.3 (June 2019), pp. 637–655.issn: 1573-7675.doi:10.1007/s10844-017-0473-4.url: https://doi.org/10.1007/s10844-017-0473-4

  4. [4]

    On the stability of persistent entropy and new summary functions for topological data analysis

    Nieves Atienza, Rocio Gonzalez-D´ ıaz, and Manuel Soriano-Trigueros. “On the stability of persistent entropy and new summary functions for topological data analysis”. In:Pattern Recognition107 (Nov. 2020), p. 107509.issn: 0031-3203.doi:10 . 1016 / j . patcog . 2020 . 107509.url:https : //www.sciencedirect.com/science/article/pii/S0031320320303125

  5. [5]

    A Better Bound on the Variance

    Rajendra Bhatia and Chandler Davis. “A Better Bound on the Variance”. In:The American Math- ematical Monthly107.4 (2000), pp. 353–357.issn: 0002-9890.doi:10.2307/2589180.url:https: //www.jstor.org/stable/2589180

  6. [6]

    Statistical Topological Data Analysis using Persistence Landscapes

    Peter Bubenik. “Statistical Topological Data Analysis using Persistence Landscapes”. In:Journal of Machine Learning Research16.3 (2015), pp. 77–102.issn: 1533-7928.url:http://jmlr.org/ papers/v16/bubenik15a.html

  7. [7]

    Robust Topological Inference: Distance To a Measure and Kernel Distance

    Fr´ ed´ eric Chazal et al. “Robust Topological Inference: Distance To a Measure and Kernel Distance”. In:Journal of Machine Learning Research18.159 (2018), pp. 1–40.issn: 1533-7928.url:http : //jmlr.org/papers/v18/15-484.html

  8. [8]

    Persistence Curves: A canonical framework for summarizing persistence diagrams

    Yu-Min Chung and Austin Lawson. “Persistence Curves: A canonical framework for summarizing persistence diagrams”. en. In:Advances in Computational Mathematics48.1 (Jan. 2022), p. 6.issn: 1572-9044.doi:10.1007/s10444-021-09893-4.url:https://doi.org/10.1007/s10444- 021- 09893-4

  9. [9]

    Stability of Persistence Diagrams

    David Cohen-Steiner, Herbert Edelsbrunner, and John Harer. “Stability of Persistence Diagrams”. In:Discrete & Computational Geometry37.1 (2007), pp. 103–120.issn: 0179-5376.doi:10.1007/ s00454-006-1276-5.url:https://doi.org/10.1007/s00454-006-1276-5

  10. [10]

    The Shape of Data: Topology Meets Analytics A Practical Introduction to Topological Analytics and the Stability Index (TSI) in Business

    Ioannis Diamantis. “The Shape of Data: Topology Meets Analytics A Practical Introduction to Topological Analytics and the Stability Index (TSI) in Business”. en. In: (2025). Preprint. arXiv: 2511.13503 [stat.ML]

  11. [11]

    Confidence sets for persistence diagrams

    Brittany Terese Fasy et al. “Confidence sets for persistence diagrams”. In:The Annals of Statistics 42.6 (Dec. 2014), pp. 2301–2339.issn: 0090-5364, 2168-8966.doi:10.1214/14-AOS1252.url:https: //projecteuclid.org/journals/annals- of- statistics/volume- 42/issue- 6/Confidence- sets-for-persistence-diagrams/10.1214/14-AOS1252.full

  12. [12]

    Limit theorems for persistence diagrams

    Yasuaki Hiraoka, Tomoyuki Shirai, and Khanh Duy Trinh. “Limit theorems for persistence diagrams”. In:The Annals of Applied Probability28.5 (Oct. 2018), pp. 2740–2780.issn: 1050-5164, 2168-8737. doi:10.1214/17-AAP1371.url:https://projecteuclid.org/journals/annals-of-applied- probability/volume-28/issue-5/Limit-theorems-for-persistence-diagrams/10.1214/17- ...

  13. [13]

    1987, pp

    D.H Lehmer. “On the compounding of certain means”. In:Journal of Mathematical Analysis and Applications36.1 (1971), pp. 183–200.issn: 0022-247X.doi:https://doi.org/10.1016/0022- 247X(71)90029-1.url:https://www.sciencedirect.com/science/article/pii/0022247X71900291. REFERENCES 31

  14. [15]

    Probability measures on the space of persistence diagrams

    Yuriy Mileyko, Sayan Mukherjee, and John Harer. “Probability measures on the space of persistence diagrams”. en. In:Inverse Problems27.12 (Nov. 2011), p. 124007.issn: 0266-5611.doi:10.1088/ 0266-5611/27/12/124007.url:https://doi.org/10.1088/0266-5611/27/12/124007

  15. [16]

    On measures of entropy and information

    Alfr´ ed R´ enyi. “On measures of entropy and information”. In:Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics. Berkeley, CA: University of California Press, 1961, pp. 547–561

  16. [17]

    Hypothesis testing for topological data analysis

    Andrew Robinson and Katharine Turner. “Hypothesis testing for topological data analysis”. en. In: Journal of Applied and Computational Topology1.2 (Dec. 2017), pp. 241–261.issn: 2367-1734.doi: 10.1007/s41468-017-0008-7.url:https://doi.org/10.1007/s41468-017-0008-7

  17. [18]

    A Piecewise Rotation of the Circle, IPR Maps and Their Connection with Translation Surfaces

    Matteo Rucco et al. “Characterisation of the Idiotypic Immune Network Through Persistent En- tropy”. en. In:Proceedings of ECCS 2014. Ed. by Stefano Battiston et al. Cham: Springer Interna- tional Publishing, 2016, pp. 117–128.isbn: 978-3-319-29228-1.doi:10.1007/978- 3- 319- 29228- 1_11

  18. [19]

    Walter de Gruyter GmbH & Co KG, 2014

    Ren´ e L Schilling and Lothar Partzsch.Brownian motion: an introduction to stochastic processes. Walter de Gruyter GmbH & Co KG, 2014

  19. [20]

    Topological Analysis of Recurrent Systems

    Primoz Skraba, Vin de Silva, and Mikael Vejdemo-Johansson. “Topological Analysis of Recurrent Systems”. eng. In:NIPS 2012 Workshop on Algebraic Topology and Machine Learning. 2012, pp. 1– 5.url:https://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-107210

  20. [21]

    Computing Persistent Homology

    Afra Zomorodian and Gunnar Carlsson. “Computing Persistent Homology”. en. In:Discrete & Com- putational Geometry33.2 (Feb. 2005), pp. 249–274.issn: 1432-0444.doi:10.1007/s00454- 004- 1146-y.url:https://doi.org/10.1007/s00454-004-1146-y. Department of Data Analytics and Digitalisation, Maastricht University, School of Business and Eco- nomics, P.O. Box 616...