pith. sign in

arxiv: 2501.02817 · v4 · submitted 2025-01-06 · 🧮 math.AT · math.ST· stat.ML· stat.TH

A Stable Measure of Similarity for Time Series using Persistent Homology

Pith reviewed 2026-05-23 06:20 UTC · model grok-4.3

classification 🧮 math.AT math.STstat.MLstat.TH
keywords persistent homologytime series similaritybi-conditional periodicity scorestabilityembedding dimensiondimension reductionpercent determinismclimate data
0
0 comments X

The pith

The bi-conditional periodicity score from persistent homology gives a stable similarity measure for pairs of time series.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper defines a new score(f1,f2) that uses persistent homology on embeddings of two time series to quantify their similarity through shared periodic structure. It proves that this score changes only slightly when the series or their frequencies receive small perturbations, and that the score converges once the embedding dimension reaches a sufficient minimum value. The authors further establish that the score remains stable after projection onto the leading principal components, provided those components retain most of the variance. Experiments on synthetic series and climate records show the score varies less than percent determinism and needs only one parameter instead of four.

Core claim

The authors define the bi-conditional periodicity score score(f1,f2) via persistent homology applied to embeddings of two time series. They prove its stability under small perturbations of the series values or frequencies, establish existence of a minimum embedding dimension for convergence of the score, and prove stability of the score under dimension reduction to the first K principal components whenever those components capture a majority of the variance under orthogonal projection. An algorithm is given whose complexity is O(N log N + P K^2 + P^6), and experiments confirm greater stability than percent determinism on both synthetic and real climate data while requiring only one parameter

What carries the argument

The bi-conditional periodicity score score(f1,f2), obtained from the persistent homology of point clouds formed by delay embeddings of the two time series.

If this is right

  • Small perturbations in a time series or its frequency produce only small changes in score(f1,f2).
  • The score converges once the embedding dimension exceeds a minimum value whose existence is guaranteed.
  • Orthogonal projection onto the first K principal components changes the score only slightly whenever those components retain most variance.
  • The algorithm computes the score in time O(N log N + P K^2 + P^6).
  • The score requires one parameter and exhibits greater stability than percent determinism on synthetic series and climate data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The convergence result supplies a concrete rule for selecting embedding dimensions in applications.
  • The dimension-reduction stability suggests the score can be used on high-dimensional embeddings without immediate loss of reliability.
  • The single-parameter requirement may simplify comparison pipelines that currently tune four parameters for percent determinism.
  • The same stability properties could be tested on irregularly sampled or multivariate series by adapting the embedding step.

Load-bearing premise

The chosen embeddings must let persistent homology detect the periodic features that determine similarity, and the retained principal components must hold enough of the variance.

What would settle it

Two periodic time series and a small perturbation of their values or frequencies such that the resulting change in score(f1,f2) remains large even after the embedding dimension exceeds the claimed minimum and the first K principal components capture most variance.

read the original abstract

Persistent homology, the study of holes that appear in data as one thickens balls centered around its points over time, has theoretically guaranteed stability. That is, small data perturbations guarantee small changes in the lifetimes of these holes. This stability has been used to construct a measure of periodicity for a single univariate time series, denoted score(f1). One popular measure of similarity between two time series is percent determinism (%DET), which measures the correlation between two time-series embeddings. We introduce a novel persistent-homology based measure of time-series similarity which we denote the bi-conditional periodicity score, score(f1,f2). We prove the stability of our measure under small time series and frequency perturbations, as well as the existence of a minimum embedding dimension for the convergence of our score. Our latter result implies that larger embedding dimensions may be necessary to reach desired levels of convergence. Since pairwise distances between points in these larger dimensions may start to concentrate, we also prove the stability of our measure under dimension reduction which guarantees that as long as the first K principal components capture a majority of the variance under orthogonal projection, the score will undergo small changes. We next introduce an algorithm for computing the bi-conditional periodicity score and deduce its computational complexity as O(N log N + PK^2 + P^6) for N the number of time series points, P the number of embedding points, and K the number of principal components. We experimentally verify the greater stability of our measure in comparison with %DET on both synthetic time series as well as real climate data. As well, score(f1,f2) requires only one parameter for its computation while %DET requires four.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces the bi-conditional periodicity score score(f1,f2), a persistent-homology-based measure of similarity between two time series. It claims to prove stability of the measure under small perturbations to the time series and frequencies, the existence of a minimum embedding dimension guaranteeing convergence of the score, and stability under PCA-based dimension reduction provided the first K principal components capture a majority of the variance. An algorithm is presented with complexity O(N log N + P K^2 + P^6), and experiments on synthetic and real climate data are said to demonstrate greater stability than percent determinism (%DET) while requiring only one parameter.

Significance. If the stability and convergence claims hold with the necessary quantitative controls, the work would supply a topologically grounded pairwise similarity measure with explicit stability guarantees and a reduced parameter count relative to %DET. The explicit complexity bound and the attempt to mitigate the curse of dimensionality via PCA are constructive elements.

major comments (2)
  1. [Abstract] Abstract (stability under dimension reduction): the statement that 'as long as the first K principal components capture a majority of the variance under orthogonal projection, the score will undergo small changes' supplies no quantitative relation between the retained variance fraction and either the bottleneck distance on the relevant persistence diagrams or the change in the specific form of score(f1,f2). This is load-bearing because the embedding-dimension convergence result is invoked precisely to justify larger dimensions, after which PCA is applied.
  2. [Abstract] Abstract (existence of minimum embedding dimension and its composition with PCA stability): the existence result presupposes that a Takens-type embedding eventually isolates the periodic orbits in low-dimensional homology, yet no explicit link is given between that dimension and the variance retained after projection onto the first K PCs. Consequently the two stability statements do not compose to control the score in the reduced space when the periodic signal is distributed across many components.
minor comments (1)
  1. [Experimental verification] The experimental comparisons mention greater stability versus %DET but omit quantitative details on controls, effect sizes, and the precise procedure used to select the single free parameter of score(f1,f2).

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and for identifying points where the abstract could more precisely reflect the quantitative content of the proofs. We address each major comment below and indicate the revisions we will make.

read point-by-point responses
  1. Referee: [Abstract] Abstract (stability under dimension reduction): the statement that 'as long as the first K principal components capture a majority of the variance under orthogonal projection, the score will undergo small changes' supplies no quantitative relation between the retained variance fraction and either the bottleneck distance on the relevant persistence diagrams or the change in the specific form of score(f1,f2). This is load-bearing because the embedding-dimension convergence result is invoked precisely to justify larger dimensions, after which PCA is applied.

    Authors: The full proof of PCA stability (Theorem 4.4) bounds the change in score(f1,f2) by a constant times the operator norm of the orthogonal projection error onto the orthogonal complement of the first K components. Because the retained-variance fraction directly controls this operator norm (via the sum of the discarded eigenvalues), the bound is quantitative once the Lipschitz constants from the earlier stability theorems are fixed. The abstract statement is therefore a high-level summary rather than a complete quantitative claim. We will revise the abstract to state explicitly that the change is controlled by C·(1−retained variance fraction), where C depends only on the stability constants already established for perturbations of the time series and frequencies. revision: yes

  2. Referee: [Abstract] Abstract (existence of minimum embedding dimension and its composition with PCA stability): the existence result presupposes that a Takens-type embedding eventually isolates the periodic orbits in low-dimensional homology, yet no explicit link is given between that dimension and the variance retained after projection onto the first K PCs. Consequently the two stability statements do not compose to control the score in the reduced space when the periodic signal is distributed across many components.

    Authors: Theorem 3.2 establishes a finite minimum embedding dimension d* such that for all d≥d* the bi-conditional score converges to its value on the underlying periodic orbits; this d* depends only on the intrinsic dimension of the attractor and is independent of any subsequent linear projection. Theorem 4.4 then applies to the already-embedded point cloud in R^d and controls the effect of any orthogonal projection whose retained-variance fraction is large. The composition is therefore valid for any choice of K that achieves a prescribed retained-variance threshold in the embedded space; when the periodic signal is spread across coordinates one simply selects a correspondingly larger K. We will add a short clarifying paragraph after Theorem 4.4 that makes this order of quantifiers explicit and notes that the required K is determined empirically from the singular values of the embedded data matrix. revision: yes

Circularity Check

0 steps flagged

No circularity; derivations rely on external PH stability theorems

full rationale

The bi-conditional periodicity score is introduced as a new construction from persistent homology diagrams. Stability under perturbations, embedding dimension convergence, and PCA projection are stated as theorems whose proofs invoke standard external results on bottleneck distance and Takens-type embeddings rather than reducing to the paper's own fitted quantities or self-citations. The variance-majority condition for projection stability is an explicit hypothesis, not a definitional loop. No load-bearing self-citation chains or ansatz smuggling appear in the provided derivation steps.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The work rests on the standard stability theorem of persistent homology and on the assumption that time-series embeddings preserve periodic structure; no free parameters or invented entities are introduced in the abstract.

axioms (1)
  • standard math Persistent homology is stable under small perturbations of the input point cloud
    Invoked as the foundation for all stability claims.

pith-pipeline@v0.9.0 · 5836 in / 1025 out tokens · 43351 ms · 2026-05-23T06:20:51.237611+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

29 extracted references · 29 canonical work pages · 1 internal anchor

  1. [1]

    Mathematics10(22) (2022) https://doi.org/10.3390/math10224278

    Kovaˇ cevi´ c, A.B., Nina, A., Popovi´ c, L.ˇ c., Radovanovi´ c, M.: Two-dimensional correlation analysis of periodicity in noisy series: Case of vlf signal amplitude variations in the time vicinity of an earthquake. Mathematics10(22) (2022) https://doi.org/10.3390/math10224278

  2. [2]

    Journal of Neuroscience Methods156(1), 322–332 (2006) https://doi.org/10.1016/j.jneumeth.2006.02.013

    Zhan, Y., Halliday, D., Jiang, P., Liu, X., Feng, J.: Detecting time-dependent coherence between non-stationary electrophysiological signals: A combined sta- tistical and time-frequency approach. Journal of Neuroscience Methods156(1), 322–332 (2006) https://doi.org/10.1016/j.jneumeth.2006.02.013

  3. [3]

    Neurophysiologie Clinique/Clinical Neurophysiology32(3), 157–174 (2002) https://doi.org/10.1016/S0987-7053(02) 00301-5

    Lachaux, J.-P., Lutz, A., Rudrauf, D., Cosmelli, D., Quyen, M.L.V., Martinerie, J., Varela, F.: Estimating the time-course of coherence between single-trial brain signals: An introduction to wavelet coherence. Neurophysiologie Clinique/Clinical Neurophysiology32(3), 157–174 (2002) https://doi.org/10.1016/S0987-7053(02) 00301-5

  4. [4]

    Gondwana Research128, 69–85 (2024) https://doi.org/10.1016/j.gr.2023.10.014 26

    Zhang, W., Bakhsh, S., Ali, K., Anas, M.: Fostering environmental sustain- ability: An analysis of green investment and digital financial inclusion in china using quantile-on-quantile regression and wavelet coherence approach. Gondwana Research128, 69–85 (2024) https://doi.org/10.1016/j.gr.2023.10.014 26

  5. [5]

    Behaviour Research and Therapy178, 104572 (2024) https://doi.org/10.1016/j.brat.2024.104572

    Beurs, D., Giltay, E.J., Nuij, C., O’Connor, R., Winter, R.F.P., Kerkhof, A., Ballegooijen, W., Riper, H.: Symptoms of a feather flock together? an exploratory secondary dynamic time warp analysis of 11 single case time series of suicidal ideation and related symptoms. Behaviour Research and Therapy178, 104572 (2024) https://doi.org/10.1016/j.brat.2024.104572

  6. [6]

    Bipolar Disorders26, 44–57 (2024) https://doi.org/ 10.1111/bdi.13340

    Mesbah, R., Koenders, M.A., Spijker, A.T., Leeuw, M., Hemert, A.M., Giltay, E.J.: Dynamic time warp analysis of individual symptom trajectories in individ- uals with bipolar disorder. Bipolar Disorders26, 44–57 (2024) https://doi.org/ 10.1111/bdi.13340

  7. [7]

    Expert Systems with Applications238, 122229 (2024) https://doi

    Wei, Z., Gao, Y., Zhang, X., Li, X., Han, Z.: Adaptive marine traffic behaviour pattern recognition based on multidimensional dynamic time warping and dbscan algorithm. Expert Systems with Applications238, 122229 (2024) https://doi. org/10.1016/j.eswa.2023.122229

  8. [8]

    The Quan- titative Methods for Psychology20, 137–155 (2024) https://doi.org/10.20982/ tqmp.20.2.p137

    Duong, S., Davis, T.J., Bachman, H.J., Votruba-Drzal, E., Libertus, M.E.: Dynamic structures of parent-child number talk: An application of categorical cross-recurrence quantification analysis and companion to duong et al. The Quan- titative Methods for Psychology20, 137–155 (2024) https://doi.org/10.20982/ tqmp.20.2.p137

  9. [9]

    Experimental Brain Research242, 355–365 (2024) https://doi

    Riehm, C.D., Bonnette, S., Rush, J.L., Diekfuss, J.A., Koohestani, M., Myer, G.D., Norte, G.E., Sherman, D.A.: Corticomuscular cross-recurrence analysis reveals between-limb differences in motor control among individuals with acl reconstruction. Experimental Brain Research242, 355–365 (2024) https://doi. org/10.1007/s00221-023-06751-1

  10. [10]

    Physics Reports438(5), 237–329 (2007) https: //doi.org/10.1016/j.physrep.2006.11.001

    Marwan, N., Romano, M.C., Thiel, M., Kurths, J.: Recurrence plots for the analysis of complex systems. Physics Reports438(5), 237–329 (2007) https: //doi.org/10.1016/j.physrep.2006.11.001

  11. [11]

    SpringerBriefs in Mathematics

    Chazal, F., Silva, V., Glisse, M., Oudot, S.: The Structure and Stability of Persistence Modules, 1st edn. SpringerBriefs in Mathematics. Springer, ??? (2016)

  12. [12]

    Geometriae Dedicata173, 193–214 (2014) https://doi.org/10.1007/ s10711-013-9937-z

    Chazal, F., Silva, V., Oudot, S.: Persistence stability for geometric com- plexes. Geometriae Dedicata173, 193–214 (2014) https://doi.org/10.1007/ s10711-013-9937-z

  13. [13]

    Sliding Windows and Persistence: An Application of Topological Methods to Signal Analysis

    Perea, J., Harer, J.: Sliding windows and persistence: An application of topological methods to signal analysis. Foundations of Computational Mathematics15, 799– 838 (2015) https://doi.org/10.1007/s10208-014-9206-z 1307.6188

  14. [14]

    BMC Bioinformatics16(1), 257 (2015) https://doi.org/10.1186/ 27 s12859-015-0645-6

    Perea, J.A., Deckard, A., Haase, S.B., Harer, J.: Sw1pers: Sliding windows and 1-persistence scoring; discovering periodicity in gene expression time series data. BMC Bioinformatics16(1), 257 (2015) https://doi.org/10.1186/ 27 s12859-015-0645-6

  15. [15]

    SIAM Journal on Imaging Sciences11(2), 1049–1077 (2018) https:// doi.org/10.1137/17M1150736

    Tralie, C.J., Perea, J.A.: (quasi)periodicity quantification in video data, using topology. SIAM Journal on Imaging Sciences11(2), 1049–1077 (2018) https:// doi.org/10.1137/17M1150736

  16. [16]

    Pattern Recognition Letters 133, 137–143 (2020) https://doi.org/10.1016/j.patrec.2020.02.022

    Tymochko, S., Munch, E., Dunion, J., Corbosiero, K., Torn, R.: Using persistent homology to quantify a diurnal cycle in hurricanes. Pattern Recognition Letters 133, 137–143 (2020) https://doi.org/10.1016/j.patrec.2020.02.022

  17. [17]

    In: Rand, D., Young, L.-S

    Takens, F.: Detecting strange attractors in turbulence. In: Rand, D., Young, L.-S. (eds.) Dynamical Systems and Turbulence, Warwick 1980, pp. 366–381. Springer, Berlin, Heidelberg (1981)

  18. [18]

    Graduate Studies in Mathematics, vol

    Burago, D., Burago, Y., Ivanov, S.: A Course in Metric Geometry. Graduate Studies in Mathematics, vol. 33. American Mathematical Society, ??? (2001)

  19. [19]

    https://arxiv.org/abs/2309.16648

    Adams, H., Frick, F., Majhi, S., McBride, N.: Hausdorff vs gromov-hausdorff distances (2024). https://arxiv.org/abs/2309.16648

  20. [20]

    arXiv:2307.16333

    Koyama, M.A., Memoli, F., Robins, V., Turner, K.: Faster computation of degree-1 persistent homology using the reduced Vietoris-Rips filtration. arXiv:2307.16333. https://arxiv.org/abs/2307.16333 (2024)

  21. [21]

    In: Bussche, J.V., Vianu, V

    Aggarwal, C.C., Hinneburg, A., Keim, D.A.: On the surprising behavior of dis- tance metrics in high dimensional space. In: Bussche, J.V., Vianu, V. (eds.) Database Theory — ICDT 2001, pp. 420–434. Springer, Berlin, Heidelberg (2001)

  22. [22]

    Jolliffe, I.T.: Principal Component Analysis vol. 89. Springer, ??? (2002)

  23. [23]

    La Matematica, 1–23 (2024) https://doi.org/10.1007/s44007-024-00130-0

    May, N.H., Krishnamoorthy, B., Gambill, P.: A normalized bottleneck distance on persistence diagrams and homology preservation under dimension reduction. La Matematica, 1–23 (2024) https://doi.org/10.1007/s44007-024-00130-0

  24. [24]

    Available as part of GeoDa: An Introduction to Spatial Data Science (2020)

    Anselin, L.: Dimension Reduction Methods (2): Distance Preserving Methods. Available as part of GeoDa: An Introduction to Spatial Data Science (2020). https://geodacenter.github.io/workbook/7ab mds/lab7ab.html

  25. [25]

    Dyer, S.A., Dyer, J.S.: Cubic-spline interpolation. 1. IEEE Instrumentation & Measurement Magazine4(1), 44–46 (2001) https://doi.org/10.1109/5289.911175

  26. [26]

    IEEE Spectrum4(12), 63–70 (1967) https://doi.org/10.1109/MSPEC.1967.5217220

    Brigham, E.O., Morrow, R.E.: The fast Fourier transform. IEEE Spectrum4(12), 63–70 (1967) https://doi.org/10.1109/MSPEC.1967.5217220

  27. [27]

    General Time Complexity of PCA

    Yi˘ git, H.: Time Complexity of PCA 1. General Time Complexity of PCA. Posted on ResearchGate (2024). https://doi.org/10.13140/RG.2.2.12847.34728

  28. [28]

    Comaniciu and P

    Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space 28 analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence24(5), 603–619 (2002) https://doi.org/10.1109/34.1000236

  29. [29]

    Public Safety (WADEPS), W.S.D.E.: What is CAD data? https://wadeps.org/ the-data/cad-data/ 29