A Stable Measure of Similarity for Time Series using Persistent Homology
Pith reviewed 2026-05-23 06:20 UTC · model grok-4.3
The pith
The bi-conditional periodicity score from persistent homology gives a stable similarity measure for pairs of time series.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors define the bi-conditional periodicity score score(f1,f2) via persistent homology applied to embeddings of two time series. They prove its stability under small perturbations of the series values or frequencies, establish existence of a minimum embedding dimension for convergence of the score, and prove stability of the score under dimension reduction to the first K principal components whenever those components capture a majority of the variance under orthogonal projection. An algorithm is given whose complexity is O(N log N + P K^2 + P^6), and experiments confirm greater stability than percent determinism on both synthetic and real climate data while requiring only one parameter
What carries the argument
The bi-conditional periodicity score score(f1,f2), obtained from the persistent homology of point clouds formed by delay embeddings of the two time series.
If this is right
- Small perturbations in a time series or its frequency produce only small changes in score(f1,f2).
- The score converges once the embedding dimension exceeds a minimum value whose existence is guaranteed.
- Orthogonal projection onto the first K principal components changes the score only slightly whenever those components retain most variance.
- The algorithm computes the score in time O(N log N + P K^2 + P^6).
- The score requires one parameter and exhibits greater stability than percent determinism on synthetic series and climate data.
Where Pith is reading between the lines
- The convergence result supplies a concrete rule for selecting embedding dimensions in applications.
- The dimension-reduction stability suggests the score can be used on high-dimensional embeddings without immediate loss of reliability.
- The single-parameter requirement may simplify comparison pipelines that currently tune four parameters for percent determinism.
- The same stability properties could be tested on irregularly sampled or multivariate series by adapting the embedding step.
Load-bearing premise
The chosen embeddings must let persistent homology detect the periodic features that determine similarity, and the retained principal components must hold enough of the variance.
What would settle it
Two periodic time series and a small perturbation of their values or frequencies such that the resulting change in score(f1,f2) remains large even after the embedding dimension exceeds the claimed minimum and the first K principal components capture most variance.
read the original abstract
Persistent homology, the study of holes that appear in data as one thickens balls centered around its points over time, has theoretically guaranteed stability. That is, small data perturbations guarantee small changes in the lifetimes of these holes. This stability has been used to construct a measure of periodicity for a single univariate time series, denoted score(f1). One popular measure of similarity between two time series is percent determinism (%DET), which measures the correlation between two time-series embeddings. We introduce a novel persistent-homology based measure of time-series similarity which we denote the bi-conditional periodicity score, score(f1,f2). We prove the stability of our measure under small time series and frequency perturbations, as well as the existence of a minimum embedding dimension for the convergence of our score. Our latter result implies that larger embedding dimensions may be necessary to reach desired levels of convergence. Since pairwise distances between points in these larger dimensions may start to concentrate, we also prove the stability of our measure under dimension reduction which guarantees that as long as the first K principal components capture a majority of the variance under orthogonal projection, the score will undergo small changes. We next introduce an algorithm for computing the bi-conditional periodicity score and deduce its computational complexity as O(N log N + PK^2 + P^6) for N the number of time series points, P the number of embedding points, and K the number of principal components. We experimentally verify the greater stability of our measure in comparison with %DET on both synthetic time series as well as real climate data. As well, score(f1,f2) requires only one parameter for its computation while %DET requires four.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces the bi-conditional periodicity score score(f1,f2), a persistent-homology-based measure of similarity between two time series. It claims to prove stability of the measure under small perturbations to the time series and frequencies, the existence of a minimum embedding dimension guaranteeing convergence of the score, and stability under PCA-based dimension reduction provided the first K principal components capture a majority of the variance. An algorithm is presented with complexity O(N log N + P K^2 + P^6), and experiments on synthetic and real climate data are said to demonstrate greater stability than percent determinism (%DET) while requiring only one parameter.
Significance. If the stability and convergence claims hold with the necessary quantitative controls, the work would supply a topologically grounded pairwise similarity measure with explicit stability guarantees and a reduced parameter count relative to %DET. The explicit complexity bound and the attempt to mitigate the curse of dimensionality via PCA are constructive elements.
major comments (2)
- [Abstract] Abstract (stability under dimension reduction): the statement that 'as long as the first K principal components capture a majority of the variance under orthogonal projection, the score will undergo small changes' supplies no quantitative relation between the retained variance fraction and either the bottleneck distance on the relevant persistence diagrams or the change in the specific form of score(f1,f2). This is load-bearing because the embedding-dimension convergence result is invoked precisely to justify larger dimensions, after which PCA is applied.
- [Abstract] Abstract (existence of minimum embedding dimension and its composition with PCA stability): the existence result presupposes that a Takens-type embedding eventually isolates the periodic orbits in low-dimensional homology, yet no explicit link is given between that dimension and the variance retained after projection onto the first K PCs. Consequently the two stability statements do not compose to control the score in the reduced space when the periodic signal is distributed across many components.
minor comments (1)
- [Experimental verification] The experimental comparisons mention greater stability versus %DET but omit quantitative details on controls, effect sizes, and the precise procedure used to select the single free parameter of score(f1,f2).
Simulated Author's Rebuttal
We thank the referee for the careful reading and for identifying points where the abstract could more precisely reflect the quantitative content of the proofs. We address each major comment below and indicate the revisions we will make.
read point-by-point responses
-
Referee: [Abstract] Abstract (stability under dimension reduction): the statement that 'as long as the first K principal components capture a majority of the variance under orthogonal projection, the score will undergo small changes' supplies no quantitative relation between the retained variance fraction and either the bottleneck distance on the relevant persistence diagrams or the change in the specific form of score(f1,f2). This is load-bearing because the embedding-dimension convergence result is invoked precisely to justify larger dimensions, after which PCA is applied.
Authors: The full proof of PCA stability (Theorem 4.4) bounds the change in score(f1,f2) by a constant times the operator norm of the orthogonal projection error onto the orthogonal complement of the first K components. Because the retained-variance fraction directly controls this operator norm (via the sum of the discarded eigenvalues), the bound is quantitative once the Lipschitz constants from the earlier stability theorems are fixed. The abstract statement is therefore a high-level summary rather than a complete quantitative claim. We will revise the abstract to state explicitly that the change is controlled by C·(1−retained variance fraction), where C depends only on the stability constants already established for perturbations of the time series and frequencies. revision: yes
-
Referee: [Abstract] Abstract (existence of minimum embedding dimension and its composition with PCA stability): the existence result presupposes that a Takens-type embedding eventually isolates the periodic orbits in low-dimensional homology, yet no explicit link is given between that dimension and the variance retained after projection onto the first K PCs. Consequently the two stability statements do not compose to control the score in the reduced space when the periodic signal is distributed across many components.
Authors: Theorem 3.2 establishes a finite minimum embedding dimension d* such that for all d≥d* the bi-conditional score converges to its value on the underlying periodic orbits; this d* depends only on the intrinsic dimension of the attractor and is independent of any subsequent linear projection. Theorem 4.4 then applies to the already-embedded point cloud in R^d and controls the effect of any orthogonal projection whose retained-variance fraction is large. The composition is therefore valid for any choice of K that achieves a prescribed retained-variance threshold in the embedded space; when the periodic signal is spread across coordinates one simply selects a correspondingly larger K. We will add a short clarifying paragraph after Theorem 4.4 that makes this order of quantifiers explicit and notes that the required K is determined empirically from the singular values of the embedded data matrix. revision: yes
Circularity Check
No circularity; derivations rely on external PH stability theorems
full rationale
The bi-conditional periodicity score is introduced as a new construction from persistent homology diagrams. Stability under perturbations, embedding dimension convergence, and PCA projection are stated as theorems whose proofs invoke standard external results on bottleneck distance and Takens-type embeddings rather than reducing to the paper's own fitted quantities or self-citations. The variance-majority condition for projection stability is an explicit hypothesis, not a definitional loop. No load-bearing self-citation chains or ansatz smuggling appear in the provided derivation steps.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Persistent homology is stable under small perturbations of the input point cloud
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We prove the stability of our measure under small time series and frequency perturbations, as well as the existence of a minimum embedding dimension for the convergence of our score... stability under dimension reduction... first K principal components capture a majority of the variance
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
score(f1|f2) = mp(dgm1(SWM,τ f1|2(T))) / √3
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Mathematics10(22) (2022) https://doi.org/10.3390/math10224278
Kovaˇ cevi´ c, A.B., Nina, A., Popovi´ c, L.ˇ c., Radovanovi´ c, M.: Two-dimensional correlation analysis of periodicity in noisy series: Case of vlf signal amplitude variations in the time vicinity of an earthquake. Mathematics10(22) (2022) https://doi.org/10.3390/math10224278
-
[2]
Journal of Neuroscience Methods156(1), 322–332 (2006) https://doi.org/10.1016/j.jneumeth.2006.02.013
Zhan, Y., Halliday, D., Jiang, P., Liu, X., Feng, J.: Detecting time-dependent coherence between non-stationary electrophysiological signals: A combined sta- tistical and time-frequency approach. Journal of Neuroscience Methods156(1), 322–332 (2006) https://doi.org/10.1016/j.jneumeth.2006.02.013
-
[3]
Lachaux, J.-P., Lutz, A., Rudrauf, D., Cosmelli, D., Quyen, M.L.V., Martinerie, J., Varela, F.: Estimating the time-course of coherence between single-trial brain signals: An introduction to wavelet coherence. Neurophysiologie Clinique/Clinical Neurophysiology32(3), 157–174 (2002) https://doi.org/10.1016/S0987-7053(02) 00301-5
-
[4]
Gondwana Research128, 69–85 (2024) https://doi.org/10.1016/j.gr.2023.10.014 26
Zhang, W., Bakhsh, S., Ali, K., Anas, M.: Fostering environmental sustain- ability: An analysis of green investment and digital financial inclusion in china using quantile-on-quantile regression and wavelet coherence approach. Gondwana Research128, 69–85 (2024) https://doi.org/10.1016/j.gr.2023.10.014 26
-
[5]
Behaviour Research and Therapy178, 104572 (2024) https://doi.org/10.1016/j.brat.2024.104572
Beurs, D., Giltay, E.J., Nuij, C., O’Connor, R., Winter, R.F.P., Kerkhof, A., Ballegooijen, W., Riper, H.: Symptoms of a feather flock together? an exploratory secondary dynamic time warp analysis of 11 single case time series of suicidal ideation and related symptoms. Behaviour Research and Therapy178, 104572 (2024) https://doi.org/10.1016/j.brat.2024.104572
-
[6]
Bipolar Disorders26, 44–57 (2024) https://doi.org/ 10.1111/bdi.13340
Mesbah, R., Koenders, M.A., Spijker, A.T., Leeuw, M., Hemert, A.M., Giltay, E.J.: Dynamic time warp analysis of individual symptom trajectories in individ- uals with bipolar disorder. Bipolar Disorders26, 44–57 (2024) https://doi.org/ 10.1111/bdi.13340
-
[7]
Expert Systems with Applications238, 122229 (2024) https://doi
Wei, Z., Gao, Y., Zhang, X., Li, X., Han, Z.: Adaptive marine traffic behaviour pattern recognition based on multidimensional dynamic time warping and dbscan algorithm. Expert Systems with Applications238, 122229 (2024) https://doi. org/10.1016/j.eswa.2023.122229
-
[8]
The Quan- titative Methods for Psychology20, 137–155 (2024) https://doi.org/10.20982/ tqmp.20.2.p137
Duong, S., Davis, T.J., Bachman, H.J., Votruba-Drzal, E., Libertus, M.E.: Dynamic structures of parent-child number talk: An application of categorical cross-recurrence quantification analysis and companion to duong et al. The Quan- titative Methods for Psychology20, 137–155 (2024) https://doi.org/10.20982/ tqmp.20.2.p137
work page 2024
-
[9]
Experimental Brain Research242, 355–365 (2024) https://doi
Riehm, C.D., Bonnette, S., Rush, J.L., Diekfuss, J.A., Koohestani, M., Myer, G.D., Norte, G.E., Sherman, D.A.: Corticomuscular cross-recurrence analysis reveals between-limb differences in motor control among individuals with acl reconstruction. Experimental Brain Research242, 355–365 (2024) https://doi. org/10.1007/s00221-023-06751-1
-
[10]
Physics Reports438(5), 237–329 (2007) https: //doi.org/10.1016/j.physrep.2006.11.001
Marwan, N., Romano, M.C., Thiel, M., Kurths, J.: Recurrence plots for the analysis of complex systems. Physics Reports438(5), 237–329 (2007) https: //doi.org/10.1016/j.physrep.2006.11.001
-
[11]
Chazal, F., Silva, V., Glisse, M., Oudot, S.: The Structure and Stability of Persistence Modules, 1st edn. SpringerBriefs in Mathematics. Springer, ??? (2016)
work page 2016
-
[12]
Geometriae Dedicata173, 193–214 (2014) https://doi.org/10.1007/ s10711-013-9937-z
Chazal, F., Silva, V., Oudot, S.: Persistence stability for geometric com- plexes. Geometriae Dedicata173, 193–214 (2014) https://doi.org/10.1007/ s10711-013-9937-z
work page 2014
-
[13]
Sliding Windows and Persistence: An Application of Topological Methods to Signal Analysis
Perea, J., Harer, J.: Sliding windows and persistence: An application of topological methods to signal analysis. Foundations of Computational Mathematics15, 799– 838 (2015) https://doi.org/10.1007/s10208-014-9206-z 1307.6188
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1007/s10208-014-9206-z 2015
-
[14]
BMC Bioinformatics16(1), 257 (2015) https://doi.org/10.1186/ 27 s12859-015-0645-6
Perea, J.A., Deckard, A., Haase, S.B., Harer, J.: Sw1pers: Sliding windows and 1-persistence scoring; discovering periodicity in gene expression time series data. BMC Bioinformatics16(1), 257 (2015) https://doi.org/10.1186/ 27 s12859-015-0645-6
work page 2015
-
[15]
SIAM Journal on Imaging Sciences11(2), 1049–1077 (2018) https:// doi.org/10.1137/17M1150736
Tralie, C.J., Perea, J.A.: (quasi)periodicity quantification in video data, using topology. SIAM Journal on Imaging Sciences11(2), 1049–1077 (2018) https:// doi.org/10.1137/17M1150736
-
[16]
Pattern Recognition Letters 133, 137–143 (2020) https://doi.org/10.1016/j.patrec.2020.02.022
Tymochko, S., Munch, E., Dunion, J., Corbosiero, K., Torn, R.: Using persistent homology to quantify a diurnal cycle in hurricanes. Pattern Recognition Letters 133, 137–143 (2020) https://doi.org/10.1016/j.patrec.2020.02.022
-
[17]
Takens, F.: Detecting strange attractors in turbulence. In: Rand, D., Young, L.-S. (eds.) Dynamical Systems and Turbulence, Warwick 1980, pp. 366–381. Springer, Berlin, Heidelberg (1981)
work page 1980
-
[18]
Graduate Studies in Mathematics, vol
Burago, D., Burago, Y., Ivanov, S.: A Course in Metric Geometry. Graduate Studies in Mathematics, vol. 33. American Mathematical Society, ??? (2001)
work page 2001
-
[19]
https://arxiv.org/abs/2309.16648
Adams, H., Frick, F., Majhi, S., McBride, N.: Hausdorff vs gromov-hausdorff distances (2024). https://arxiv.org/abs/2309.16648
-
[20]
Koyama, M.A., Memoli, F., Robins, V., Turner, K.: Faster computation of degree-1 persistent homology using the reduced Vietoris-Rips filtration. arXiv:2307.16333. https://arxiv.org/abs/2307.16333 (2024)
-
[21]
Aggarwal, C.C., Hinneburg, A., Keim, D.A.: On the surprising behavior of dis- tance metrics in high dimensional space. In: Bussche, J.V., Vianu, V. (eds.) Database Theory — ICDT 2001, pp. 420–434. Springer, Berlin, Heidelberg (2001)
work page 2001
-
[22]
Jolliffe, I.T.: Principal Component Analysis vol. 89. Springer, ??? (2002)
work page 2002
-
[23]
La Matematica, 1–23 (2024) https://doi.org/10.1007/s44007-024-00130-0
May, N.H., Krishnamoorthy, B., Gambill, P.: A normalized bottleneck distance on persistence diagrams and homology preservation under dimension reduction. La Matematica, 1–23 (2024) https://doi.org/10.1007/s44007-024-00130-0
-
[24]
Available as part of GeoDa: An Introduction to Spatial Data Science (2020)
Anselin, L.: Dimension Reduction Methods (2): Distance Preserving Methods. Available as part of GeoDa: An Introduction to Spatial Data Science (2020). https://geodacenter.github.io/workbook/7ab mds/lab7ab.html
work page 2020
-
[25]
Dyer, S.A., Dyer, J.S.: Cubic-spline interpolation. 1. IEEE Instrumentation & Measurement Magazine4(1), 44–46 (2001) https://doi.org/10.1109/5289.911175
-
[26]
IEEE Spectrum4(12), 63–70 (1967) https://doi.org/10.1109/MSPEC.1967.5217220
Brigham, E.O., Morrow, R.E.: The fast Fourier transform. IEEE Spectrum4(12), 63–70 (1967) https://doi.org/10.1109/MSPEC.1967.5217220
-
[27]
General Time Complexity of PCA
Yi˘ git, H.: Time Complexity of PCA 1. General Time Complexity of PCA. Posted on ResearchGate (2024). https://doi.org/10.13140/RG.2.2.12847.34728
-
[28]
Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space 28 analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence24(5), 603–619 (2002) https://doi.org/10.1109/34.1000236
-
[29]
Public Safety (WADEPS), W.S.D.E.: What is CAD data? https://wadeps.org/ the-data/cad-data/ 29
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.