Clustering piecewise stationary processes
Pith reviewed 2026-05-25 15:28 UTC · model grok-4.3
The pith
Algorithms exist that consistently cluster time series from piecewise stationary ergodic processes by matching their sets of stationary distributions.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central discovery is that consistent clustering of piecewise stationary processes is possible with computationally efficient algorithms that require no assumptions beyond piecewise stationarity and ergodicity, where consistency is defined by equality of the sets of stationary distributions across the piecewise segments.
What carries the argument
The notion of consistency defined by whether the piecewise stationary distributions share the same set of stationary distributions, which allows the algorithms to ignore the change points and focus on the stationary components.
Load-bearing premise
The samples come from piecewise stationary ergodic processes and that correctly identifying shared stationary distribution sets is the appropriate goal for clustering.
What would settle it
A dataset of time series from two piecewise stationary processes that have different sets of stationary distributions but where the proposed algorithms group them together.
read the original abstract
The problem of time-series clustering is considered in the case where each data-point is a sample generated by a piecewise stationary ergodic process. Stationary processes are perhaps the most general class of processes considered in non-parametric statistics and allow for arbitrary long-range dependence between variables. Piecewise stationary processes studied here for the first time in the context of clustering, relax the last remaining assumption in this model: stationarity. A natural formulation is proposed for this problem and a notion of consistency is introduced which requires the samples to be placed in the same cluster if and only if the piecewise stationary distributions that generate them have the same set of stationary distributions. Simple, computationally efficient algorithms are proposed and are shown to be consistent without any additional assumptions beyond piecewise stationarity.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper formulates the clustering problem for time series generated by piecewise stationary ergodic processes, introduces a consistency notion (samples clustered together iff the processes share identical sets of stationary distributions), and proposes simple computationally efficient algorithms claimed to achieve this consistency under no assumptions beyond piecewise stationarity and ergodicity.
Significance. If the consistency holds under only the stated assumptions, the result would meaningfully extend nonparametric time-series clustering by dropping stationarity while retaining ergodicity (hence allowing long-range dependence). The computational simplicity of the algorithms is a positive feature.
major comments (1)
- [Abstract / main consistency theorem] Abstract (and the consistency theorem): the claim of consistency 'without any additional assumptions beyond piecewise stationarity' is load-bearing for the central contribution. Standard change-point results show that consistent segmentation (necessary to recover the set of stationary distributions from a single path) fails when stationary segments have lengths o(n) or when distinct regimes have arbitrarily small total-variation distance; both regimes are permitted by the stated assumptions. The proof must either add the missing conditions or exhibit an argument that evades them.
minor comments (1)
- [Section 2 (problem formulation)] The precise definition of 'same set of stationary distributions' should be stated formally early in the paper, including how ties or measure-zero differences are handled.
Simulated Author's Rebuttal
We thank the referee for the careful reading and for identifying a key point regarding the strength of our consistency claim. We address the concern directly below.
read point-by-point responses
-
Referee: [Abstract / main consistency theorem] Abstract (and the consistency theorem): the claim of consistency 'without any additional assumptions beyond piecewise stationarity' is load-bearing for the central contribution. Standard change-point results show that consistent segmentation (necessary to recover the set of stationary distributions from a single path) fails when stationary segments have lengths o(n) or when distinct regimes have arbitrarily small total-variation distance; both regimes are permitted by the stated assumptions. The proof must either add the missing conditions or exhibit an argument that evades them.
Authors: The referee correctly notes that standard change-point detection requires conditions on minimum segment length and regime separation for consistency, and that our stated assumptions permit regimes where these fail. Our original proof sketch attempted to evade explicit segmentation by working directly with a clustering criterion based on pairwise distances between empirical occupation measures of entire samples; however, upon closer inspection we agree that recovering the exact set of stationary distributions (as required by the consistency definition) implicitly requires being able to distinguish the regimes, which cannot be guaranteed without further conditions. We will therefore revise the manuscript by (i) adding explicit assumptions that each stationary segment has length at least c n for some c>0 and that distinct regimes are separated by a fixed total-variation distance delta>0, (ii) updating the statement of the main consistency theorem and the abstract accordingly, and (iii) adding a brief discussion of why these conditions are necessary and how they relate to the literature on change-point detection. These changes constitute a major revision but preserve the core algorithmic contribution and the nonparametric flavor of the result. revision: yes
Circularity Check
New problem formulation and consistency definition; derivation self-contained
full rationale
The paper introduces a novel clustering task for piecewise stationary ergodic processes and defines consistency directly in terms of matching sets of stationary distributions. No quoted steps reduce a claimed prediction or uniqueness result to a fitted parameter, self-citation chain, or ansatz smuggled from prior work. The central consistency claim is presented as a theorem under the stated model class; any self-citations (if present) are not shown to be load-bearing for the main result. This matches the default expectation of no significant circularity.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption The generating processes are ergodic.
- domain assumption Piecewise stationarity is sufficient to define a meaningful clustering target via the set of stationary distributions.
Reference graph
Works this paper leans on
- [1]
-
[2]
R. Gray. Probability, Random Processes, and Ergodic Properties. Springer V erlag, 1988
work page 1988
- [3]
-
[4]
I. Katsavounidis, C.-C. J. Kuo, and Z. Zhang. A new initia lization technique for generalized Lloyd iteration. IEEE Signal Processing Letters, 1:144–146, 1994
work page 1994
-
[5]
A. Khaleghi and D. Ryabko. Locating changes in highly dep endent data with unknown number of change points. In Proceedings of the 25th International Conference on Neural Information Processing Systems, pages 3095–3103, 2012
work page 2012
-
[6]
A. Khaleghi and D. Ryabko. Asymptotically consistent es timation of the number of change points in highly dependent time series. In Proceedings of the 31st International Conference on Machine Learning, volume 32, pages 539–547, 22–24 Jun 2014
work page 2014
-
[7]
A. Khaleghi and D. Ryabko. Nonparametric multiple chang e point estimation in highly dependent time series. Theoretical Computer Science, 620:119–133, 2016
work page 2016
-
[8]
A. Khaleghi, D. Ryabko, J. Mary, and P . Preux. Consistent algorithms for clustering time series. The Journal of Machine Learning Research , 17(1):94–125, 2016
work page 2016
-
[9]
D. Ryabko. Clustering processes. In Proceedings of the 27th International Conference on Machin e Learning, pages 919–926, 2010
work page 2010
-
[10]
D. Ryabko. Discrimination between B-processes is impo ssible. Journal of Theoretical Probability , 23(2):565–575, 2010
work page 2010
-
[11]
D. Ryabko. Asymptotic Nonparametric Statistical Analysis of Station ary Time Series. Springer, 2019
work page 2019
-
[12]
P . Shields. The Ergodic Theory of Discrete Sample Paths . AMS Bookstore, 1996
work page 1996
-
[13]
F. M. Willems. Coding for a binary independent piecewis e-identically-distributed source. IEEE Trans- actions on Information Theory , 42(6):2210–2217, 1996. 12
work page 1996
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.