Dynamic time series clustering via volatility change-points

Nick Whiteley

arxiv: 1906.10372 · v1 · pith:2UZQIRLLnew · submitted 2019-06-25 · 📊 stat.ME · q-fin.ST

Dynamic time series clustering via volatility change-points

Nick Whiteley This is my paper

Pith reviewed 2026-05-25 16:55 UTC · model grok-4.3

classification 📊 stat.ME q-fin.ST

keywords time series clusteringvolatility change-pointsdynamic clusteringposterior distributionsGARCH modelsS&P 500 returnsonline updatingfinancial time series

0 comments

The pith

Time series are clustered dynamically by comparing the timing of their most recent volatility shifts using a probability metric on posterior distributions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper outlines a clustering method for time series that share a volatility model with shifts at unobserved change-points. Series are grouped if a probability metric indicates their most recent volatility shifts were coincident or closely timed. The approach supports online updates to the groupings as new observations arrive and is demonstrated on daily returns from S&P 500 constituents. It relates the change-point model to classical GARCH specifications while accommodating stylized features of returns. A reader would care because the method identifies groups whose volatility behavior synchronizes around shared shift times without requiring fixed cluster assignments.

Core claim

Clustering is performed using a probability metric evaluated between posterior distributions of the most recent change-point associated with each series. This implies series are grouped together at a given time if there is evidence the most recent shifts in their respective volatilities were coincident or closely timed. The clustering method is dynamic, in that groupings may be updated in an online manner as data arrive.

What carries the argument

Probability metric between posterior distributions of the most recent change-point for each series, which groups series whose volatility shifts appear coincident.

If this is right

Series whose volatility shifts occurred at similar times are grouped together at each analysis point.
Groupings can be revised online as fresh data arrive without restarting the procedure.
The method applies directly to daily returns of S&P 500 constituents and accommodates features typical of financial returns.
The underlying model connects to GARCH specifications through its treatment of volatility dynamics.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could help track how market regimes propagate across assets by revealing synchronized volatility breaks.
It might extend to non-financial series that exhibit abrupt variance changes, such as sensor or climate data.
Sensitivity of the clusters to the choice of metric suggests testing multiple metrics on the same data to assess robustness.

Load-bearing premise

That a probability metric between posteriors of the most recent change-points produces stable and meaningful clusters, which depends on the volatility model, the prior on change-points, and the specific metric.

What would settle it

Finding that the clusters change substantially under small alterations to the metric or prior, or that they fail to align with documented market-wide volatility events in the S&P 500 data.

Figures

Figures reproduced from arXiv: 1906.10372 by Nick Whiteley.

**Figure 2.** Figure 2: Change-point model applied to AMZN. Posterior distributions over time of most recent change [PITH_FULL_IMAGE:figures/full_fig_p010_2.png] view at source ↗

**Figure 3.** Figure 3: Change-point model applied to AMZN. Blue plot shows adjusted daily closing log-returns [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗

**Figure 4.** Figure 4: Dissimilarity matrix for first 80 constituents of S&P 500 for [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗

**Figure 6.** Figure 6: Posterior distributions of time of most recent change-point as of 16/07/2009 for a cluster of [PITH_FULL_IMAGE:figures/full_fig_p014_6.png] view at source ↗

**Figure 7.** Figure 7: Posterior distributions of time of most recent change-point as of 16/07/2009 for a cluster of [PITH_FULL_IMAGE:figures/full_fig_p015_7.png] view at source ↗

**Figure 8.** Figure 8: Posterior distributions of time of most recent change-point as of 16/07/2009 for a cluster of [PITH_FULL_IMAGE:figures/full_fig_p015_8.png] view at source ↗

read the original abstract

This note outlines a method for clustering time series based on a statistical model in which volatility shifts at unobserved change-points. The model accommodates some classical stylized features of returns and its relation to GARCH is discussed. Clustering is performed using a probability metric evaluated between posterior distributions of the most recent change-point associated with each series. This implies series are grouped together at a given time if there is evidence the most recent shifts in their respective volatilities were coincident or closely timed. The clustering method is dynamic, in that groupings may be updated in an online manner as data arrive. Numerical results are given analyzing daily returns of constituents of the S&P 500.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Short note sketches volatility change-point clustering for time series but leaves the model, metric, and validation too thin to judge stability.

read the letter

The paper is a short methodological note that clusters financial time series by comparing posteriors on the timing of their most recent volatility shifts. Series get grouped when those shifts look close in time, and the grouping can be refreshed online as new returns arrive. It ties the setup to GARCH-style models for returns and reports some numerical work on S&P 500 constituents. That combination of change-point modeling with a posterior-based distance for dynamic clustering is the main concrete idea on offer. The online updating and the direct link to volatility regimes are the parts that could be useful in practice for grouping assets that experience regime changes together. The numerical example on real daily returns shows the method can be run on a moderate-sized panel, which is at least a start. The central weakness is that the note supplies no explicit likelihood, no form for the change-point prior, and no description of the probability metric used to compare posteriors. Without those pieces it is impossible to check how sensitive the clusters are to prior or model misspecification, exactly the issue the stress-test note flags. The results section is mentioned but no quantitative metrics, baselines, or stability checks appear in the provided text, so the evidence that the clusters are meaningful or stable remains thin. This is the kind of targeted tool that might interest people working on financial time-series clustering or regime detection, but it is not aimed at a broad methodological audience. A serious referee could usefully press for the missing model details and some validation numbers; the work is coherent enough on its own terms to merit that step rather than a desk reject.

Referee Report

3 major / 1 minor

Summary. The paper outlines a dynamic clustering method for time series in which each series follows a volatility model with unobserved change-points. Clustering proceeds by computing a probability metric between the posterior distributions of each series' most recent change-point; series are grouped when these posteriors indicate coincident or closely timed volatility shifts. The procedure is online, allowing clusters to update as new data arrive, and is illustrated on daily returns of S&P 500 constituents. The model is stated to accommodate classical stylized facts of returns and is related to GARCH.

Significance. If the central construction is shown to be robust, the method would supply a timing-based clustering criterion distinct from level- or correlation-based approaches, with potential utility in financial risk monitoring. The online character is a clear practical strength. No machine-checked proofs, parameter-free derivations, or reproducible code are reported.

major comments (3)

[Model and likelihood (abstract/introduction)] The volatility model, likelihood, and prior on change-points are described only at a high level (abstract and introduction) with no explicit equations; without these, the posterior p(τ_i | data_i) used for the clustering metric cannot be derived or checked for identifiability and concentration properties.
[Clustering procedure (abstract)] No explicit form is supplied for the probability metric between posteriors, nor any analysis of its sensitivity to the change-point prior or volatility specification; this choice is load-bearing for the claim that clusters reflect coincident volatility shifts.
[Numerical results] The numerical results on S&P 500 returns provide no validation metrics, sensitivity checks to prior/model choices, or comparisons against alternative clustering procedures, so the stability and interpretability of the reported groupings cannot be assessed.

minor comments (1)

[Abstract] The abstract refers to 'some classical stylized features' without enumerating them or indicating how they enter the model.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments. The manuscript is a concise note outlining the method at a high level, and we agree the presentation would benefit from additional explicit details and empirical checks. We will revise accordingly.

read point-by-point responses

Referee: [Model and likelihood (abstract/introduction)] The volatility model, likelihood, and prior on change-points are described only at a high level (abstract and introduction) with no explicit equations; without these, the posterior p(τ_i | data_i) used for the clustering metric cannot be derived or checked for identifiability and concentration properties.

Authors: We agree the current description is high-level. In revision we will add the explicit volatility model equations, likelihood, and prior on the change-points τ_i, together with a derivation of the posterior and brief discussion of identifiability and concentration. revision: yes
Referee: [Clustering procedure (abstract)] No explicit form is supplied for the probability metric between posteriors, nor any analysis of its sensitivity to the change-point prior or volatility specification; this choice is load-bearing for the claim that clusters reflect coincident volatility shifts.

Authors: We acknowledge the metric is central. The revised manuscript will state the precise probability metric (e.g., a chosen divergence between the posteriors of the most recent change-point) and include sensitivity checks to the change-point prior and volatility model specification. revision: yes
Referee: [Numerical results] The numerical results on S&P 500 returns provide no validation metrics, sensitivity checks to prior/model choices, or comparisons against alternative clustering procedures, so the stability and interpretability of the reported groupings cannot be assessed.

Authors: We will expand the numerical section to report validation metrics, perform sensitivity analyses to prior and model choices, and add comparisons with alternative procedures such as correlation-based or level-based clustering methods. revision: yes

Circularity Check

0 steps flagged

No circularity: clustering defined directly as modeling choice on change-point posteriors

full rationale

The abstract presents the clustering procedure as an explicit modeling decision: a probability metric is evaluated between posteriors of the most recent change-point for each series, with grouping following when recent volatility shifts appear coincident. No derivation chain, equations, or fitted quantities are shown that would reduce a claimed prediction to its own inputs by construction. No self-citations, uniqueness theorems, or ansatzes are invoked to justify the metric or the change-point model. The construction therefore remains a direct statistical modeling choice rather than a self-referential identity.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities; the volatility change-point model is described at a conceptual level only.

pith-pipeline@v0.9.0 · 5620 in / 1162 out tokens · 25644 ms · 2026-05-25T16:55:35.313978+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

25 extracted references · 25 canonical work pages · 2 internal anchors

[1]

Ryan Prescott Adams and David J.C. MacKay. Bayesian online changepoint detection. arXiv preprint arXiv:0710.3742, 2007

work page internal anchor Pith review Pith/arXiv arXiv 2007
[2]

Alonso, Jos \'e Ram \'o n Berrendero, Adolfo Hern \'a ndez, and Ana Justel

Andr \'e s M. Alonso, Jos \'e Ram \'o n Berrendero, Adolfo Hern \'a ndez, and Ana Justel. Time series clustering based on forecast densities. Computational Statistics & Data Analysis, 51 0 (2): 0 762--776, 2006

work page 2006
[3]

Davis, Jens-Peter Krei , and Thomas V

Torben Gustav Andersen, Richard A. Davis, Jens-Peter Krei , and Thomas V. Mikosch. Handbook of financial time series. Springer Science & Business Media, 2009

work page 2009
[4]

Clustering with bregman divergences

Arindam Banerjee, Srujana Merugu, Inderjit S Dhillon, and Joydeep Ghosh. Clustering with bregman divergences. Journal of Machine Learning Research, 6 0 (Oct): 0 1705--1749, 2005

work page 2005
[5]

Berndt and James Clifford

Donald J. Berndt and James Clifford. Using dynamic time warping to find patterns in time series. In Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, volume 10, pages 359--370. Seattle, WA, 1994

work page 1994
[6]

One-dimensional empirical measures, order statistics and kantorovich transport distances

Sergey Bobkov and Michel Ledoux. One-dimensional empirical measures, order statistics and kantorovich transport distances. preprint, 2016

work page 2016
[7]

Generalized autoregressive conditional heteroskedasticity

Tim Bollerslev. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics, 31 0 (3): 0 307--327, 1986

work page 1986
[8]

A conditionally heteroskedastic time series model for speculative prices and rates of return

Tim Bollerslev. A conditionally heteroskedastic time series model for speculative prices and rates of return. Review of economics and statistics, 69 0 (3): 0 542--547, 1987

work page 1987
[9]

Dynamic detection of change points in long time series

Nicolas Chopin. Dynamic detection of change points in long time series. Annals of the Institute of Statistical Mathematics, 59 0 (2): 0 349--366, 2007

work page 2007
[10]

Time series clustering and classification by the autoregressive metric

Marcella Corduas and Domenico Piccolo. Time series clustering and classification by the autoregressive metric. Computational statistics & data analysis, 52 0 (4): 0 1860--1872, 2008

work page 2008
[11]

On-line inference for multiple changepoint problems

Paul Fearnhead and Zhen Liu. On-line inference for multiple changepoint problems. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 69 0 (4): 0 589--605, 2007

work page 2007
[12]

Least squares quantization in PCM

Stuart Lloyd. Least squares quantization in PCM . IEEE transactions on information theory, 28 0 (2): 0 129--137, 1982

work page 1982
[13]

Some methods for classification and analysis of multivariate observations

James MacQueen. Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, volume 1, pages 281--297. Oakland, CA, USA, 1967

work page 1967
[14]

Mantegna

Rosario N. Mantegna. Hierarchical structure in financial markets. The European Physical Journal B-Condensed Matter and Complex Systems, 11 0 (1): 0 193--197, 1999

work page 1999
[15]

Clustering financial time series: how long is enough? In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pages 2583--2589

Gautier Marti, S \'e bastien Andler, Frank Nielsen, and Philippe Donnat. Clustering financial time series: how long is enough? In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pages 2583--2589. AAAI Press, 2016

work page 2016
[16]

A review of two decades of correlations, hierarchies, networks and clustering in financial markets

Gautier Marti, Frank Nielsen, Miko aj Bi \'n kowski, and Philippe Donnat. A review of two decades of correlations, hierarchies, networks and clustering in financial markets. arXiv preprint arXiv:1703.00485, 2017

work page arXiv 2017
[17]

Tsclust: An R package for time series clustering

Pablo Montero, Jos \'e A Vilar, et al. Tsclust: An R package for time series clustering. Journal of Statistical Software, 62 0 (1): 0 1--43, 2014

work page 2014
[18]

Kevin P. Murphy. Machine learning: a probabilistic perspective. MIT press, 2012

work page 2012
[19]

Scalable Bayesian Nonparametric Clustering and Classification

Yang Ni, Peter M \"u ller, Maurice Diesendruck, Sinead Williamson, Yitan Zhu, and Yuan Ji. Scalable bayesian nonparametric clustering and classification. arXiv preprint arXiv:1806.02670, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[20]

Clustering heteroskedastic time series by model-based procedures

Edoardo Otranto. Clustering heteroskedastic time series by model-based procedures. Computational Statistics & Data Analysis, 52 0 (10): 0 4685--4698, 2008

work page 2008
[21]

Identifying financial time series with similar dynamic conditional correlation

Edoardo Otranto. Identifying financial time series with similar dynamic conditional correlation. Computational Statistics & Data Analysis, 54 0 (1): 0 1--15, 2010

work page 2010
[22]

Computational optimal transport

Gabriel Peyr \'e and Marco Cuturi. Computational optimal transport. Foundations and Trends in Machine Learning, 11 0 (5-6): 0 355--607, 2019

work page 2019
[23]

Non-linear time series clustering based on non-parametric forecast densities

Jos \'e Antonio Vilar, Andr \'e s M Alonso, and Juan Manuel Vilar. Non-linear time series clustering based on non-parametric forecast densities. Computational Statistics & Data Analysis, 54 0 (11): 0 2850--2865, 2010

work page 2010
[24]

Bayesian computational methods for inference in multiple change-points problems

Nick Whiteley, Christophe Andrieu, and Arnaud Doucet. Bayesian computational methods for inference in multiple change-points problems. Technical report, University of Bristol, School of Mathematics, 2009. URL sites.google.com/view/nickwhiteley/

work page 2009
[25]

Fast discrete distribution clustering using wasserstein barycenter with sparse support

Jianbo Ye, Panruo Wu, James Z Wang, and Jia Li. Fast discrete distribution clustering using wasserstein barycenter with sparse support. IEEE Transactions on Signal Processing, 65 0 (9): 0 2317--2332, 2017

work page 2017

[1] [1]

Ryan Prescott Adams and David J.C. MacKay. Bayesian online changepoint detection. arXiv preprint arXiv:0710.3742, 2007

work page internal anchor Pith review Pith/arXiv arXiv 2007

[2] [2]

Alonso, Jos \'e Ram \'o n Berrendero, Adolfo Hern \'a ndez, and Ana Justel

Andr \'e s M. Alonso, Jos \'e Ram \'o n Berrendero, Adolfo Hern \'a ndez, and Ana Justel. Time series clustering based on forecast densities. Computational Statistics & Data Analysis, 51 0 (2): 0 762--776, 2006

work page 2006

[3] [3]

Davis, Jens-Peter Krei , and Thomas V

Torben Gustav Andersen, Richard A. Davis, Jens-Peter Krei , and Thomas V. Mikosch. Handbook of financial time series. Springer Science & Business Media, 2009

work page 2009

[4] [4]

Clustering with bregman divergences

Arindam Banerjee, Srujana Merugu, Inderjit S Dhillon, and Joydeep Ghosh. Clustering with bregman divergences. Journal of Machine Learning Research, 6 0 (Oct): 0 1705--1749, 2005

work page 2005

[5] [5]

Berndt and James Clifford

Donald J. Berndt and James Clifford. Using dynamic time warping to find patterns in time series. In Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, volume 10, pages 359--370. Seattle, WA, 1994

work page 1994

[6] [6]

One-dimensional empirical measures, order statistics and kantorovich transport distances

Sergey Bobkov and Michel Ledoux. One-dimensional empirical measures, order statistics and kantorovich transport distances. preprint, 2016

work page 2016

[7] [7]

Generalized autoregressive conditional heteroskedasticity

Tim Bollerslev. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics, 31 0 (3): 0 307--327, 1986

work page 1986

[8] [8]

A conditionally heteroskedastic time series model for speculative prices and rates of return

Tim Bollerslev. A conditionally heteroskedastic time series model for speculative prices and rates of return. Review of economics and statistics, 69 0 (3): 0 542--547, 1987

work page 1987

[9] [9]

Dynamic detection of change points in long time series

Nicolas Chopin. Dynamic detection of change points in long time series. Annals of the Institute of Statistical Mathematics, 59 0 (2): 0 349--366, 2007

work page 2007

[10] [10]

Time series clustering and classification by the autoregressive metric

Marcella Corduas and Domenico Piccolo. Time series clustering and classification by the autoregressive metric. Computational statistics & data analysis, 52 0 (4): 0 1860--1872, 2008

work page 2008

[11] [11]

On-line inference for multiple changepoint problems

Paul Fearnhead and Zhen Liu. On-line inference for multiple changepoint problems. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 69 0 (4): 0 589--605, 2007

work page 2007

[12] [12]

Least squares quantization in PCM

Stuart Lloyd. Least squares quantization in PCM . IEEE transactions on information theory, 28 0 (2): 0 129--137, 1982

work page 1982

[13] [13]

Some methods for classification and analysis of multivariate observations

James MacQueen. Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, volume 1, pages 281--297. Oakland, CA, USA, 1967

work page 1967

[14] [14]

Mantegna

Rosario N. Mantegna. Hierarchical structure in financial markets. The European Physical Journal B-Condensed Matter and Complex Systems, 11 0 (1): 0 193--197, 1999

work page 1999

[15] [15]

Clustering financial time series: how long is enough? In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pages 2583--2589

Gautier Marti, S \'e bastien Andler, Frank Nielsen, and Philippe Donnat. Clustering financial time series: how long is enough? In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pages 2583--2589. AAAI Press, 2016

work page 2016

[16] [16]

A review of two decades of correlations, hierarchies, networks and clustering in financial markets

Gautier Marti, Frank Nielsen, Miko aj Bi \'n kowski, and Philippe Donnat. A review of two decades of correlations, hierarchies, networks and clustering in financial markets. arXiv preprint arXiv:1703.00485, 2017

work page arXiv 2017

[17] [17]

Tsclust: An R package for time series clustering

Pablo Montero, Jos \'e A Vilar, et al. Tsclust: An R package for time series clustering. Journal of Statistical Software, 62 0 (1): 0 1--43, 2014

work page 2014

[18] [18]

Kevin P. Murphy. Machine learning: a probabilistic perspective. MIT press, 2012

work page 2012

[19] [19]

Scalable Bayesian Nonparametric Clustering and Classification

Yang Ni, Peter M \"u ller, Maurice Diesendruck, Sinead Williamson, Yitan Zhu, and Yuan Ji. Scalable bayesian nonparametric clustering and classification. arXiv preprint arXiv:1806.02670, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[20] [20]

Clustering heteroskedastic time series by model-based procedures

Edoardo Otranto. Clustering heteroskedastic time series by model-based procedures. Computational Statistics & Data Analysis, 52 0 (10): 0 4685--4698, 2008

work page 2008

[21] [21]

Identifying financial time series with similar dynamic conditional correlation

Edoardo Otranto. Identifying financial time series with similar dynamic conditional correlation. Computational Statistics & Data Analysis, 54 0 (1): 0 1--15, 2010

work page 2010

[22] [22]

Computational optimal transport

Gabriel Peyr \'e and Marco Cuturi. Computational optimal transport. Foundations and Trends in Machine Learning, 11 0 (5-6): 0 355--607, 2019

work page 2019

[23] [23]

Non-linear time series clustering based on non-parametric forecast densities

Jos \'e Antonio Vilar, Andr \'e s M Alonso, and Juan Manuel Vilar. Non-linear time series clustering based on non-parametric forecast densities. Computational Statistics & Data Analysis, 54 0 (11): 0 2850--2865, 2010

work page 2010

[24] [24]

Bayesian computational methods for inference in multiple change-points problems

Nick Whiteley, Christophe Andrieu, and Arnaud Doucet. Bayesian computational methods for inference in multiple change-points problems. Technical report, University of Bristol, School of Mathematics, 2009. URL sites.google.com/view/nickwhiteley/

work page 2009

[25] [25]

Fast discrete distribution clustering using wasserstein barycenter with sparse support

Jianbo Ye, Panruo Wu, James Z Wang, and Jia Li. Fast discrete distribution clustering using wasserstein barycenter with sparse support. IEEE Transactions on Signal Processing, 65 0 (9): 0 2317--2332, 2017

work page 2017