Clustering Financial Time Series: How Long is Enough?
read the original abstract
Researchers have used from 30 days to several years of daily returns as source data for clustering financial time series based on their correlations. This paper sets up a statistical framework to study the validity of such practices. We first show that clustering correlated random variables from their observed values is statistically consistent. Then, we also give a first empirical answer to the much debated question: How long should the time series be? If too short, the clusters found can be spurious; if too long, dynamics can be smoothed out.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
TSseek: Regular Expression-Based Similarity Search for Distributed Time Series Datasets
TSseek approximates time series as line segments and regex queries as bounding rectangles, then uses a distributed spatial index (TSseek-X) to support efficient exact whole-matching and subsequence-matching queries.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.