On Clustering Time Series Using Euclidean Distance and Pearson Correlation

Frank H\"oppner; Michael R. Berthold

arxiv: 1601.02213 · v1 · pith:RNCNFPOJnew · submitted 2016-01-10 · 💻 cs.LG · cs.AI· stat.ML

On Clustering Time Series Using Euclidean Distance and Pearson Correlation

Michael R. Berthold , Frank H\"oppner This is my paper

classification 💻 cs.LG cs.AIstat.ML

keywords correlationdistanceeuclideanpearsonalgorithmclusteringk-meansmany

0 comments

read the original abstract

For time series comparisons, it has often been observed that z-score normalized Euclidean distances far outperform the unnormalized variant. In this paper we show that a z-score normalized, squared Euclidean Distance is, in fact, equal to a distance based on Pearson Correlation. This has profound impact on many distance-based classification or clustering methods. In addition to this theoretically sound result we also show that the often used k-Means algorithm formally needs a mod ification to keep the interpretation as Pearson correlation strictly valid. Experimental results demonstrate that in many cases the standard k-Means algorithm generally produces the same results.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

TimeGuard: Channel-wise Pool Training for Backdoor Defense in Time Series Forecasting
cs.CR 2026-05 unverdicted novelty 6.0

TimeGuard employs channel-wise pool training initialized with time-aware criteria and distance-regularized loss selection to defend time series forecasting against backdoor attacks, improving robustness by 1.96x while...