pith. sign in

arxiv: 1601.02213 · v1 · pith:RNCNFPOJnew · submitted 2016-01-10 · 💻 cs.LG · cs.AI· stat.ML

On Clustering Time Series Using Euclidean Distance and Pearson Correlation

classification 💻 cs.LG cs.AIstat.ML
keywords correlationdistanceeuclideanpearsonalgorithmclusteringk-meansmany
0
0 comments X
read the original abstract

For time series comparisons, it has often been observed that z-score normalized Euclidean distances far outperform the unnormalized variant. In this paper we show that a z-score normalized, squared Euclidean Distance is, in fact, equal to a distance based on Pearson Correlation. This has profound impact on many distance-based classification or clustering methods. In addition to this theoretically sound result we also show that the often used k-Means algorithm formally needs a mod ification to keep the interpretation as Pearson correlation strictly valid. Experimental results demonstrate that in many cases the standard k-Means algorithm generally produces the same results.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. TimeGuard: Channel-wise Pool Training for Backdoor Defense in Time Series Forecasting

    cs.CR 2026-05 unverdicted novelty 6.0

    TimeGuard employs channel-wise pool training initialized with time-aware criteria and distance-regularized loss selection to defend time series forecasting against backdoor attacks, improving robustness by 1.96x while...