State-Space Dynamics Distance for Clustering Sequential Data

Dar\'io Garc\'ia-Garc\'ia; Emilio Parrado-Hern\'andez; Fernando D\'iaz-de-Mar\'ia

arxiv: 1004.1982 · v1 · submitted 2010-04-09 · 💻 cs.LG

State-Space Dynamics Distance for Clustering Sequential Data

Dar\'io Garc\'ia-Garc\'ia , Emilio Parrado-Hern\'andez , Fernando D\'iaz-de-Mar\'ia This is my paper

classification 💻 cs.LG

keywords clusteringstate-spacedatameasuremodelsequencesequencessequential

0 comments

read the original abstract

This paper proposes a novel similarity measure for clustering sequential data. We first construct a common state-space by training a single probabilistic model with all the sequences in order to get a unified representation for the dataset. Then, distances are obtained attending to the transition matrices induced by each sequence in that state-space. This approach solves some of the usual overfitting and scalability issues of the existing semi-parametric techniques, that rely on training a model for each sequence. Empirical studies on both synthetic and real-world datasets illustrate the advantages of the proposed similarity measure for clustering sequences.

This paper has not been read by Pith yet.

State-Space Dynamics Distance for Clustering Sequential Data

discussion (0)