Scalable Discovery of Time-Series Shapelets

Josif Grabocka; Lars Schmidt-Thieme; Martin Wistuba

arxiv: 1503.03238 · v1 · pith:7AOS2KFBnew · submitted 2015-03-11 · 💻 cs.LG

keywords: accuracy, candidates, shapelets, method, prediction, time-series, classification, data

Time-series classification is an important problem for the data mining community due to the wide range of application domains involving time-series data. A recent paradigm, called shapelets, represents patterns that are highly predictive for the target variable. Shapelets are discovered by measuring the prediction accuracy of a set of potential (shapelet) candidates. The candidates typically consist of all the segments of a dataset; therefore, the discovery of shapelets is computationally expensive. This paper proposes a novel method that avoids measuring the prediction accuracy of similar candidates in Euclidean distance space, through an online clustering pruning technique. In addition, our algorithm incorporates a supervised shapelet selection that retains only those candidates that improve classification accuracy. Empirical evidence on 45 datasets from the UCR collection demonstrates that our method is 3-4 orders of magnitude faster than the fastest existing shapelet-discovery method, while providing better prediction accuracy.
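
The pruning idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the function names `sliding_segments` and `prune_similar`, the distance threshold `epsilon`, and the toy data are all assumptions made for illustration. The sketch enumerates every segment of a fixed length as a candidate, then skips any candidate that lies within `epsilon` (Euclidean distance) of one already kept, so that near-duplicate candidates would never need their prediction accuracy measured. The supervised selection step is not shown.

```python
import numpy as np

def sliding_segments(series, length):
    """Return every contiguous segment of the given length from each series."""
    return [s[i:i + length] for s in series for i in range(len(s) - length + 1)]

def prune_similar(candidates, epsilon):
    """Keep a candidate only if it is farther than epsilon from every kept one.

    epsilon is a hypothetical threshold; how it is chosen is not specified here.
    """
    kept = []
    for c in candidates:
        c = np.asarray(c, dtype=float)
        # One pass over the candidates: compare only against those already kept.
        if all(np.linalg.norm(c - k) > epsilon for k in kept):
            kept.append(c)
    return kept

# Toy data (assumed): two nearly identical series of length 5.
series = [
    np.array([0.0, 0.0, 1.0, 0.0, 0.0]),
    np.array([0.0, 0.05, 1.0, 0.0, 0.0]),
]
candidates = sliding_segments(series, 3)
kept = prune_similar(candidates, epsilon=0.2)
print(len(candidates), "candidates,", len(kept), "kept after pruning")
```

In this toy case the second series contributes no new candidates, because each of its segments is within `epsilon` of a segment from the first series. Only the kept candidates would then go on to an accuracy-based selection step.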

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

INSHAPE: Instance-Level Shapelets for Interpretable Time-Series Classification
cs.LG 2026-05 unverdicted novelty 7.0

INSHAPE discovers instance-specific non-overlapping shapelets, models their temporal dependencies, and aggregates them bottom-up into population-level prototypes for improved accuracy and interpretability in time-seri...