Time series classification with random convolution kernels: pooling operators and input representations matter

David Mercier; Fabrice Morganti; Gildas Morvan; Mathieu Rossi; Mouhamadou Mansour Lo

arxiv: 2409.01115 · v5 · submitted 2024-09-02 · 💻 cs.LG

Time series classification with random convolution kernels: pooling operators and input representations matter

Mouhamadou Mansour Lo , Gildas Morvan , Mathieu Rossi , Fabrice Morganti , David Mercier This is my paper

Pith reviewed 2026-05-23 21:21 UTC · model grok-4.3

classification 💻 cs.LG

keywords time series classificationrandom convolution kernelsMiniRocketSelF-Rocketpooling operatorsinput representationsUCR benchmark

0 comments

The pith

SelF-Rocket dynamically selects the best input representations and pooling operators during training to achieve state-of-the-art accuracy on time series classification benchmarks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents SelF-Rocket as a new method for fast time series classification that builds on MiniRocket by dynamically choosing the best pair of input representation and pooling operator. This is done to exploit the fact that these choices matter for performance. A reader would care if the selection leads to better accuracy while keeping the computational efficiency of random kernel approaches. The work reports that this results in top performance on the UCR collection of datasets.

Core claim

SelF-Rocket, based on MiniRocket, dynamically selects the best couple of input representations and pooling operator during the training process, achieving state-of-the-art accuracy on the University of California Riverside TSC benchmark datasets.

What carries the argument

Dynamic selection of the optimal input representation and pooling operator pair during training in a random convolution kernel based classifier.

Load-bearing premise

The dynamic selection process does not introduce overfitting or selection bias that inflates the reported accuracy on the benchmark datasets.

What would settle it

Running SelF-Rocket on a separate set of time series classification datasets and comparing its accuracy to MiniRocket and other methods would test if the gains hold.

read the original abstract

This article presents a new approach based on MiniRocket, called SelF-Rocket, for fast time series classification (TSC). Unlike existing approaches based on random convolution kernels, it dynamically selects the best couple of input representations and pooling operator during the training process. SelF-Rocket achieves state-of-the-art accuracy on the University of California Riverside (UCR) TSC benchmark datasets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

SelF-Rocket's dynamic selection of representation-plus-pooling is a small but logical step from fixed MiniRocket configs, yet the SOTA claim on UCR rests on an unevaluated risk of per-dataset overfitting.

read the letter

The core addition here is letting the model pick the input representation and pooling operator on the fly during training instead of committing to one pair upfront. That produces a modest accuracy lift over the static MiniRocket variants while preserving the speed that makes these methods practical for streaming data. The authors stay inside the established random-kernel framework rather than inventing new kernels or architectures, which keeps the contribution focused and easy to implement on top of existing code.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces SelF-Rocket, an extension of MiniRocket for time series classification that dynamically selects the optimal pair of input representation and pooling operator during training. It reports state-of-the-art accuracy on the UCR TSC benchmark datasets.

Significance. If the selection procedure is shown to be free of dataset-specific overfitting, the method could provide a practical way to adapt random convolution kernels to varying time series properties, extending the utility of MiniRocket-style approaches beyond fixed configurations.

major comments (2)

[§4] §4 (Experimental Setup): The description of the dynamic selection process does not specify whether the choice of representation/pooling pair is performed via nested cross-validation (with selection hyperparameters frozen before test-set evaluation) or via per-dataset tuning on validation splits; without this, the SOTA claim on the fixed UCR collection risks capitalizing on dataset quirks rather than demonstrating generalization.
[Table 2] Table 2 (Accuracy Results): The reported accuracies for SelF-Rocket are presented without error bars, number of independent runs, or statistical significance tests against MiniRocket and other baselines; this undermines the reliability of the headline performance claim.

minor comments (2)

[Abstract] Abstract: The claim of SOTA accuracy is stated without reference to the experimental protocol or baselines, which would strengthen the summary.
[§3.2] §3.2: The notation distinguishing the candidate input representations could be accompanied by a short illustrative equation for clarity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments, which highlight important aspects of experimental rigor. We address each major comment below and will incorporate clarifications and additional analyses in the revised manuscript.

read point-by-point responses

Referee: [§4] §4 (Experimental Setup): The description of the dynamic selection process does not specify whether the choice of representation/pooling pair is performed via nested cross-validation (with selection hyperparameters frozen before test-set evaluation) or via per-dataset tuning on validation splits; without this, the SOTA claim on the fixed UCR collection risks capitalizing on dataset quirks rather than demonstrating generalization.

Authors: We agree that §4 does not provide sufficient detail on the selection procedure. The current implementation selects the representation/pooling pair on a per-dataset basis by evaluating candidates on a validation split drawn from the training data (typically 20% hold-out), without nested cross-validation. This matches standard practice for UCR benchmark comparisons but does introduce the risk of dataset-specific adaptation noted by the referee. We will revise §4 to explicitly describe this process, add a limitations paragraph discussing potential overfitting to UCR dataset characteristics, and include results for a fixed (non-per-dataset) selection strategy to strengthen the generalization claim. revision: yes
Referee: [Table 2] Table 2 (Accuracy Results): The reported accuracies for SelF-Rocket are presented without error bars, number of independent runs, or statistical significance tests against MiniRocket and other baselines; this undermines the reliability of the headline performance claim.

Authors: We acknowledge that Table 2 reports point estimates only. Although the core kernel generation is efficient, the random components mean results can vary with seed. We will rerun all experiments across 10 independent random seeds, report mean accuracy with standard deviation in the revised Table 2, and add pairwise statistical significance tests (Wilcoxon signed-rank with Holm correction) against MiniRocket and the other baselines to support the performance claims. revision: yes

Circularity Check

0 steps flagged

No significant circularity in empirical method presentation

full rationale

The paper describes SelF-Rocket as an empirical extension of MiniRocket that dynamically selects input representation and pooling operator pairs during training, reporting SOTA accuracy on the UCR TSC benchmark. No derivation chain, equations, or first-principles results are referenced in the abstract or method summary. The central claim is benchmark performance from a selection procedure, not a mathematical reduction where a 'prediction' equals a fitted input by construction, nor any self-definitional loop, uniqueness theorem imported from self-citation, or ansatz smuggled via prior work. The selection mechanism is a training heuristic whose validity rests on external benchmark evaluation rather than internal equivalence to inputs. This is the common case of a self-contained empirical contribution with no detectable circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no equations, so no free parameters, axioms, or invented entities can be extracted.

pith-pipeline@v0.9.0 · 5594 in / 936 out tokens · 17051 ms · 2026-05-23T21:21:39.981689+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

22 extracted references · 22 canonical work pages

[1]

Data Mining 28 and Knowledge Discovery 34(5), 1454–1495 (2020)

Dempster, A., Petitjean, F., Webb, G.I.: ROCKET: exceptionally fast and accu- rate time series classification using random convolutional kernels. Data Mining 28 and Knowledge Discovery 34(5), 1454–1495 (2020)

work page 2020
[2]

In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp

Dempster, A., Schmidt, D.F., Webb, G.I.: MiniRocket: A very fast (almost) de- terministic transform for time series classification. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 248–257 (2021)

work page 2021
[3]

Data Mining and Knowledge Discovery 36(5), 1623–1646 (2022)

Tan, C.W., Dempster, A., Bergmeir, C., Webb, G.I.: MultiRocket: multiple pool- ing operators and transformations for fast and effective time series classification. Data Mining and Knowledge Discovery 36(5), 1623–1646 (2022)

work page 2022
[4]

Data Mining and Knowledge Discovery 38(4), 1958–2031 (2024)

Middlehurst, M., Sch¨ afer, P., Bagnall, A.: Bake off redux: a review and exper- imental evaluation of recent time series classification algorithms. Data Mining and Knowledge Discovery 38(4), 1958–2031 (2024)

work page 1958
[5]

arXiv (2018)

Dau, H.A., Bagnall, A., Kamgar, K., Yeh, C.-C.M., Zhu, Y., Gharghabi, S., Ratanamahatana, C.A., Keogh, E.: The UCR Time Series Archive. arXiv (2018). https://arxiv.org/abs/1810.07758

work page arXiv 2018
[6]

IEEE transactions on acoustics, speech, and signal processing 26(1), 43–49 (1978)

Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE transactions on acoustics, speech, and signal processing 26(1), 43–49 (1978)

work page 1978
[7]

Data Mining and Knowledge Discovery 34(6), 1936–1962 (2020)

Ismail Fawaz, H., Lucas, B., Forestier, G., Pelletier, C., Schmidt, D.F., Weber, J., Webb, G.I., Idoumghar, L., Muller, P.-A., Petitjean, F.: Inceptiontime: Find- ing alexnet for time series classification. Data Mining and Knowledge Discovery 34(6), 1936–1962 (2020)

work page 1936
[8]

Machine Learning 110(11), 3211–3243 (2021)

Middlehurst, M., Large, J., Flynn, M., Lines, J., Bostrom, A., Bagnall, A.: HIVE- COTE 2.0: a new meta ensemble for time series classification. Machine Learning 110(11), 3211–3243 (2021)

work page 2021
[9]

Data Mining and Knowledge Discovery 37(5), 1779–1805 (2023)

Dempster, A., Schmidt, D.F., Webb, G.I.: Hydra: Competing convolutional ker- nels for fast and accurate time series classification. Data Mining and Knowledge Discovery 37(5), 1779–1805 (2023)

work page 2023
[10]

arXiv preprint arXiv:2203.03445 (2022)

Salehinejad, H., Wang, Y., Yu, Y., Jin, T., Valaee, S.: S-Rocket: Selective random convolution kernels for time series classification. arXiv preprint arXiv:2203.03445 (2022)

work page arXiv 2022
[11]

arXiv preprint arXiv:2309.08499 (2023)

Chen, S., Sun, W., Huang, L., Li, X., Wang, Q., John, D.: POCKET: Prun- ing random convolution kernels for time series classification. arXiv preprint arXiv:2309.08499 (2023)

work page arXiv 2023
[12]

arXiv preprint arXiv:2309.14518 (2023) 29

Uribarri, G., Barone, F., Ansuini, A., Frans´ en, E.: Detach-ROCKET: Sequential feature selection for time series classification with random convolutional kernels. arXiv preprint arXiv:2309.14518 (2023) 29

work page arXiv 2023
[13]

Applied Intelligence 53(14), 17778– 17795 (2023)

He, C., Huo, X., Gao, H.: FT-FVC: fast transformation-based feature vector concatenation for time series classification. Applied Intelligence 53(14), 17778– 17795 (2023)

work page 2023
[14]

The Journal of Machine learning research 7(1), 1–30 (2006)

Demˇ sar, J.: Statistical comparisons of classifiers over multiple data sets. The Journal of Machine learning research 7(1), 1–30 (2006)

work page 2006
[15]

Data Mining and Knowledge Discovery 38(4), 1–26 (2024)

Dempster, A., Schmidt, D.F., Webb, G.I.: Quant: A minimalist interval method for time series classification. Data Mining and Knowledge Discovery 38(4), 1–26 (2024)

work page 2024
[16]

Data Mining and Knowledge Discovery 39(2), 14 (2025)

Tan, C.W., Herrmann, M., Salehi, M., Webb, G.I.: Proximity forest 2.0: a new effective and scalable similarity-based classifier for time series. Data Mining and Knowledge Discovery 39(2), 14 (2025)

work page 2025
[17]

Data mining and knowledge discovery 30(2), 403–437 (2016)

Hu, B., Chen, Y., Keogh, E.: Classification of streaming time series under more realistic assumptions. Data mining and knowledge discovery 30(2), 403–437 (2016)

work page 2016
[19]

Data Mining and Knowledge Discovery 29, 565–592 (2015)

Lines, J., Bagnall, A.: Time series classification with ensembles of elastic distance measures. Data Mining and Knowledge Discovery 29, 565–592 (2015)

work page 2015
[20]

In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp

Lines, J., Davis, L.M., Hills, J., Bagnall, A.: A shapelet transform for time series classification. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 289–297 (2012)

work page 2012
[21]

In: 2022 IEEE International Conference on Big Data (Big Data), pp

Ismail-Fawaz, A., Devanne, M., Weber, J., Forestier, G.: Deep learning for time series classification using new hand-crafted convolution filters. In: 2022 IEEE International Conference on Big Data (Big Data), pp. 972–981 (2022). IEEE

work page 2022
[22]

arXiv preprint arXiv:2305.11921 (2023)

Ismail-Fawaz, A., Dempster, A., Tan, C.W., Herrmann, M., Miller, L., Schmidt, D.F., Berretti, S., Weber, J., Devanne, M., Forestier, G., et al.: An approach to multiple comparison benchmark evaluations that is stable under manipulation of the comparate set. arXiv preprint arXiv:2305.11921 (2023)

work page arXiv 2023
[23]

Journal of Machine Learning Research 25(289), 1–10 (2024) 30

Middlehurst, M., Ismail-Fawaz, A., Guillaume, A., Holder, C., Guijo-Rubio, D., Bulatova, G., Tsaprounis, L., Mentel, L., Walter, M., Sch¨ afer, P., Bagnall, A.: aeon: a python toolkit for learning from time series. Journal of Machine Learning Research 25(289), 1–10 (2024) 30

work page 2024

[1] [1]

Data Mining 28 and Knowledge Discovery 34(5), 1454–1495 (2020)

Dempster, A., Petitjean, F., Webb, G.I.: ROCKET: exceptionally fast and accu- rate time series classification using random convolutional kernels. Data Mining 28 and Knowledge Discovery 34(5), 1454–1495 (2020)

work page 2020

[2] [2]

In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp

Dempster, A., Schmidt, D.F., Webb, G.I.: MiniRocket: A very fast (almost) de- terministic transform for time series classification. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 248–257 (2021)

work page 2021

[3] [3]

Data Mining and Knowledge Discovery 36(5), 1623–1646 (2022)

Tan, C.W., Dempster, A., Bergmeir, C., Webb, G.I.: MultiRocket: multiple pool- ing operators and transformations for fast and effective time series classification. Data Mining and Knowledge Discovery 36(5), 1623–1646 (2022)

work page 2022

[4] [4]

Data Mining and Knowledge Discovery 38(4), 1958–2031 (2024)

Middlehurst, M., Sch¨ afer, P., Bagnall, A.: Bake off redux: a review and exper- imental evaluation of recent time series classification algorithms. Data Mining and Knowledge Discovery 38(4), 1958–2031 (2024)

work page 1958

[5] [5]

arXiv (2018)

Dau, H.A., Bagnall, A., Kamgar, K., Yeh, C.-C.M., Zhu, Y., Gharghabi, S., Ratanamahatana, C.A., Keogh, E.: The UCR Time Series Archive. arXiv (2018). https://arxiv.org/abs/1810.07758

work page arXiv 2018

[6] [6]

IEEE transactions on acoustics, speech, and signal processing 26(1), 43–49 (1978)

Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE transactions on acoustics, speech, and signal processing 26(1), 43–49 (1978)

work page 1978

[7] [7]

Data Mining and Knowledge Discovery 34(6), 1936–1962 (2020)

Ismail Fawaz, H., Lucas, B., Forestier, G., Pelletier, C., Schmidt, D.F., Weber, J., Webb, G.I., Idoumghar, L., Muller, P.-A., Petitjean, F.: Inceptiontime: Find- ing alexnet for time series classification. Data Mining and Knowledge Discovery 34(6), 1936–1962 (2020)

work page 1936

[8] [8]

Machine Learning 110(11), 3211–3243 (2021)

Middlehurst, M., Large, J., Flynn, M., Lines, J., Bostrom, A., Bagnall, A.: HIVE- COTE 2.0: a new meta ensemble for time series classification. Machine Learning 110(11), 3211–3243 (2021)

work page 2021

[9] [9]

Data Mining and Knowledge Discovery 37(5), 1779–1805 (2023)

Dempster, A., Schmidt, D.F., Webb, G.I.: Hydra: Competing convolutional ker- nels for fast and accurate time series classification. Data Mining and Knowledge Discovery 37(5), 1779–1805 (2023)

work page 2023

[10] [10]

arXiv preprint arXiv:2203.03445 (2022)

Salehinejad, H., Wang, Y., Yu, Y., Jin, T., Valaee, S.: S-Rocket: Selective random convolution kernels for time series classification. arXiv preprint arXiv:2203.03445 (2022)

work page arXiv 2022

[11] [11]

arXiv preprint arXiv:2309.08499 (2023)

Chen, S., Sun, W., Huang, L., Li, X., Wang, Q., John, D.: POCKET: Prun- ing random convolution kernels for time series classification. arXiv preprint arXiv:2309.08499 (2023)

work page arXiv 2023

[12] [12]

arXiv preprint arXiv:2309.14518 (2023) 29

Uribarri, G., Barone, F., Ansuini, A., Frans´ en, E.: Detach-ROCKET: Sequential feature selection for time series classification with random convolutional kernels. arXiv preprint arXiv:2309.14518 (2023) 29

work page arXiv 2023

[13] [13]

Applied Intelligence 53(14), 17778– 17795 (2023)

He, C., Huo, X., Gao, H.: FT-FVC: fast transformation-based feature vector concatenation for time series classification. Applied Intelligence 53(14), 17778– 17795 (2023)

work page 2023

[14] [14]

The Journal of Machine learning research 7(1), 1–30 (2006)

Demˇ sar, J.: Statistical comparisons of classifiers over multiple data sets. The Journal of Machine learning research 7(1), 1–30 (2006)

work page 2006

[15] [15]

Data Mining and Knowledge Discovery 38(4), 1–26 (2024)

Dempster, A., Schmidt, D.F., Webb, G.I.: Quant: A minimalist interval method for time series classification. Data Mining and Knowledge Discovery 38(4), 1–26 (2024)

work page 2024

[16] [16]

Data Mining and Knowledge Discovery 39(2), 14 (2025)

Tan, C.W., Herrmann, M., Salehi, M., Webb, G.I.: Proximity forest 2.0: a new effective and scalable similarity-based classifier for time series. Data Mining and Knowledge Discovery 39(2), 14 (2025)

work page 2025

[17] [17]

Data mining and knowledge discovery 30(2), 403–437 (2016)

Hu, B., Chen, Y., Keogh, E.: Classification of streaming time series under more realistic assumptions. Data mining and knowledge discovery 30(2), 403–437 (2016)

work page 2016

[18] [19]

Data Mining and Knowledge Discovery 29, 565–592 (2015)

Lines, J., Bagnall, A.: Time series classification with ensembles of elastic distance measures. Data Mining and Knowledge Discovery 29, 565–592 (2015)

work page 2015

[19] [20]

In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp

Lines, J., Davis, L.M., Hills, J., Bagnall, A.: A shapelet transform for time series classification. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 289–297 (2012)

work page 2012

[20] [21]

In: 2022 IEEE International Conference on Big Data (Big Data), pp

Ismail-Fawaz, A., Devanne, M., Weber, J., Forestier, G.: Deep learning for time series classification using new hand-crafted convolution filters. In: 2022 IEEE International Conference on Big Data (Big Data), pp. 972–981 (2022). IEEE

work page 2022

[21] [22]

arXiv preprint arXiv:2305.11921 (2023)

Ismail-Fawaz, A., Dempster, A., Tan, C.W., Herrmann, M., Miller, L., Schmidt, D.F., Berretti, S., Weber, J., Devanne, M., Forestier, G., et al.: An approach to multiple comparison benchmark evaluations that is stable under manipulation of the comparate set. arXiv preprint arXiv:2305.11921 (2023)

work page arXiv 2023

[22] [23]

Journal of Machine Learning Research 25(289), 1–10 (2024) 30

Middlehurst, M., Ismail-Fawaz, A., Guillaume, A., Holder, C., Guijo-Rubio, D., Bulatova, G., Tsaprounis, L., Mentel, L., Walter, M., Sch¨ afer, P., Bagnall, A.: aeon: a python toolkit for learning from time series. Journal of Machine Learning Research 25(289), 1–10 (2024) 30

work page 2024