PRISM: Lightweight Multivariate Time-Series Classification through Symmetric Multi-Resolution Convolutional Layers

Federico Zucchi; Thomas Lampert

arxiv: 2508.04503 · v3 · submitted 2025-08-06 · 💻 cs.LG · cs.AI

PRISM: Lightweight Multivariate Time-Series Classification through Symmetric Multi-Resolution Convolutional Layers

Federico Zucchi , Thomas Lampert This is my paper

Pith reviewed 2026-05-19 00:18 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords multivariate time series classificationlightweight convolutional networkssymmetric filtersmulti-resolution convolutionsparameter efficiencysignal processing priorshuman activity recognition

0 comments

The pith

Symmetric multi-resolution filters halve parameters in early layers while preserving full receptive field for multivariate time series classification.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents PRISM as a fully convolutional model that applies symmetric multi-resolution filters channel by channel at the start of the network. This draws on linear-phase filter ideas to cut learnable parameters roughly in half in those layers without shrinking the range of temporal patterns the model can detect. The design is tested across the full UEA multivariate archive plus targeted sets for activity recognition, sleep staging, and biomedical signals, where it reaches or exceeds the accuracy of heavier CNN and Transformer baselines at far lower parameter count and compute cost. A reader would care because many real deployments, from wearables to clinical monitoring, need accurate classification that fits on modest hardware rather than large servers.

Core claim

PRISM achieves competitive or superior accuracy on the UEA multivariate time-series archive and related benchmarks by inserting a bank of multi-resolution symmetric convolutional filters in its initial stage; the symmetry constraint, modeled after linear-phase FIR filters, structurally halves the number of free parameters in those layers while the receptive field and per-channel independence remain unchanged, yielding lower overall computational cost than current CNN or Transformer approaches.

What carries the argument

Symmetric multi-resolution convolutional filters that impose linear-phase-style structural constraints to halve parameters in the first layers while keeping the full receptive field.

If this is right

Matches or exceeds accuracy of state-of-the-art CNN and Transformer models on the UEA archive.
Uses significantly fewer parameters than competing architectures.
Requires markedly lower computational cost during inference and training.
Delivers strong results on human activity recognition, sleep staging, and biomedical signal tasks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same symmetry prior could be tested on other sequence lengths or on univariate series to check whether the efficiency gain scales.
Channel-independent processing may simplify deployment when the number of recorded channels varies across devices.
Hybrid architectures that insert symmetric layers only at selected depths could further tune the accuracy-efficiency trade-off.

Load-bearing premise

The symmetry constraints on the filters do not reduce the network's ability to learn the discriminative features required for accurate classification.

What would settle it

A controlled ablation on the same UEA and benchmark datasets in which an otherwise identical non-symmetric multi-resolution network shows clearly higher accuracy than PRISM.

Figures

Figures reproduced from arXiv: 2508.04503 by Federico Zucchi, Thomas Lampert.

**Figure 1.** Figure 1: Overview of the proposed architecture. The symmetric multi-resolution feature-extraction module (shown here for a single input channel) works as follows. [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗

**Figure 2.** Figure 2: PRISM output interface. (a) supports fine-grained temporal analysis, [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Comparison of PRISM+Linear and baselines in terms of (a) parameter count and (b) FLOPs versus classification accuracy on the UEA Heartbeat dataset. within a narrow margin of TS-TCC across the board. This further underlines the strength of the features extracted by PRISM without the overhead of extra training stages. Using a more complex classification head, such as a Transformer with self-attention, pro… view at source ↗

**Figure 5.** Figure 5: Frequency responses of the most dissimilar filter pairs (by cosine [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗

read the original abstract

Multivariate time series classification supports applications from wearable sensing to biomedical monitoring and demands models that can capture both short-term patterns and multi-scale temporal dependencies. Despite recent advances, Transformer and CNN models often remain computationally heavy and rely on many parameters. This work presents PRISM(Per-channel Resolution Informed Symmetric Module), a lightweight fully convolutional classifier. Operating in a channel-independent manner, in its early stage it applies a set of multi-resolution symmetric convolutional filters. This symmetry enforces structural constraints inspired by linear-phase FIR filters from classical signal processing, effectively halving the number of learnable parameters within the initial layers while preserving the full receptive field. Across the diverse UEA multivariate time-series archive as well as specific benchmarks in human activity recognition, sleep staging, and biomedical signals, PRISM matches or outperforms state-of-the-art CNN and Transformer models while using significantly fewer parameters and markedly lower computational cost. By bringing a principled signal processing prior into a modern neural architecture, PRISM offers an effective and computationally economical solution for multivariate time series classification. Code and data are available at https://github.com/fedezuc/PRISM

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

PRISM adds symmetric multi-resolution per-channel convolutions drawn from FIR filter ideas to cut early-layer parameters roughly in half for multivariate time series classification while keeping receptive field size, and the reported results on UEA plus domain benchmarks look competitive with heavier models.

read the letter

The core idea here is straightforward: run channel-independent convolutions at multiple resolutions in the first stage, but force symmetry on the kernels so that w[i] equals w[k-1-i]. This structural prior from classical signal processing halves the free parameters in those layers without shrinking the receptive field. The paper then stacks the usual fully convolutional classifier on top and evaluates on the full UEA multivariate archive plus targeted sets for activity recognition, sleep staging, and biomedical signals. It reports matching or beating current CNN and Transformer baselines at markedly lower parameter count and compute. Code is released, which makes the claims checkable.

Referee Report

2 major / 2 minor

Summary. The manuscript presents PRISM, a lightweight fully convolutional architecture for multivariate time series classification. It uses symmetric multi-resolution convolutional layers in the early stages, drawing from linear-phase FIR filter designs to enforce symmetry and reduce the number of parameters by half while keeping the receptive field intact. The model operates channel-independently and is evaluated on the UEA multivariate time series archive as well as tasks in human activity recognition, sleep staging, and biomedical signals, where it is claimed to match or surpass state-of-the-art CNN and Transformer models with significantly fewer parameters and lower computational cost.

Significance. Should the empirical findings prove robust, this work contributes a practical approach to efficient time series modeling by integrating classical signal processing principles into neural networks. This could be particularly valuable for deployment in edge devices or real-time monitoring systems where computational resources are limited. The open-sourcing of code supports further validation and extension.

major comments (2)

[§3.2] §3.2 (Symmetric Multi-Resolution Module): The central claim that the symmetry constraint (w[i] = w[k-1-i]) halves parameters without reducing expressivity or receptive field relies on the assumption that directional/phase-sensitive patterns remain learnable. No ablation is presented that compares the symmetric filters against an otherwise identical asymmetric multi-resolution convolutional baseline under the same training regime and hyper-parameters; this comparison is load-bearing for attributing performance to the proposed prior rather than the multi-resolution structure alone.
[Table 4] Table 4 (UEA archive results): Reported mean accuracies show PRISM matching or exceeding baselines, but the table does not indicate the number of independent runs, standard deviations, or statistical significance tests (e.g., paired t-test or Wilcoxon). Without these, it is impossible to determine whether small reported gains are reliable or could be explained by training stochasticity.

minor comments (2)

[§4.1] §4.1 (Experimental setup): The description of data preprocessing and train/validation/test splits for the UEA datasets is brief; explicit reference to the exact splits used (e.g., from the UEA archive repository) would improve reproducibility.
[Figure 3] Figure 3 (Architecture diagram): The illustration of the per-channel symmetric convolution would benefit from a small inset showing the weight-sharing pattern for a kernel of length 5 or 7 to make the parameter reduction explicit.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed review. We address each major comment below and have revised the manuscript to incorporate the suggested improvements where feasible.

read point-by-point responses

Referee: [§3.2] §3.2 (Symmetric Multi-Resolution Module): The central claim that the symmetry constraint (w[i] = w[k-1-i]) halves parameters without reducing expressivity or receptive field relies on the assumption that directional/phase-sensitive patterns remain learnable. No ablation is presented that compares the symmetric filters against an otherwise identical asymmetric multi-resolution convolutional baseline under the same training regime and hyper-parameters; this comparison is load-bearing for attributing performance to the proposed prior rather than the multi-resolution structure alone.

Authors: We thank the referee for this observation. The symmetry constraint is derived from linear-phase FIR filter design, which theoretically preserves the receptive field while halving parameters. However, we acknowledge that an explicit ablation isolating the symmetry prior from the multi-resolution structure would strengthen the attribution of performance gains. In the revised manuscript we will add such an ablation, training an otherwise identical asymmetric multi-resolution baseline under the same regime and hyperparameters. revision: yes
Referee: [Table 4] Table 4 (UEA archive results): Reported mean accuracies show PRISM matching or exceeding baselines, but the table does not indicate the number of independent runs, standard deviations, or statistical significance tests (e.g., paired t-test or Wilcoxon). Without these, it is impossible to determine whether small reported gains are reliable or could be explained by training stochasticity.

Authors: We agree that reporting variability and statistical tests is necessary to establish reliability. In the revised manuscript we will update Table 4 to include the number of independent runs, standard deviations, and results of statistical significance tests (Wilcoxon signed-rank test) comparing PRISM to the baselines. revision: yes

Circularity Check

0 steps flagged

No significant circularity; symmetry prior is external and claims are empirical

full rationale

The paper introduces the symmetric multi-resolution convolutions as a structural prior drawn from classical signal processing (linear-phase FIR filters), not as a quantity derived from or fitted to the target classification task. Performance claims consist of empirical matches or outperformance on the UEA archive and other benchmarks with reduced parameter count; these do not reduce by construction to self-defined inputs, fitted parameters renamed as predictions, or self-citation chains. No load-bearing uniqueness theorems or ansatzes from the authors' prior work are invoked. The derivation chain is therefore self-contained against external benchmarks and architectural choices.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The model rests on the domain assumption that symmetry from FIR filter theory can be transferred to CNN layers without loss of expressiveness; no free parameters or invented entities are described in the abstract.

axioms (1)

domain assumption Symmetry constraints on convolutional filters preserve full receptive field and classification performance.
This premise is invoked to justify halving parameters while claiming no loss in capability.

pith-pipeline@v0.9.0 · 5722 in / 1271 out tokens · 70928 ms · 2026-05-19T00:18:47.601787+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Every filter w(f,k)∈R^k satisfies the symmetry property w(f,k)_{-j}=w(f,k)_j … guaranteeing linear-phase behaviour that preserves the signal’s phase
IndisputableMonolith/Foundation/DimensionForcing.lean eight_tick_period_forces_D3 unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

multi-resolution symmetric convolutional filters … kernel sizes of 11, 21, 51 and 71

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

52 extracted references · 52 canonical work pages

[1]

Q. Wen, T. Zhou, C. Zhang, W. Chen, Z. Ma, Z. Yan, J. Yan, L. Sun, Transformers in time series: A survey, International Joint Conference on Artificial Intelligence (2023)

work page 2023
[2]

K. Li, Y . Wang, P. Gao, G. Song, Y . Liu, H. Li, Y . Qiao, UniFormer: Unified transformer for e fficient spatiotemporal representation learning (2022)

work page 2022
[3]

W. Wang, E. Zuo, C. Chen, C. Chen, J. Zhong, Z. Yan, X. Lv, E fficient time series adaptive representation learning via dynamic routing sparse attention, Pattern Recognition (2025)

work page 2025
[4]

Zhang, Y

T. Zhang, Y . Zhang, W. Cao, J. Bian, X. Yi, S. Zheng, J. Li, Less is more: Fast multivariate time series forecasting with light sampling-oriented mlp structures., arXiv preprint arXiv:2207.01186 (2022)

work page arXiv 2022
[5]

Y . Gu, X. Yan, H. Qin, N. Akhtar, S. Yuan, H. Fu, S. Yang, A. Mian, HDTCNet: A hybrid-dimensional convolutional network for multivariate time series classification, Pattern Recognition (2025)

work page 2025
[6]

H. Wu, T. Hu, Y . Liu, H. Zhou, J. Wang, M. Long, Timesnet: Temporal 2d-variation modeling for general time series analysis, in: International Conference on Learning Representations, 2023

work page 2023
[7]

K. Yi, J. Fei, Q. Zhang, H. He, S. Hao, D. Lian, W. Fan, Filternet: Har- nessing frequency filters for time series forecasting, in: Neural Informa- tion Processing Systems, 2024

work page 2024
[8]

S. G. Mallat, A theory for multiresolution signal decomposition: the wavelet representation, IEEE transactions on pattern analysis and ma- chine intelligence (1989)

work page 1989
[9]

G. Chen, T. D. Bui, A. Krzy˙zak, Invariant pattern recognition using radon, dual-tree complex wavelet and fourier transforms, Pattern Recognition (2009)

work page 2009
[10]

Linear-Phase FIR Filters

A. V . Oppenheim, R. W. Schafer, J. R. Buck, Discrete-Time Signal Pro- cessing, 3rd Edition, Prentice-Hall, 2010, section 7.4: “Linear-Phase FIR Filters”

work page 2010
[11]

Dzhezyan, H

G. Dzhezyan, H. Cecotti, Symmetrical filters in convolutional neural networks, International Journal of Machine Learning and Cybernetics (2021)

work page 2021
[12]

S. A. Martucci, Symmetric convolution and the discrete sine and cosine transforms, IEEE Transactions on Signal Processing (1994)

work page 1994
[13]

Ahmed, T

N. Ahmed, T. Natarajan, K. R. Rao, Discrete cosine transform, IEEE transactions on Computers (2006)

work page 2006
[14]

Ulicny, V

M. Ulicny, V . A. Krylov, R. Dahyot, Harmonic convolutional networks based on discrete cosine transform, Pattern Recognition (2022)

work page 2022
[15]

K. Xu, M. Qin, F. Sun, Y . Wang, Y .-K. Chen, F. Ren, Learning in the frequency domain, in: Proceedings of the IEEE/CVF conference on com- puter vision and pattern recognition, 2020

work page 2020
[16]

Vaswani, N

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Łukasz Kaiser, I. Polosukhin, Attention is all you need, in: Advances in Neural Information Processing Systems, 2017

work page 2017
[17]

H. Wu, J. Xu, J. Wang, M. Long, Autoformer: Decomposition transform- ers with auto-correlation for long-term series forecasting, in: Advances in Neural Information Processing Systems, 2021

work page 2021
[18]

T. Zhou, Q. Zhang, W. Chen, H. Wu, Z. Xie, Z. Yan, Y . Wang, L. Sun, FEDformer: Frequency enhanced decomposed transformer for long-term series forecasting, in: International Conference on Learning Representa- tions, 2022

work page 2022
[19]

Y . Li, Z. Zhang, M. Zaheer, S. Kottur, Enhancing the locality and break- ing the memory bottleneck of transformer on time series forecasting, in: Advances in Neural Information Processing Systems, 2020

work page 2020
[20]

Kitaev, Ł

N. Kitaev, Ł. Kaiser, A. Levskaya, Reformer: The e fficient transformer, in: International Conference on Learning Representations, 2020

work page 2020
[21]

Zhang, Z

C. Zhang, Z. Yan, Crossformer: Transformer utilizing cross-dimension dependency for multivariate time series forecasting, in: International Conference on Learning Representations, 2023

work page 2023
[22]

A. Zeng, M. Zhang, G. Gao, Z. Meng, J. Wang, J. Zhou, Are transformers all you need for long-term series forecasting?, in: Neural Information Processing Systems, 2023

work page 2023
[23]

Y . Liu, Y . Zhang, H. Wu, M. Long, J. Wang, DLinear: Are linear time- series models really that simple?, in: AAAI 2023, 2023

work page 2023
[24]

Y . Xin, Z. Ma, T. Wei, Y . Liu, Efficient attention mechanisms for long- sequence time-series modeling, in: ACM SIGKDD, 2024

work page 2024
[25]

Dempster, F

A. Dempster, F. Petitjean, G. I. Webb, ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels, Data Mining and Knowledge Discovery (2020)

work page 2020
[26]

Eldele, S

E. Eldele, S. Qu, X. Wu, Z. Cao, J. Ng, TS-TCC: Time-series represen- tation learning via temporal and contextual contrasting, in: International Joint Conference on Artificial Intelligence, 2021

work page 2021
[27]

Z. Yue, G. Zhou, Q. Yang, S. Xu, Q. Kan, TS2Vec: Towards universal representation of time series, in: AAAI Conference on Artificial Intelli- gence, 2022

work page 2022
[28]

Q. Meng, H. Qian, Y . Liu, L. Cui, Y . Xu, Z. Shen, MHCCL: masked hierarchical cluster-wise contrastive learning for multivariate time series, in: AAAI Conference on Artificial Intelligence, 2023

work page 2023
[29]

P. Liu, Q. Kong, L. Jian, J. Kuang, T-WaveNet: A tree-structured wavelet neural network for time-series signal analysis, in: International Confer- ence on Learning Representations, 2022

work page 2022
[30]

P. Liu, M. Zhang, C. Li, Q. Li, N. Laptev, SCINet: Time series modeling and forecasting with sample-convolve-interact networks, in: International Conference on Learning Representations, 2022

work page 2022
[31]

P. Liu, B. Wu, N. Li, T. Dai, F. Lei, J. Bao, Y . Jiang, S.-T. Xia, WFTNet: Exploiting global and local periodicity in long-term time series forecast- ing, in: ACM SIGKDD, 2023

work page 2023
[32]

Zhang, C

F. Zhang, C. Zhang, W. Chen, Z. Ma, TCE: Temporal convolutional ex- plorer helps understand 1d-cnns from the frequency perspective, in: ACM CIKM 2023, 2023

work page 2023
[33]

Yang, H.-S

Y . Yang, H.-S. Hong, BTSF: Bilinear temporal-spectral fusion for unsu- pervised time-series representation learning, in: International Conference on Learning Representations, 2022

work page 2022
[34]

A. L. Goldberger, L. A. Amaral, L. Glass, J. M. Hausdor ff, P. C. Ivanov, R. G. Mark, J. E. Mietus, G. B. Moody, C.-K. Peng, H. E. Stanley, Phys- iobank, physiotoolkit, and physionet: components of a new research re- source for complex physiologic signals, circulation (2000)

work page 2000
[35]

Eldele, Z

E. Eldele, Z. Chen, C. Liu, M. Wu, C.-K. Kwoh, X. Li, C. Guan, An attention-based deep learning approach for sleep stage classification with single-channel EEG, IEEE Transactions on Neural Systems and Rehabil- itation Engineering (2021)

work page 2021
[36]

Eldele, M

E. Eldele, M. Ragab, Z. Chen, M. Wu, X. Li, TSLANet: Rethinking trans- formers for time series representation learning, in: International Confer- ence on Machine Learning, 2024

work page 2024
[37]

G. B. Moody, R. G. Mark, The impact of the mit-bih arrhythmia database, IEEE engineering in medicine and biology magazine (2001)

work page 2001
[38]

Goldberger, UEA archive: Heartbeat data set (2016)

A. Goldberger, UEA archive: Heartbeat data set (2016)

work page 2016
[39]

Anguita, A

D. Anguita, A. Ghio, L. Oneto, X. Parra, J. L. Reyes-Ortiz, et al., A public domain dataset for human activity recognition using smartphones., in: Esann, 2013

work page 2013
[40]

J. R. Kwapisz, G. M. Weiss, S. A. Moore, Activity recognition using cell phone accelerometers, ACM SigKDD Explorations Newsletter (2011)

work page 2011
[41]

Stisen, H

A. Stisen, H. Blunck, S. Bhattacharya, T. S. Prentow, M. B. Kjærgaard, A. Dey, T. Sonne, M. M. Jensen, Smart devices are di fferent: Assess- ing and mitigatingmobile sensing heterogeneities for activity recognition, in: Proceedings of the ACM conference on embedded networked sensor systems, 2015

work page 2015
[42]

T. Zhou, P. Niu, L. Sun, R. Jin, et al., One fits all: Power general time series analysis by pretrained LM, Advances in neural information pro- cessing systems (2023)

work page 2023
[43]

Y . Nie, N. H. Nguyen, P. Sinthong, J. Kalagnanam, A time series is worth 64 words: Long-term forecasting with transformers, in: International Conference on Learning Representations, 2023

work page 2023
[44]

Khalighi, T

S. Khalighi, T. Sousa, J. M. Santos, U. Nunes, ISRUC-Sleep: A com- prehensive public dataset for sleep researchers, Computer methods and programs in biomedicine (2016)

work page 2016
[45]

Goerttler, Y

S. Goerttler, Y . Wang, E. Eldele, M. Wu, F. He, MSA-CNN: A lightweight multi-scale CNN with attention for sleep stage classification, arXiv preprint arXiv:2501.02949 (2025)

work page arXiv 2025
[46]

Supratak, H

A. Supratak, H. Dong, C. Wu, Y . Guo, DeepSleepNet: A model for auto- 11 matic sleep stage scoring based on raw single-channel eeg, IEEE transac- tions on neural systems and rehabilitation engineering (2017)

work page 2017
[47]

V . J. Lawhern, A. J. Solon, N. R. Waytowich, S. M. Gordon, C. P. Hung, B. J. Lance, EEGNet: a compact convolutional neural network for eeg- based brain–computer interfaces, Journal of neural engineering (2018)

work page 2018
[48]

Z. Jia, Y . Lin, J. Wang, X. Ning, Y . He, R. Zhou, Y . Zhou, L.-w. H. Lehman, Multi-view spatial-temporal graph convolutional networks with domain generalization for sleep stage classification, IEEE Transactions on Neural Systems and Rehabilitation Engineering (2021)

work page 2021
[49]

X. Ji, Y . Li, P. Wen, Jumping knowledge based spatial-temporal graph convolutional networks for automatic sleep stage classification, IEEE Transactions on Neural Systems and Rehabilitation Engineering (2022)

work page 2022
[50]

Y . Wang, M. Wu, X. Li, L. Xie, Z. Chen, Multivariate time-series repre- sentation learning via hierarchical correlation pooling boosted graph neu- ral network, IEEE Transactions on Artificial Intelligence (2023)

work page 2023
[51]

Y . Wang, Y . Xu, J. Yang, M. Wu, X. Li, L. Xie, Z. Chen, Fully-connected spatial-temporal graph for multivariate time-series data, in: AAAI con- ference on artificial intelligence, 2024

work page 2024
[52]

Z. Yang, M. Qiu, X. Fan, G. Dai, W. Ma, X. Peng, X. Fu, Y . Li, cV AN: A novel sleep staging method via cross-view alignment network, IEEE Journal of Biomedical and Health Informatics (2024). 12

work page 2024

[1] [1]

Q. Wen, T. Zhou, C. Zhang, W. Chen, Z. Ma, Z. Yan, J. Yan, L. Sun, Transformers in time series: A survey, International Joint Conference on Artificial Intelligence (2023)

work page 2023

[2] [2]

K. Li, Y . Wang, P. Gao, G. Song, Y . Liu, H. Li, Y . Qiao, UniFormer: Unified transformer for e fficient spatiotemporal representation learning (2022)

work page 2022

[3] [3]

W. Wang, E. Zuo, C. Chen, C. Chen, J. Zhong, Z. Yan, X. Lv, E fficient time series adaptive representation learning via dynamic routing sparse attention, Pattern Recognition (2025)

work page 2025

[4] [4]

Zhang, Y

T. Zhang, Y . Zhang, W. Cao, J. Bian, X. Yi, S. Zheng, J. Li, Less is more: Fast multivariate time series forecasting with light sampling-oriented mlp structures., arXiv preprint arXiv:2207.01186 (2022)

work page arXiv 2022

[5] [5]

Y . Gu, X. Yan, H. Qin, N. Akhtar, S. Yuan, H. Fu, S. Yang, A. Mian, HDTCNet: A hybrid-dimensional convolutional network for multivariate time series classification, Pattern Recognition (2025)

work page 2025

[6] [6]

H. Wu, T. Hu, Y . Liu, H. Zhou, J. Wang, M. Long, Timesnet: Temporal 2d-variation modeling for general time series analysis, in: International Conference on Learning Representations, 2023

work page 2023

[7] [7]

K. Yi, J. Fei, Q. Zhang, H. He, S. Hao, D. Lian, W. Fan, Filternet: Har- nessing frequency filters for time series forecasting, in: Neural Informa- tion Processing Systems, 2024

work page 2024

[8] [8]

S. G. Mallat, A theory for multiresolution signal decomposition: the wavelet representation, IEEE transactions on pattern analysis and ma- chine intelligence (1989)

work page 1989

[9] [9]

G. Chen, T. D. Bui, A. Krzy˙zak, Invariant pattern recognition using radon, dual-tree complex wavelet and fourier transforms, Pattern Recognition (2009)

work page 2009

[10] [10]

Linear-Phase FIR Filters

A. V . Oppenheim, R. W. Schafer, J. R. Buck, Discrete-Time Signal Pro- cessing, 3rd Edition, Prentice-Hall, 2010, section 7.4: “Linear-Phase FIR Filters”

work page 2010

[11] [11]

Dzhezyan, H

G. Dzhezyan, H. Cecotti, Symmetrical filters in convolutional neural networks, International Journal of Machine Learning and Cybernetics (2021)

work page 2021

[12] [12]

S. A. Martucci, Symmetric convolution and the discrete sine and cosine transforms, IEEE Transactions on Signal Processing (1994)

work page 1994

[13] [13]

Ahmed, T

N. Ahmed, T. Natarajan, K. R. Rao, Discrete cosine transform, IEEE transactions on Computers (2006)

work page 2006

[14] [14]

Ulicny, V

M. Ulicny, V . A. Krylov, R. Dahyot, Harmonic convolutional networks based on discrete cosine transform, Pattern Recognition (2022)

work page 2022

[15] [15]

K. Xu, M. Qin, F. Sun, Y . Wang, Y .-K. Chen, F. Ren, Learning in the frequency domain, in: Proceedings of the IEEE/CVF conference on com- puter vision and pattern recognition, 2020

work page 2020

[16] [16]

Vaswani, N

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Łukasz Kaiser, I. Polosukhin, Attention is all you need, in: Advances in Neural Information Processing Systems, 2017

work page 2017

[17] [17]

H. Wu, J. Xu, J. Wang, M. Long, Autoformer: Decomposition transform- ers with auto-correlation for long-term series forecasting, in: Advances in Neural Information Processing Systems, 2021

work page 2021

[18] [18]

T. Zhou, Q. Zhang, W. Chen, H. Wu, Z. Xie, Z. Yan, Y . Wang, L. Sun, FEDformer: Frequency enhanced decomposed transformer for long-term series forecasting, in: International Conference on Learning Representa- tions, 2022

work page 2022

[19] [19]

Y . Li, Z. Zhang, M. Zaheer, S. Kottur, Enhancing the locality and break- ing the memory bottleneck of transformer on time series forecasting, in: Advances in Neural Information Processing Systems, 2020

work page 2020

[20] [20]

Kitaev, Ł

N. Kitaev, Ł. Kaiser, A. Levskaya, Reformer: The e fficient transformer, in: International Conference on Learning Representations, 2020

work page 2020

[21] [21]

Zhang, Z

C. Zhang, Z. Yan, Crossformer: Transformer utilizing cross-dimension dependency for multivariate time series forecasting, in: International Conference on Learning Representations, 2023

work page 2023

[22] [22]

A. Zeng, M. Zhang, G. Gao, Z. Meng, J. Wang, J. Zhou, Are transformers all you need for long-term series forecasting?, in: Neural Information Processing Systems, 2023

work page 2023

[23] [23]

Y . Liu, Y . Zhang, H. Wu, M. Long, J. Wang, DLinear: Are linear time- series models really that simple?, in: AAAI 2023, 2023

work page 2023

[24] [24]

Y . Xin, Z. Ma, T. Wei, Y . Liu, Efficient attention mechanisms for long- sequence time-series modeling, in: ACM SIGKDD, 2024

work page 2024

[25] [25]

Dempster, F

A. Dempster, F. Petitjean, G. I. Webb, ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels, Data Mining and Knowledge Discovery (2020)

work page 2020

[26] [26]

Eldele, S

E. Eldele, S. Qu, X. Wu, Z. Cao, J. Ng, TS-TCC: Time-series represen- tation learning via temporal and contextual contrasting, in: International Joint Conference on Artificial Intelligence, 2021

work page 2021

[27] [27]

Z. Yue, G. Zhou, Q. Yang, S. Xu, Q. Kan, TS2Vec: Towards universal representation of time series, in: AAAI Conference on Artificial Intelli- gence, 2022

work page 2022

[28] [28]

Q. Meng, H. Qian, Y . Liu, L. Cui, Y . Xu, Z. Shen, MHCCL: masked hierarchical cluster-wise contrastive learning for multivariate time series, in: AAAI Conference on Artificial Intelligence, 2023

work page 2023

[29] [29]

P. Liu, Q. Kong, L. Jian, J. Kuang, T-WaveNet: A tree-structured wavelet neural network for time-series signal analysis, in: International Confer- ence on Learning Representations, 2022

work page 2022

[30] [30]

P. Liu, M. Zhang, C. Li, Q. Li, N. Laptev, SCINet: Time series modeling and forecasting with sample-convolve-interact networks, in: International Conference on Learning Representations, 2022

work page 2022

[31] [31]

P. Liu, B. Wu, N. Li, T. Dai, F. Lei, J. Bao, Y . Jiang, S.-T. Xia, WFTNet: Exploiting global and local periodicity in long-term time series forecast- ing, in: ACM SIGKDD, 2023

work page 2023

[32] [32]

Zhang, C

F. Zhang, C. Zhang, W. Chen, Z. Ma, TCE: Temporal convolutional ex- plorer helps understand 1d-cnns from the frequency perspective, in: ACM CIKM 2023, 2023

work page 2023

[33] [33]

Yang, H.-S

Y . Yang, H.-S. Hong, BTSF: Bilinear temporal-spectral fusion for unsu- pervised time-series representation learning, in: International Conference on Learning Representations, 2022

work page 2022

[34] [34]

A. L. Goldberger, L. A. Amaral, L. Glass, J. M. Hausdor ff, P. C. Ivanov, R. G. Mark, J. E. Mietus, G. B. Moody, C.-K. Peng, H. E. Stanley, Phys- iobank, physiotoolkit, and physionet: components of a new research re- source for complex physiologic signals, circulation (2000)

work page 2000

[35] [35]

Eldele, Z

E. Eldele, Z. Chen, C. Liu, M. Wu, C.-K. Kwoh, X. Li, C. Guan, An attention-based deep learning approach for sleep stage classification with single-channel EEG, IEEE Transactions on Neural Systems and Rehabil- itation Engineering (2021)

work page 2021

[36] [36]

Eldele, M

E. Eldele, M. Ragab, Z. Chen, M. Wu, X. Li, TSLANet: Rethinking trans- formers for time series representation learning, in: International Confer- ence on Machine Learning, 2024

work page 2024

[37] [37]

G. B. Moody, R. G. Mark, The impact of the mit-bih arrhythmia database, IEEE engineering in medicine and biology magazine (2001)

work page 2001

[38] [38]

Goldberger, UEA archive: Heartbeat data set (2016)

A. Goldberger, UEA archive: Heartbeat data set (2016)

work page 2016

[39] [39]

Anguita, A

D. Anguita, A. Ghio, L. Oneto, X. Parra, J. L. Reyes-Ortiz, et al., A public domain dataset for human activity recognition using smartphones., in: Esann, 2013

work page 2013

[40] [40]

J. R. Kwapisz, G. M. Weiss, S. A. Moore, Activity recognition using cell phone accelerometers, ACM SigKDD Explorations Newsletter (2011)

work page 2011

[41] [41]

Stisen, H

A. Stisen, H. Blunck, S. Bhattacharya, T. S. Prentow, M. B. Kjærgaard, A. Dey, T. Sonne, M. M. Jensen, Smart devices are di fferent: Assess- ing and mitigatingmobile sensing heterogeneities for activity recognition, in: Proceedings of the ACM conference on embedded networked sensor systems, 2015

work page 2015

[42] [42]

T. Zhou, P. Niu, L. Sun, R. Jin, et al., One fits all: Power general time series analysis by pretrained LM, Advances in neural information pro- cessing systems (2023)

work page 2023

[43] [43]

Y . Nie, N. H. Nguyen, P. Sinthong, J. Kalagnanam, A time series is worth 64 words: Long-term forecasting with transformers, in: International Conference on Learning Representations, 2023

work page 2023

[44] [44]

Khalighi, T

S. Khalighi, T. Sousa, J. M. Santos, U. Nunes, ISRUC-Sleep: A com- prehensive public dataset for sleep researchers, Computer methods and programs in biomedicine (2016)

work page 2016

[45] [45]

Goerttler, Y

S. Goerttler, Y . Wang, E. Eldele, M. Wu, F. He, MSA-CNN: A lightweight multi-scale CNN with attention for sleep stage classification, arXiv preprint arXiv:2501.02949 (2025)

work page arXiv 2025

[46] [46]

Supratak, H

A. Supratak, H. Dong, C. Wu, Y . Guo, DeepSleepNet: A model for auto- 11 matic sleep stage scoring based on raw single-channel eeg, IEEE transac- tions on neural systems and rehabilitation engineering (2017)

work page 2017

[47] [47]

V . J. Lawhern, A. J. Solon, N. R. Waytowich, S. M. Gordon, C. P. Hung, B. J. Lance, EEGNet: a compact convolutional neural network for eeg- based brain–computer interfaces, Journal of neural engineering (2018)

work page 2018

[48] [48]

Z. Jia, Y . Lin, J. Wang, X. Ning, Y . He, R. Zhou, Y . Zhou, L.-w. H. Lehman, Multi-view spatial-temporal graph convolutional networks with domain generalization for sleep stage classification, IEEE Transactions on Neural Systems and Rehabilitation Engineering (2021)

work page 2021

[49] [49]

X. Ji, Y . Li, P. Wen, Jumping knowledge based spatial-temporal graph convolutional networks for automatic sleep stage classification, IEEE Transactions on Neural Systems and Rehabilitation Engineering (2022)

work page 2022

[50] [50]

Y . Wang, M. Wu, X. Li, L. Xie, Z. Chen, Multivariate time-series repre- sentation learning via hierarchical correlation pooling boosted graph neu- ral network, IEEE Transactions on Artificial Intelligence (2023)

work page 2023

[51] [51]

Y . Wang, Y . Xu, J. Yang, M. Wu, X. Li, L. Xie, Z. Chen, Fully-connected spatial-temporal graph for multivariate time-series data, in: AAAI con- ference on artificial intelligence, 2024

work page 2024

[52] [52]

Z. Yang, M. Qiu, X. Fan, G. Dai, W. Ma, X. Peng, X. Fu, Y . Li, cV AN: A novel sleep staging method via cross-view alignment network, IEEE Journal of Biomedical and Health Informatics (2024). 12

work page 2024