Is Mamba Effective for Time Series Forecasting?

Daling Wang; Fanheng Kong; Han Zhao; Ming Wang; Shi Feng; Xiaocui Yang; Yifei Zhang; Zihan Wang

arxiv: 2403.11144 · v3 · pith:D5X63564new · submitted 2024-03-17 · 💻 cs.LG

Is Mamba Effective for Time Series Forecasting?

Zihan Wang , Fanheng Kong , Shi Feng , Ming Wang , Xiaocui Yang , Han Zhao , Daling Wang , Yifei Zhang This is my paper

classification 💻 cs.LG

keywords mambatimecomputationallayermodelpatternsseriestransformer

0 comments

read the original abstract

In the realm of time series forecasting (TSF), it is imperative for models to adeptly discern and distill hidden patterns within historical time series data to forecast future states. Transformer-based models exhibit formidable efficacy in TSF, primarily attributed to their advantage in apprehending these patterns. However, the quadratic complexity of the Transformer leads to low computational efficiency and high costs, which somewhat hinders the deployment of the TSF model in real-world scenarios. Recently, Mamba, a selective state space model, has gained traction due to its ability to process dependencies in sequences while maintaining near-linear complexity. For TSF tasks, these characteristics enable Mamba to comprehend hidden patterns as the Transformer and reduce computational overhead compared to the Transformer. Therefore, we propose a Mamba-based model named Simple-Mamba (S-Mamba) for TSF. Specifically, we tokenize the time points of each variate autonomously via a linear layer. A bidirectional Mamba layer is utilized to extract inter-variate correlations and a Feed-Forward Network is set to learn temporal dependencies. Finally, the generation of forecast outcomes through a linear mapping layer. Experiments on thirteen public datasets prove that S-Mamba maintains low computational overhead and achieves leading performance. Furthermore, we conduct extensive experiments to explore Mamba's potential in TSF tasks. Our code is available at https://github.com/wzhwzhwzh0921/S-D-Mamba.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

UniMamba: A Unified Spatial-Temporal Modeling Framework with State-Space and Attention Integration
cs.LG 2026-03 unverdicted novelty 6.0

UniMamba integrates Mamba state-space dynamics with attention layers and transforms like FFT-Laplace to outperform prior models on multivariate time series forecasting benchmarks.
Recency Biased Causal Attention for Time-series Forecasting
cs.LG 2025-02 unverdicted novelty 5.0

Introduces recency-biased causal attention via heavy-tailed decay reweighting to improve Transformer performance on time-series forecasting benchmarks.