FRWKV+: Periodic-Aware Adaptive Gating for Frequency-Space Linear Time Series Forecasting

Da Teng; Dongyue Chen; Jiaji Pan; Junhua Xiao; Qingyuan Yang; Shizhuo Deng


FRWKV-Plus adds bounded trust-gated corrections so periodic evidence refines frequency-space forecasts without overriding the base spectral interaction.

Reviewed by Pith at T0; open to challenge. T0 means a machine referee read the full paper against a public rubric.


T0 review · grok-4.3

2026-06-30 19:41 UTC pith:LLCZGECS

Load-bearing objection: FRWKV+ adds two controlled gating pieces to the FRWKV backbone for periodic cues and stays competitive on the benchmarks with no load-bearing problems.

arxiv 2605.15690 v2 pith:LLCZGECS submitted 2026-05-15 cs.LG


Qingyuan Yang, Dongyue Chen, Da Teng, Junhua Xiao, Jiaji Pan, Shizhuo Deng

classification cs.LG

keywords time series forecasting, frequency domain, periodic patterns, gating mechanism, lightweight model, multivariate forecasting, residual correction, spectral components

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

The pith

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents FRWKV-Plus as a lightweight extension to the FRWKV frequency-space forecasting model. It adds a cross-branch spectral gate that reweights each branch using a summary of its sibling, and a trust-gated residual correction that uses within-period context to adjust the gates under a learned trust score. This correction is constructed to be identity-preserving at initialization and strictly bounded, so that periodic evidence refines but never dominates the spectral interaction. On seven benchmarks the model stays competitive with linear, frequency-domain, recurrent, and Transformer forecasters while keeping the backbone's efficiency, and three-seed ablations indicate that each addition contributes, most visibly on the harder Exchange and ILI datasets.
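The cross-branch gate described above can be illustrated with a minimal sketch. The review does not give the paper's formula, so everything here is an assumption made for illustration: the function name `cross_branch_gate`, the choice of mean absolute value as the sibling summary, and the scalar sigmoid gate with hypothetical weights `w_real`, `b_real`, `w_imag`, `b_imag`.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically safe logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def cross_branch_gate(real, imag, w_real=1.0, b_real=0.0, w_imag=1.0, b_imag=0.0):
    """Reweight each spectral branch with a gate computed from its sibling.

    Assumed form (not taken from the paper): the sibling summary is the mean
    absolute value of the other branch, and the gate is a sigmoid of an
    affine map of that summary.
    """
    summary_real = sum(abs(v) for v in real) / len(real)
    summary_imag = sum(abs(v) for v in imag) / len(imag)
    gate_real = sigmoid(w_real * summary_imag + b_real)  # real branch reads imag
    gate_imag = sigmoid(w_imag * summary_real + b_imag)  # imag branch reads real
    gated_real = [gate_real * v for v in real]
    gated_imag = [gate_imag * v for v in imag]
    return gated_real, gated_imag, (gate_real, gate_imag)
```

In this sketch each branch is scaled by a single scalar in (0, 1); a per-frequency or per-channel gate would be an equally plausible reading of the description.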

Core claim

FRWKV-Plus processes real and imaginary spectral components through a cross-branch spectral gate and applies a trust-gated residual correction derived from compact within-period context. The correction produces a bounded, sign-flexible adjustment to the gates that preserves the base interaction at initialization. This allows the model to incorporate recurring temporal structure in a controlled manner, yielding competitive performance on standard time series forecasting benchmarks while preserving the lightweight profile of the backbone.

What carries the argument

The trust-gated residual correction, which turns within-period context into a bounded adjustment of cross-branch spectral gates under a data-dependent trust score while remaining identity-preserving at initialization.
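One construction that satisfies the three stated properties (bounded, sign-flexible, identity-preserving at initialization) is sketched below. It is not claimed to be the paper's formula, which the review does not reproduce. The multiplicative form, the cap `alpha`, and the scalar weights `w_delta`, `w_trust`, `b_trust` are all hypothetical.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically safe logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def trust_gated_gate(base_gate, context, w_delta=0.0, w_trust=0.0,
                     b_trust=0.0, alpha=0.5):
    """Return base_gate * (1 + alpha * trust * delta).

    Assumed construction (illustrative only):
      delta = tanh(w_delta * context)            in [-1, 1], sign-flexible
      trust = sigmoid(w_trust * context + b_trust)  in [0, 1], data-dependent
    With w_delta initialised to zero, delta is 0 and the gate is unchanged.
    With 0 <= alpha < 1, the multiplier stays in [1 - alpha, 1 + alpha].
    """
    if not 0.0 <= alpha < 1.0:
        raise ValueError("alpha must lie in [0, 1) to keep the gate's sign")
    delta = math.tanh(w_delta * context)
    trust = sigmoid(w_trust * context + b_trust)
    return base_gate * (1.0 + alpha * trust * delta)
```

Zero-initialising the weight that produces `delta` is a common way to make a residual branch start as the identity; whether the paper uses this or another mechanism is not stated in the review.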

Load-bearing premise

The trust-gated residual correction remains strictly bounded and identity-preserving at initialization so periodic evidence can only refine, never dominate or invert, the base spectral interaction.
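To make the premise concrete, here is a short worked bound under an assumed multiplicative form. The symbols are illustrative and not taken from the paper: $g$ is a base gate, $\tilde g$ the corrected gate, $u$ the raw correction computed from within-period context, $\tau \in [0,1]$ the trust score, and $\alpha \in [0,1)$ a fixed cap.

```latex
\tilde g = g\,\bigl(1 + \alpha\,\tau\,\tanh(u)\bigr),
\qquad \alpha \in [0,1),\quad \tau \in [0,1].

% Bounded: the correction never exceeds a fraction alpha of the base gate.
|\tilde g - g| = \alpha\,\tau\,|\tanh(u)|\,|g| \le \alpha\,|g|.

% Identity at initialization: if u = 0 at init, then tanh(u) = 0.
u = 0 \;\Longrightarrow\; \tilde g = g.

% No inversion: the multiplier is strictly positive.
1 + \alpha\,\tau\,\tanh(u) \ge 1 - \alpha > 0
\;\Longrightarrow\; \operatorname{sign}(\tilde g) = \operatorname{sign}(g).
```

Under this assumed form the three properties follow directly from the ranges of $\tanh$ and $\tau$; a derivation of this kind is what the referee report asks the authors to supply for their actual construction.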

What would settle it

Initialize the model parameters for the correction and confirm that the output gates match the uncorrected base gates exactly, then after training on benchmark data check that the magnitude of the correction stays within the designed bound on held-out sequences.
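The two checks described above could be scripted roughly as follows. This is a sketch that operates on plain lists of gate values; how the gates are extracted from the released implementation, and whether the designed bound is relative or absolute, are assumptions (a relative bound is used here). All function names are hypothetical.

```python
def check_identity_at_init(base_gates, corrected_gates, tol=0.0):
    """True if every corrected gate equals its base gate (within tol)."""
    return all(abs(c - b) <= tol for b, c in zip(base_gates, corrected_gates))


def max_relative_correction(base_gates, corrected_gates):
    """Largest |corrected - base| / |base| over gates with a nonzero base."""
    ratios = [
        abs(c - b) / abs(b)
        for b, c in zip(base_gates, corrected_gates)
        if b != 0.0
    ]
    return max(ratios) if ratios else 0.0


def check_bound(base_gates, corrected_gates, bound):
    """True if the largest relative correction stays within the given bound."""
    return max_relative_correction(base_gates, corrected_gates) <= bound
```

The first check would run on a freshly initialised model, the second on gates recorded from held-out sequences after training.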


If this is right

The added components each contribute to performance as shown in three-seed ablations.
The benefit appears modest on strongly periodic data but pronounced on Exchange and ILI datasets.
The within-period context emerges as the most influential component.
The model preserves the lightweight profile of the FRWKV backbone across many variables and horizons.
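Since the within-period context is singled out as the most influential component, a small sketch of what "within-period" grouping means may help. The paper's encoder is learned and is not described in detail here; the code below only illustrates grouping time steps by their position inside a period, with mean pooling as a stand-in. The name `within_period_context` and the pooling choice are assumptions.

```python
def within_period_context(series, period):
    """Mean of the series at each position within the period (index t % period).

    Illustrative stand-in for a learned encoder: time steps that share the
    same position inside the period are pooled together.
    """
    if period <= 0:
        raise ValueError("period must be positive")
    sums = [0.0] * period
    counts = [0] * period
    for t, value in enumerate(series):
        sums[t % period] += value
        counts[t % period] += 1
    return [s / c if c else 0.0 for s, c in zip(sums, counts)]
```

For example, with `period=3` the values at steps 0 and 3 are pooled, as are steps 1 and 4, and steps 2 and 5.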

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The bounded correction mechanism could be applied to other frequency-domain time series models to add periodic awareness without risking instability.
If the trust score adapts to different periodicity strengths, the approach might improve further on mixed datasets.
Testing the initialization condition directly on the gates before training would verify the identity-preserving property.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit.

Desk Editor's Note

FRWKV+ adds two controlled gating pieces to the FRWKV backbone for periodic cues and stays competitive on the benchmarks with no load-bearing problems.


FRWKV+ introduces a cross-branch spectral gate that lets each spectral branch see a summary of its sibling, plus a trust-gated residual correction that folds in within-period context under a learned trust score. Both sit on the existing FRWKV backbone and are designed so the correction stays bounded and identity-preserving at initialization. That architectural choice is the main novelty and it is presented clearly as an extension rather than a reinvention.

The paper reports competitive numbers against linear, frequency-domain, recurrent, and Transformer baselines on seven standard datasets. The three-seed ablations attribute the gains mainly to the within-period context term, with larger effects on the harder Exchange and ILI sets. Code release is a practical plus for anyone who wants to check the implementation.

The soft spots are modest and typical for the area. There are no error bars on the main tables, so the size of the improvements is harder to judge for robustness. The benefit is described as modest on strongly periodic data, which limits how much the new components move the needle in the easiest cases. Dataset details and exact hyperparameter settings are not in the abstract but presumably appear in the full methods section.

This is a paper for researchers already working on lightweight frequency-space forecasters who want a targeted way to make periodic signals adaptive. It shows clear, incremental thinking and honest comparison to strong baselines. I would send it to peer review.

Referee Report

2 major / 2 minor

Summary. The paper claims to introduce FRWKV-Plus, an extension of the FRWKV backbone for frequency-space linear time series forecasting. It adds a cross-branch spectral gate that reweights branches using sibling summaries and a trust-gated residual correction that converts within-period context into a bounded, sign-flexible adjustment under a learned trust score. The correction is asserted to be identity-preserving at initialization and strictly bounded by construction so that periodic evidence refines but never dominates the base spectral interaction. On seven standard benchmarks the model is reported to be consistently competitive with linear, frequency-domain, recurrent, and Transformer forecasters while remaining lightweight; three-seed ablations attribute gains primarily to the within-period context component, with larger benefits on Exchange and ILI, and the code is released.

Significance. If the architectural boundedness guarantee holds and the reported competitiveness is reproducible, the work supplies a practical, efficient mechanism for injecting periodic awareness into frequency-space models without introducing instability from unreliable cues. The public implementation is a clear strength that enables direct verification of the claimed properties and ablation results.

major comments (2)

[Abstract] Abstract (paragraph on the correction mechanism): the claim that the trust-gated residual correction is 'strictly bounded' and 'identity-preserving at initialization' by construction is load-bearing for the safety argument, yet no explicit derivation, proof, or initialization analysis is referenced; without it the assertion that periodic evidence 'can only refine, never dominate or invert' cannot be verified from the given description.
[Experiments] Experiments / results description: the competitive performance on seven benchmarks is stated without error bars, standard deviations, or statistical tests across the three seeds, which is required to substantiate the 'consistently competitive' claim and the differential benefit on harder datasets.

minor comments (2)

[Abstract] Abstract: the seven benchmarks are not named and no dataset statistics or preprocessing details are supplied, which reduces reproducibility even though the code link is provided.
[Ablations] Ablations: quantitative deltas (e.g., exact metric improvements when removing the within-period context) are summarized qualitatively rather than tabulated, making it harder to judge the relative influence of each component.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the positive recommendation of minor revision and for the constructive comments. We address each major point below and will revise the manuscript accordingly.


Referee: [Abstract] Abstract (paragraph on the correction mechanism): the claim that the trust-gated residual correction is 'strictly bounded' and 'identity-preserving at initialization' by construction is load-bearing for the safety argument, yet no explicit derivation, proof, or initialization analysis is referenced; without it the assertion that periodic evidence 'can only refine, never dominate or invert' cannot be verified from the given description.

Authors: We agree that an explicit derivation is needed to make the boundedness and initialization properties verifiable. In the revision we will add a concise derivation (with the relevant equations) either in Section 3 or a short appendix subsection, covering the identity-preserving initialization and the strict bounds enforced by the trust-gated residual correction.
Revision: yes

Referee: [Experiments] Experiments / results description: the competitive performance on seven benchmarks is stated without error bars, standard deviations, or statistical tests across the three seeds, which is required to substantiate the 'consistently competitive' claim and the differential benefit on harder datasets.

Authors: We acknowledge that variance across seeds was not reported. The revision will update all result tables and the accompanying text to report mean ± standard deviation over the three seeds. This will directly support the consistency claims and the differential gains on Exchange and ILI.
Revision: yes
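Reporting mean ± standard deviation over three seeds is a small computation; a sketch using only the Python standard library follows. The helper names are hypothetical, and the scores in the usage note are placeholder values chosen for illustration, not results from the paper.

```python
from statistics import mean, stdev


def summarize_seeds(scores):
    """Return (mean, sample standard deviation) over per-seed scores."""
    if len(scores) < 2:
        raise ValueError("need at least two seeds for a standard deviation")
    return mean(scores), stdev(scores)


def format_mean_std(scores, digits=3):
    """Format per-seed scores as 'mean ± std' with a fixed number of digits."""
    m, s = summarize_seeds(scores)
    return f"{m:.{digits}f} ± {s:.{digits}f}"
```

With placeholder per-seed errors such as `[0.40, 0.42, 0.41]`, `format_mean_std` produces a table cell of the form `0.410 ± 0.010`. Note that `stdev` is the sample standard deviation (n - 1 in the denominator), which is the usual choice for a handful of seeds.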

Circularity Check

0 steps flagged

No significant circularity in derivation chain


The paper presents an empirical claim of competitive performance on seven benchmarks alongside an architectural property (trust-gated residual correction being identity-preserving at initialization and strictly bounded) that is explicitly stated to hold by construction as a design choice. No load-bearing derivations, equations, or predictions are shown to reduce to fitted parameters or self-citations; the within-period context benefits are attributed via ablations rather than forced by definition. The argument relies on released implementation and external benchmark comparisons, remaining self-contained without circular reductions.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The central claim rests on the empirical performance of a new architecture whose internal mechanisms are described only at the level of the abstract; no explicit free parameters, axioms, or invented entities are stated beyond standard learned neural-network weights.

pith-pipeline@v0.9.1-grok · 5802 in / 1168 out tokens · 23359 ms · 2026-06-30T19:41:12.639637+00:00 · methodology


Original abstract

Accurate and efficient long-term multivariate time series forecasting requires capturing recurring temporal structure while keeping inference cheap across many variables and horizons. Frequency-space models represent long-range and periodic variation compactly, but they typically process the real and imaginary spectral components as weakly coupled streams and treat periodic cues as ordinary input features, even when such cues are unreliable. This paper proposes FRWKV-Plus, a lightweight periodic-aware frequency-space forecasting model built on the efficient FRWKV backbone. FRWKV-Plus introduces a cross-branch spectral gate that reweights each spectral branch using a summary of its sibling branch, and a trust-gated residual correction that converts compact within-period context into a bounded, sign-flexible adjustment of these gates under a learned, data-dependent trust score. By construction, the correction is identity-preserving at initialization and strictly bounded, so periodic evidence can refine but never dominate or invert the base interaction. On seven standard benchmarks, FRWKV-Plus is consistently competitive with strong linear, frequency-domain, recurrent-style, and Transformer-based forecasters while preserving the lightweight profile of the backbone. Controlled three-seed ablations show that each component contributes, that the benefit is modest on strongly periodic data and pronounced on the harder Exchange and ILI datasets, and that the within-period context is the most influential single component. The implementation is publicly available at https://github.com/yangqingyuan-byte/FRWKV-plus.
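For readers less familiar with the "real and imaginary spectral components" the abstract refers to, the sketch below splits a real-valued series into those two streams with a naive one-sided discrete Fourier transform. It is written for clarity, runs in O(n²), and uses only the standard library; a practical implementation would call an FFT routine instead. The function name `rfft_real_imag` is hypothetical.

```python
import cmath
import math


def rfft_real_imag(x):
    """One-sided DFT of a real series, returned as (real parts, imaginary parts).

    Computes bins k = 0 .. n // 2, which is the layout a real-input FFT
    produces. Naive O(n^2) loop, for illustration only.
    """
    n = len(x)
    real, imag = [], []
    for k in range(n // 2 + 1):
        coeff = sum(
            x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        real.append(coeff.real)
        imag.append(coeff.imag)
    return real, imag
```

A cosine with period 4, `[1, 0, -1, 0]`, puts its energy in the real part of bin 1, while a sine with the same period would appear in the imaginary part; this is why the two streams carry complementary phase information.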

Figures

Figures reproduced from arXiv: 2605.15690 by Da Teng, Dongyue Chen, Jiaji Pan, Junhua Xiao, Qingyuan Yang, Shizhuo Deng.

**Figure 2.** Tensor-level architecture of FRWKV+. The detailed diagram shows the rFFT path, the real and imaginary …

**Figure 3.** RWKV block used by the FRWKV frequency branches. The block generates receptance, key, value, gate, …

**Figure 4.** Periodic Positional Context Encoder. The embedded sequence is grouped by period position, flattened over …

**Figure 5.** Aligned ETTh2 multi-horizon prediction examples. Each panel compares the input context, ground truth, …


