L-Drive: Beyond a Single Mapping-Latent Context Drives Time Series Forecasting

Fan Zhang; Hua Wang; Shijun Chen

arxiv: 2605.17730 · v1 · pith:ICGD4QKUnew · submitted 2026-05-18 · 💻 cs.LG · cs.AI

L-Drive: Beyond a Single Mapping-Latent Context Drives Time Series Forecasting

Fan Zhang , Shijun Chen , Hua Wang This is my paper

Pith reviewed 2026-05-20 12:21 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords time series forecastinglatent contextregime changesdistribution shiftsgating mechanismrelative positional encodingchange detectionmultivariate forecasting

0 comments

The pith

Latent context with gating lets time series forecasters adapt to regime changes without the lag of direct mappings.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Standard time series forecasting models learn one mapping from past observations directly to future values, but this unified approach tends to lag when the underlying data distribution shifts suddenly. L-Drive adds a separate latent context that tracks higher-level evolving dynamics over time and uses a gating mechanism to adjust the predicted increments accordingly. It also employs patch-shared relative positional basis functions to capture structures inside data segments more reliably. If effective, this change-aware setup should reduce error buildup during transitions between different system behaviors. Readers in fields like energy or finance would value the potential for more reliable predictions amid frequent changes.

Core claim

L-Drive claims that by introducing a Latent-Context to explicitly characterize high-level dynamics evolving over time and using gating to modulate increment representations, the framework provides more timely change cues and improves adaptation to changing segments, while patch-shared relative positional basis functions strengthen intra-segment structural modeling and reduce overfitting from absolute-position memorization.

What carries the argument

The Latent-Context that tracks high-level temporal dynamics, combined with gating on increment representations and patch-shared relative positional basis functions.

If this is right

Forecasting accuracy improves around turning points where data patterns change abruptly.
Error accumulation is reduced within windows of distribution shifts.
Models achieve a better balance between prediction accuracy and computational efficiency.
Intra-segment structures are modeled more effectively without overfitting to specific positions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This separation of high-level dynamics from direct value mapping could apply to other sequential tasks like natural language processing or video prediction.
Future work might explore how the latent context evolves in very long sequences or non-stationary environments.
Testing on real-world datasets with documented regime changes would confirm the timely cue provision.

Load-bearing premise

That direct mapping from history to future in observation space must lag at turning points, and that the latent context plus gating supplies accurate change cues without introducing fitting instabilities.

What would settle it

If experiments on time series with known abrupt shifts show that L-Drive still exhibits similar error spikes around change points as standard direct-mapping models.

Figures

Figures reproduced from arXiv: 2605.17730 by Fan Zhang, Hua Wang, Shijun Chen.

**Figure 1.** Figure 1: Comparison on synthetic data (original settings for each baseline). Our model adapts faster, reducing lag around switches. health (Chhabra et al., 2024). In these scenarios, accurate multi-step forecasting not only directly affects resource allocation and decision quality, but also determines system safety and robustness under uncertainty. At present, many mainstream approaches for multivariate time serie… view at source ↗

**Figure 2.** Figure 2: Overview of L-Drive. It consists of two key components: (a) L-Context Generator and (b) Struct-Aided Predictor. direction of change at the initial time step: ∆x ′ = D(x ′ ), (Dx ′ )t = 0, t = 1, x ′ t − x ′ t−1, t = 2, . . . , T. (5) It should be noted that normalization mainly provides globalscale stabilization, and it cannot eliminate local spikes or instantaneous high-frequency disturbances that appe… view at source ↗

**Figure 3.** Figure 3: Visualization of L-Context on the ECL dataset [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Averaged MSE and MAE results under different dpos. 5.4.4. HYPERPARAMETER SENSITIVITY To study the impact of the capacity of the patch-level relative position basis functions on model performance, we keep all other configurations unchanged and only vary the dimension of the relative position basis dpos ∈ {2, 4, 6, 8}. We compare the results on six datasets. The experimental results are shown in [PITH_FULL… view at source ↗

**Figure 5.** Figure 5: Computational Efficiency analysis. and slight degradation occurs in some scenarios. This indicates that the patch-level relative position is mainly used to distinguish relative relationships within a segment, and low-dimensional basis functions are sufficient to express these key structures. Higher-dimensional basis functions may introduce redundant representations, which can lead to slight overfitting. 5… view at source ↗

read the original abstract

Mainstream methods for multivariate time-series forecasting largely follow the Direct-Mapping paradigm. They learn a unified mapping from history to the future in the observation space to fit value-level dependencies. However, real-world systems often undergo distribution shifts and regime changes. In such cases, a unified mapping can exhibit response lag around turning points, causing error accumulation within the switching window and reducing forecasting reliability. To address this issue, we propose L-Drive, a change-aware forecasting framework. L-Drive introduces a Latent-Context, to explicitly characterize high-level dynamics evolving over time, and uses gating to modulate increment representations. This provides more timely change cues and improves adaptation to changing segments. In addition, it incorporates patch-shared relative positional basis functions to strengthen intra-segment structural modeling and reduce overfitting caused by absolute-position memorization. Extensive experiments validate the effectiveness of L-Drive and show a better overall trade-off between forecasting accuracy and computational efficiency.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

L-Drive adds a latent context and gating on increments to cut response lag at regime shifts in time series, but the gains may just come from extra capacity rather than any forced change detection.

read the letter

The core idea is straightforward. Direct mapping from past observations to future values tends to lag when the underlying regime shifts, and the authors try to fix that by keeping a separate latent context that tracks higher-level dynamics over time. They gate the increment representations with this context and add patch-shared relative positional basis functions to handle structure inside segments without memorizing absolute positions. That combination is the actual new piece; similar latent or context tricks exist elsewhere, but the specific mix with gating on increments and the relative basis is their engineering proposal. It targets a practical pain point in non-stationary series, and the framing is honest about the limitation of unified mappings. The paper does a clean job laying out why response lag happens around turning points and why an extra context might supply faster cues. The relative positional basis also looks like a sensible way to reduce overfitting on position. The soft spot is that nothing in the architecture description forces the latent context to detect shifts earlier than a strong direct baseline would. If the latent just ends up learning a smoothed version of the same history-to-future map, any accuracy lift could be explained by added parameters rather than the claimed change-awareness. The abstract mentions extensive experiments, but without seeing the numbers, ablations, or turning-point specific metrics, it is impossible to tell whether the mechanism works as advertised or whether the gains are real. This is the kind of paper that would interest applied time-series people who already use transformers or patch-based models and want a lightweight adaptation trick for drifting data. It is not reshaping theory, but the problem it names is common enough that a careful referee could check whether the added components actually isolate the lag reduction. I would send it to review and ask specifically for ablations that hold capacity fixed and measure performance right at documented change points.

Referee Report

3 major / 2 minor

Summary. The manuscript proposes L-Drive, a change-aware framework for multivariate time series forecasting. It argues that direct-mapping methods, which learn a unified history-to-future mapping in observation space, suffer response lag at turning points under distribution shifts and regime changes. L-Drive introduces a Latent-Context to explicitly model evolving high-level dynamics, a gating mechanism to modulate increment representations for timely change cues, and patch-shared relative positional basis functions to improve intra-segment structural modeling while reducing overfitting from absolute positions. Extensive experiments are claimed to validate improved forecasting accuracy and a better accuracy-efficiency trade-off.

Significance. If the central claims hold and the improvements are isolated to the latent-context and gating mechanisms rather than extra capacity, the work could meaningfully advance non-stationary time series forecasting by providing an explicit way to handle regime shifts without lag. The patch-shared relative positional basis is a concrete technical contribution that addresses a known overfitting issue in patch-based models. However, the significance is tempered by the absence of quantitative results, error bars, or detailed ablations in the provided material, making it difficult to assess whether the framework delivers falsifiable gains over strong direct-mapping baselines.

major comments (3)

[Abstract / §2] Abstract and motivation section: the claim that any unified direct-mapping necessarily exhibits response lag around turning points is presented as a premise but lacks a supporting derivation, theorem, or controlled isolation experiment. Without an explicit comparison to a high-capacity direct-mapping baseline (e.g., deeper transformer or larger hidden size) that still uses only observation-space mapping, it remains possible that observed gains stem from increased expressivity rather than the claimed change-awareness of the Latent-Context.
[§3.2 / §3.3] Architecture description (latent context and gating): the gating is said to modulate increment representations to supply timely change cues, yet no explicit mechanism (change-point supervision, divergence penalty on the latent trajectory, or auxiliary loss) is described that would force the latent context to detect shifts earlier than a standard recurrent or attention-based encoder. If the latent context simply learns a smoothed version of the same mapping, the lag problem is not solved and any accuracy improvement could be attributable to extra parameters.
[§5] Experiments: the abstract asserts that extensive experiments validate effectiveness and a better accuracy-efficiency trade-off, but the provided material contains no quantitative tables, error bars, baseline comparisons, or ablation results. This absence makes it impossible to verify whether the added components improve adaptation to changing segments or merely increase model capacity.

minor comments (2)

[§3.4] Notation for the patch-shared relative positional basis functions should be defined with an explicit equation (e.g., Eq. (X)) rather than described only in prose, to allow readers to verify how it differs from standard relative positional encodings.
[§4] The manuscript would benefit from a clearer statement of the exact loss function used to train the Latent-Context and gating components, including any auxiliary terms.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. We address each major comment point by point below, clarifying our claims and indicating the revisions made to strengthen the manuscript.

read point-by-point responses

Referee: [Abstract / §2] Abstract and motivation section: the claim that any unified direct-mapping necessarily exhibits response lag around turning points is presented as a premise but lacks a supporting derivation, theorem, or controlled isolation experiment. Without an explicit comparison to a high-capacity direct-mapping baseline (e.g., deeper transformer or larger hidden size) that still uses only observation-space mapping, it remains possible that observed gains stem from increased expressivity rather than the claimed change-awareness of the Latent-Context.

Authors: We agree that the motivation would benefit from additional rigor. In the revised manuscript we have added a brief illustrative derivation in Section 2 showing how a single observation-space mapping must compromise across regimes, producing lag at transitions. We have also included a controlled experiment comparing L-Drive against a high-capacity direct-mapping baseline (deeper Transformer with matched parameter count). Results demonstrate that the baseline still exhibits measurable lag at turning points while L-Drive adapts faster, supporting that gains arise from the latent-context mechanism rather than capacity alone. Error bars from multiple runs are now reported. revision: yes
Referee: [§3.2 / §3.3] Architecture description (latent context and gating): the gating is said to modulate increment representations to supply timely change cues, yet no explicit mechanism (change-point supervision, divergence penalty on the latent trajectory, or auxiliary loss) is described that would force the latent context to detect shifts earlier than a standard recurrent or attention-based encoder. If the latent context simply learns a smoothed version of the same mapping, the lag problem is not solved and any accuracy improvement could be attributable to extra parameters.

Authors: We thank the referee for this observation. The latent context is trained end-to-end with the forecasting objective to capture evolving high-level dynamics; the gating then modulates increments using this state. No explicit change-point supervision or auxiliary loss is used. In the revision we have clarified this design choice in Section 3.3, added visualizations of the latent trajectory that precede observed regime shifts, and included an ablation that removes the gating and latent context while keeping total capacity comparable. The ablation shows that the performance gain exceeds what extra parameters alone would explain. revision: yes
Referee: [§5] Experiments: the abstract asserts that extensive experiments validate effectiveness and a better accuracy-efficiency trade-off, but the provided material contains no quantitative tables, error bars, baseline comparisons, or ablation results. This absence makes it impossible to verify whether the added components improve adaptation to changing segments or merely increase model capacity.

Authors: We apologize if the review copy omitted the experimental section. The full manuscript contains Section 5 with quantitative tables reporting MAE/MSE on standard benchmarks, comparisons against eight strong baselines, error bars from five random seeds, and component-wise ablations (latent context, gating, and patch-shared relative positional basis). We have added a dedicated analysis of accuracy at turning points and an accuracy-efficiency plot (FLOPs vs. error). All tables and figures are now explicitly included in the revised submission. revision: yes

Circularity Check

0 steps flagged

No significant circularity in L-Drive framework proposal

full rationale

The paper presents L-Drive as an architectural framework that augments direct-mapping time-series models with a Latent-Context module and gating to supply change cues, plus patch-shared relative positional basis functions. No equations or derivation steps are shown that define a target quantity in terms of itself, rename a fitted parameter as a prediction, or reduce the central claim to a self-citation chain. The load-bearing premise (direct mapping exhibits lag at regime shifts) is stated as an empirical observation rather than derived from prior self-work, and the proposed components are introduced as design choices whose value is assessed via external experiments. The derivation chain therefore remains self-contained and does not collapse to its inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The framework rests on the domain assumption that high-level dynamics can be usefully separated from value-level increments and that gating will reliably detect regime changes without additional supervision.

axioms (1)

domain assumption Real-world multivariate time series frequently undergo distribution shifts and regime changes that cause unified mappings to lag.
Stated in the abstract as motivation for moving beyond direct-mapping.

invented entities (1)

Latent-Context no independent evidence
purpose: To explicitly characterize high-level dynamics evolving over time.
Introduced as the core new representation; no independent evidence of its existence or properties is provided in the abstract.

pith-pipeline@v0.9.0 · 5687 in / 1079 out tokens · 35135 ms · 2026-05-20T12:21:29.547623+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

we introduce Latent-Context (L-Context) to characterize dynamic patterns that evolve over time, and use it to modulate incremental representations... gating mechanism... first-order difference... GRU(h_t) = L-Context
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean embed_strictMono_of_one_lt unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

ˆy_t ≈ ρ ˆy_{t-1} + (1-ρ) g_t + ρ Δĝ_t ... lim sup |e_t| ≤ ρ/(1-ρ) ε̄

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

50 extracted references · 50 canonical work pages · 3 internal anchors

[1]

2011 , publisher=

Usefulness of adaptive and rational expectations in economics , author=. 2011 , publisher=

work page 2011
[2]

Applied Energy , volume=

Forecasting day-ahead electricity prices: A review of state-of-the-art algorithms, best practices and an open-access benchmark , author=. Applied Energy , volume=. 2021 , publisher=

work page 2021
[3]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Unlocking the Power of Patch: Patch-Based MLP for Long-Term Time Series Forecasting , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page
[4]

A Time Series is Worth 64 Words: Long-term Forecasting with Transformers

A Time Series is Worth 64Words: Long-term Forecasting with Transformers , author=. arXiv preprint arXiv:2211.14730 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[5]

iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

itransformer: Inverted transformers are effective for time series forecasting , author=. arXiv preprint arXiv:2310.06625 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[6]

Advances in Neural Information Processing Systems , volume=

Timexer: Empowering transformers for time series forecasting with exogenous variables , author=. Advances in Neural Information Processing Systems , volume=

work page
[7]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V

CrossLinear: Plug-and-play cross-correlation embedding for time series forecasting with exogenous variables , author=. Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2 , pages=

work page
[8]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Filterts: Comprehensive frequency filtering for multivariate time series forecasting , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page
[9]

TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis

Timesnet: Temporal 2d-variation modeling for general time series analysis , author=. arXiv preprint arXiv:2210.02186 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[10]

Proceedings of the 34th ACM International Conference on Information and Knowledge Management , pages=

BALM-TSF: Balanced Multimodal Alignment for LLM-Based Time Series Forecasting , author=. Proceedings of the 34th ACM International Conference on Information and Knowledge Management , pages=

work page
[11]

The Twelfth International Conference on Learning Representations , year=

Multi-resolution diffusion models for time series forecasting , author=. The Twelfth International Conference on Learning Representations , year=

work page
[12]

arXiv preprint arXiv:2410.04442 , year=

Timebridge: Non-stationarity matters for long-term time series forecasting , author=. arXiv preprint arXiv:2410.04442 , year=

work page arXiv
[13]

International conference on learning representations , year=

Reversible instance normalization for accurate time-series forecasting against distribution shift , author=. International conference on learning representations , year=

work page
[14]

Advances in neural information processing systems , volume=

Koopa: Learning non-stationary time series dynamics with koopman predictors , author=. Advances in neural information processing systems , volume=

work page
[15]

Frontiers in Energy Research , volume=

Evaluation of electrical load demand forecasting using various machine learning algorithms , author=. Frontiers in Energy Research , volume=. 2024 , publisher=

work page 2024
[16]

IEEE Access , year=

Residential electrical load forecasting based on a real-time evidential time series prediction method , author=. IEEE Access , year=

work page
[17]

Energy Conversion and Management , volume=

Forecast-driven stochastic optimization scheduling of an energy management system for an isolated hydrogen microgrid , author=. Energy Conversion and Management , volume=. 2023 , publisher=

work page 2023
[18]

2024 , publisher=

Deep learning for financial time series prediction: A state-of-the-art review of standalone and hybrid models , author=. 2024 , publisher=

work page 2024
[19]

Machines , volume=

Predicting machine failures from multivariate time series: An industrial case study , author=. Machines , volume=. 2024 , publisher=

work page 2024
[20]

Results in Engineering , volume=

Time series trend analysis and forecasting of climate variability using deep learning in Thailand , author=. Results in Engineering , volume=. 2024 , publisher=

work page 2024
[21]

Sustainable Technology and Entrepreneurship , volume=

Sustainable and intelligent time-series models for epidemic disease forecasting and analysis , author=. Sustainable Technology and Entrepreneurship , volume=. 2024 , publisher=

work page 2024
[22]

Proceedings of the AAAI conference on artificial intelligence , volume=

Are transformers effective for time series forecasting? , author=. Proceedings of the AAAI conference on artificial intelligence , volume=

work page
[23]

Advances in neural information processing systems , volume=

Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting , author=. Advances in neural information processing systems , volume=

work page
[24]

Information Fusion , volume=

Long sequence time-series forecasting with deep learning: A survey , author=. Information Fusion , volume=. 2023 , publisher=

work page 2023
[25]

arXiv preprint arXiv:2502.10721 , year=

A comprehensive survey of deep learning for multivariate time series forecasting: A channel strategy perspective , author=. arXiv preprint arXiv:2502.10721 , year=

work page arXiv
[26]

Artificial Intelligence Review , volume=

A comprehensive survey of deep learning for time series forecasting: architectural diversity and open challenges , author=. Artificial Intelligence Review , volume=. 2025 , publisher=

work page 2025
[27]

Philosophical transactions of the royal society a: mathematical, physical and engineering sciences , volume=

Time-series forecasting with deep learning: a survey , author=. Philosophical transactions of the royal society a: mathematical, physical and engineering sciences , volume=. 2021 , publisher=

work page 2021
[28]

Beyond the time domain: Recent advances on frequency transforms in time series analysis.arXiv preprint arXiv:2504.07099, 2025

Time series analysis in frequency domain: A survey of open challenges opportunities and benchmarks , author=. arXiv: 2504.07099 , year=

work page arXiv
[29]

Multivariate Behavioral Research , volume=

Non-stationarity in time-series analysis: Modeling stochastic and deterministic trends , author=. Multivariate Behavioral Research , volume=. 2025 , publisher=

work page 2025
[30]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V

A survey on deep learning based time series analysis with frequency transformation , author=. Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2 , pages=

work page
[31]

Mathematics , volume=

Deep time series forecasting models: A comprehensive survey , author=. Mathematics , volume=. 2024 , publisher=

work page 2024
[32]

Information , volume=

Deep learning for time series forecasting: Advances and open problems , author=. Information , volume=. 2023 , publisher=

work page 2023
[33]

Recent Advances in Computer Science and Communications (Formerly: Recent Patents on Computer Science) , volume=

Contemporary approaches to analyze non-stationary time-series: Some solutions and challenges , author=. Recent Advances in Computer Science and Communications (Formerly: Recent Patents on Computer Science) , volume=. 2023 , publisher=

work page 2023
[34]

Forecasting with artificial intelligence: theory and applications , pages=

Handling concept drift in global time series forecasting , author=. Forecasting with artificial intelligence: theory and applications , pages=. 2023 , publisher=

work page 2023
[35]

arXiv preprint arXiv:2401.17548 , year=

Rethinking channel dependence for multivariate time series forecasting: Learning from leading indicators , author=. arXiv preprint arXiv:2401.17548 , year=

work page arXiv
[36]

Advances in Neural Information Processing Systems , volume=

From similarity to superiority: Channel clustering for time series forecasting , author=. Advances in Neural Information Processing Systems , volume=

work page
[37]

Advances in Neural Information Processing Systems , volume=

Rethinking fourier transform from a basis functions perspective for long-term time series forecasting , author=. Advances in Neural Information Processing Systems , volume=

work page
[38]

Journal of Machine Learning Research , volume=

From fourier to koopman: Spectral methods for long-term time series prediction , author=. Journal of Machine Learning Research , volume=

work page
[39]

Proceedings of the AAAI conference on artificial intelligence , volume=

Informer: Beyond efficient transformer for long sequence time-series forecasting , author=. Proceedings of the AAAI conference on artificial intelligence , volume=

work page
[40]

Wu, Xingjian and Qiu, Xiangfei and Li, Zhengyu and Wang, Yihang and Hu, Jilin and Guo, Chenjuan and Xiong, Hui and Yang, Bin , booktitle=

work page
[41]

ICLR , year=

Gcgnet: Graph-consistent generative network for time series forecasting with exogenous variables , author=. ICLR , year=

work page
[42]

ICLR , year=

ASTGI: Adaptive Spatio-Temporal Graph Interactions for Irregular Multivariate Time Series Forecasting , author=. ICLR , year=

work page
[43]

AAAI , year=

Rethinking Irregular Time Series Forecasting: A Simple yet Effective Baseline , author=. AAAI , year=

work page
[44]

Jensen and Zhenli Sheng and Bin Yang , booktitle =

Xiangfei Qiu and Jilin Hu and Lekui Zhou and Xingjian Wu and Junyang Du and Buang Zhang and Chenjuan Guo and Aoying Zhou and Christian S. Jensen and Zhenli Sheng and Bin Yang , booktitle =

work page
[45]

TAB: Unified Benchmarking of Time Series Anomaly Detection Methods , author =. Proc. 2025 , pages =

work page 2025
[46]

IEEE Transactions on Knowledge and Data Engineering , year=

STABLE: Efficient Hybrid Nearest Neighbor Search via Magnitude-Uniformity and Cardinality-Robustness , author=. IEEE Transactions on Knowledge and Data Engineering , year=

work page
[47]

ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pages=

Pair: Complementarity-guided disentanglement for composed image retrieval , author=. ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pages=. 2025 , organization=

work page 2025
[48]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

INTENT: Invariance and Discrimination-aware Noise Mitigation for Robust Composed Image Retrieval , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page
[49]

Proceedings of the ACM International Conference on Multimedia , pages =

OFFSET: Segmentation-based Focus Shift Revision for Composed Image Retrieval , author =. Proceedings of the ACM International Conference on Multimedia , pages =

work page
[50]

2026 , eprint=

Air-Know: Arbiter-Calibrated Knowledge-Internalizing Robust Network for Composed Image Retrieval , author=. 2026 , eprint=

work page 2026

[1] [1]

2011 , publisher=

Usefulness of adaptive and rational expectations in economics , author=. 2011 , publisher=

work page 2011

[2] [2]

Applied Energy , volume=

Forecasting day-ahead electricity prices: A review of state-of-the-art algorithms, best practices and an open-access benchmark , author=. Applied Energy , volume=. 2021 , publisher=

work page 2021

[3] [3]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Unlocking the Power of Patch: Patch-Based MLP for Long-Term Time Series Forecasting , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page

[4] [4]

A Time Series is Worth 64 Words: Long-term Forecasting with Transformers

A Time Series is Worth 64Words: Long-term Forecasting with Transformers , author=. arXiv preprint arXiv:2211.14730 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[5] [5]

iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

itransformer: Inverted transformers are effective for time series forecasting , author=. arXiv preprint arXiv:2310.06625 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[6] [6]

Advances in Neural Information Processing Systems , volume=

Timexer: Empowering transformers for time series forecasting with exogenous variables , author=. Advances in Neural Information Processing Systems , volume=

work page

[7] [7]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V

CrossLinear: Plug-and-play cross-correlation embedding for time series forecasting with exogenous variables , author=. Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2 , pages=

work page

[8] [8]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Filterts: Comprehensive frequency filtering for multivariate time series forecasting , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page

[9] [9]

TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis

Timesnet: Temporal 2d-variation modeling for general time series analysis , author=. arXiv preprint arXiv:2210.02186 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[10] [10]

Proceedings of the 34th ACM International Conference on Information and Knowledge Management , pages=

BALM-TSF: Balanced Multimodal Alignment for LLM-Based Time Series Forecasting , author=. Proceedings of the 34th ACM International Conference on Information and Knowledge Management , pages=

work page

[11] [11]

The Twelfth International Conference on Learning Representations , year=

Multi-resolution diffusion models for time series forecasting , author=. The Twelfth International Conference on Learning Representations , year=

work page

[12] [12]

arXiv preprint arXiv:2410.04442 , year=

Timebridge: Non-stationarity matters for long-term time series forecasting , author=. arXiv preprint arXiv:2410.04442 , year=

work page arXiv

[13] [13]

International conference on learning representations , year=

Reversible instance normalization for accurate time-series forecasting against distribution shift , author=. International conference on learning representations , year=

work page

[14] [14]

Advances in neural information processing systems , volume=

Koopa: Learning non-stationary time series dynamics with koopman predictors , author=. Advances in neural information processing systems , volume=

work page

[15] [15]

Frontiers in Energy Research , volume=

Evaluation of electrical load demand forecasting using various machine learning algorithms , author=. Frontiers in Energy Research , volume=. 2024 , publisher=

work page 2024

[16] [16]

IEEE Access , year=

Residential electrical load forecasting based on a real-time evidential time series prediction method , author=. IEEE Access , year=

work page

[17] [17]

Energy Conversion and Management , volume=

Forecast-driven stochastic optimization scheduling of an energy management system for an isolated hydrogen microgrid , author=. Energy Conversion and Management , volume=. 2023 , publisher=

work page 2023

[18] [18]

2024 , publisher=

Deep learning for financial time series prediction: A state-of-the-art review of standalone and hybrid models , author=. 2024 , publisher=

work page 2024

[19] [19]

Machines , volume=

Predicting machine failures from multivariate time series: An industrial case study , author=. Machines , volume=. 2024 , publisher=

work page 2024

[20] [20]

Results in Engineering , volume=

Time series trend analysis and forecasting of climate variability using deep learning in Thailand , author=. Results in Engineering , volume=. 2024 , publisher=

work page 2024

[21] [21]

Sustainable Technology and Entrepreneurship , volume=

Sustainable and intelligent time-series models for epidemic disease forecasting and analysis , author=. Sustainable Technology and Entrepreneurship , volume=. 2024 , publisher=

work page 2024

[22] [22]

Proceedings of the AAAI conference on artificial intelligence , volume=

Are transformers effective for time series forecasting? , author=. Proceedings of the AAAI conference on artificial intelligence , volume=

work page

[23] [23]

Advances in neural information processing systems , volume=

Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting , author=. Advances in neural information processing systems , volume=

work page

[24] [24]

Information Fusion , volume=

Long sequence time-series forecasting with deep learning: A survey , author=. Information Fusion , volume=. 2023 , publisher=

work page 2023

[25] [25]

arXiv preprint arXiv:2502.10721 , year=

A comprehensive survey of deep learning for multivariate time series forecasting: A channel strategy perspective , author=. arXiv preprint arXiv:2502.10721 , year=

work page arXiv

[26] [26]

Artificial Intelligence Review , volume=

A comprehensive survey of deep learning for time series forecasting: architectural diversity and open challenges , author=. Artificial Intelligence Review , volume=. 2025 , publisher=

work page 2025

[27] [27]

Philosophical transactions of the royal society a: mathematical, physical and engineering sciences , volume=

Time-series forecasting with deep learning: a survey , author=. Philosophical transactions of the royal society a: mathematical, physical and engineering sciences , volume=. 2021 , publisher=

work page 2021

[28] [28]

Beyond the time domain: Recent advances on frequency transforms in time series analysis.arXiv preprint arXiv:2504.07099, 2025

Time series analysis in frequency domain: A survey of open challenges opportunities and benchmarks , author=. arXiv: 2504.07099 , year=

work page arXiv

[29] [29]

Multivariate Behavioral Research , volume=

Non-stationarity in time-series analysis: Modeling stochastic and deterministic trends , author=. Multivariate Behavioral Research , volume=. 2025 , publisher=

work page 2025

[30] [30]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V

A survey on deep learning based time series analysis with frequency transformation , author=. Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2 , pages=

work page

[31] [31]

Mathematics , volume=

Deep time series forecasting models: A comprehensive survey , author=. Mathematics , volume=. 2024 , publisher=

work page 2024

[32] [32]

Information , volume=

Deep learning for time series forecasting: Advances and open problems , author=. Information , volume=. 2023 , publisher=

work page 2023

[33] [33]

Recent Advances in Computer Science and Communications (Formerly: Recent Patents on Computer Science) , volume=

Contemporary approaches to analyze non-stationary time-series: Some solutions and challenges , author=. Recent Advances in Computer Science and Communications (Formerly: Recent Patents on Computer Science) , volume=. 2023 , publisher=

work page 2023

[34] [34]

Forecasting with artificial intelligence: theory and applications , pages=

Handling concept drift in global time series forecasting , author=. Forecasting with artificial intelligence: theory and applications , pages=. 2023 , publisher=

work page 2023

[35] [35]

arXiv preprint arXiv:2401.17548 , year=

Rethinking channel dependence for multivariate time series forecasting: Learning from leading indicators , author=. arXiv preprint arXiv:2401.17548 , year=

work page arXiv

[36] [36]

Advances in Neural Information Processing Systems , volume=

From similarity to superiority: Channel clustering for time series forecasting , author=. Advances in Neural Information Processing Systems , volume=

work page

[37] [37]

Advances in Neural Information Processing Systems , volume=

Rethinking fourier transform from a basis functions perspective for long-term time series forecasting , author=. Advances in Neural Information Processing Systems , volume=

work page

[38] [38]

Journal of Machine Learning Research , volume=

From fourier to koopman: Spectral methods for long-term time series prediction , author=. Journal of Machine Learning Research , volume=

work page

[39] [39]

Proceedings of the AAAI conference on artificial intelligence , volume=

Informer: Beyond efficient transformer for long sequence time-series forecasting , author=. Proceedings of the AAAI conference on artificial intelligence , volume=

work page

[40] [40]

Wu, Xingjian and Qiu, Xiangfei and Li, Zhengyu and Wang, Yihang and Hu, Jilin and Guo, Chenjuan and Xiong, Hui and Yang, Bin , booktitle=

work page

[41] [41]

ICLR , year=

Gcgnet: Graph-consistent generative network for time series forecasting with exogenous variables , author=. ICLR , year=

work page

[42] [42]

ICLR , year=

ASTGI: Adaptive Spatio-Temporal Graph Interactions for Irregular Multivariate Time Series Forecasting , author=. ICLR , year=

work page

[43] [43]

AAAI , year=

Rethinking Irregular Time Series Forecasting: A Simple yet Effective Baseline , author=. AAAI , year=

work page

[44] [44]

Jensen and Zhenli Sheng and Bin Yang , booktitle =

Xiangfei Qiu and Jilin Hu and Lekui Zhou and Xingjian Wu and Junyang Du and Buang Zhang and Chenjuan Guo and Aoying Zhou and Christian S. Jensen and Zhenli Sheng and Bin Yang , booktitle =

work page

[45] [45]

TAB: Unified Benchmarking of Time Series Anomaly Detection Methods , author =. Proc. 2025 , pages =

work page 2025

[46] [46]

IEEE Transactions on Knowledge and Data Engineering , year=

STABLE: Efficient Hybrid Nearest Neighbor Search via Magnitude-Uniformity and Cardinality-Robustness , author=. IEEE Transactions on Knowledge and Data Engineering , year=

work page

[47] [47]

ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pages=

Pair: Complementarity-guided disentanglement for composed image retrieval , author=. ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pages=. 2025 , organization=

work page 2025

[48] [48]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

INTENT: Invariance and Discrimination-aware Noise Mitigation for Robust Composed Image Retrieval , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page

[49] [49]

Proceedings of the ACM International Conference on Multimedia , pages =

OFFSET: Segmentation-based Focus Shift Revision for Composed Image Retrieval , author =. Proceedings of the ACM International Conference on Multimedia , pages =

work page

[50] [50]

2026 , eprint=

Air-Know: Arbiter-Calibrated Knowledge-Internalizing Robust Network for Composed Image Retrieval , author=. 2026 , eprint=

work page 2026