Online Irregular Multivariate Time Series Forecasting via Uncertainty-Driven Dual-Expert Calibration

Hanyang Chen; Haonan Wen; Songhe Feng

arxiv: 2605.28603 · v1 · pith:DCH6YOCXnew · submitted 2026-05-27 · 💻 cs.LG · cs.AI

Online Irregular Multivariate Time Series Forecasting via Uncertainty-Driven Dual-Expert Calibration

Haonan Wen , Hanyang Chen , Songhe Feng This is my paper

Pith reviewed 2026-06-29 14:22 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords online forecastingirregular multivariate time seriesuncertainty estimationdual-expert calibrationonline adaptationdistribution shiftmodel calibration

0 comments

The pith

Uncertainty estimates route samples between dual experts to adapt a frozen forecasting model for online irregular multivariate time series.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper seeks to maintain forecasting accuracy when irregular multivariate time series encounter distribution shifts during online deployment, a setting where standard temporal patterns break down and full retraining is costly. It establishes that a lightweight calibration module driven by uncertainty can handle adaptation while the original model stays unchanged. The uncertainty estimator first scores each incoming batch, after which an adaptive router sends high-uncertainty samples to one expert for targeted updates and low-uncertainty samples to the other for stable refinement. Reliable samples then improve both the reliable expert and the uncertainty estimator, while challenging samples refine the second expert. A reader would care because this yields consistent benchmark gains at low added cost without touching the base forecaster.

Core claim

Under-Cali is an uncertainty-driven dual-expert calibration framework for online IMTS forecasting. An uncertainty estimator acts as the central control signal that first evaluates each batch. The adaptive routing module then sends high-uncertainty samples to the unreliable expert for calibration and low-uncertainty samples to the reliable expert. The system subsequently updates the reliable expert and the uncertainty estimator using well-calibrated samples while updating the unreliable expert with difficult samples, all while the source forecasting model remains frozen. This produces stable, efficient online adaptation through a model-agnostic lightweight module.

What carries the argument

The uncertainty estimator that jointly controls inference and adaptation by routing batches to a reliable expert or an unreliable expert for selective updates.

If this is right

The source forecasting model can remain frozen while adaptation occurs only in the lightweight dual-expert module.
High-uncertainty samples receive calibration from the unreliable expert while low-uncertainty samples reinforce the reliable expert.
Updates to the reliable expert and uncertainty estimator use only well-calibrated samples, preserving stability.
The framework remains model-agnostic and adds only modest computational overhead during online operation.
Consistent accuracy gains appear across standard IMTS benchmark datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same uncertainty-routing pattern could apply to online adaptation in other irregularly sampled domains such as event streams or sensor networks.
If uncertainty estimates become more accurate, the separation between reliable and unreliable experts might sharpen further without extra design changes.
The dual-expert split offers a template for lightweight online calibration in non-time-series tasks that also face distribution drift.

Load-bearing premise

Uncertainty estimates from the estimator can reliably decide routing and produce stable expert updates without any changes to the base forecasting model.

What would settle it

Run the IMTS benchmarks with the uncertainty estimator replaced by random routing decisions and check whether the reported performance gains disappear or reverse.

Figures

Figures reproduced from arXiv: 2605.28603 by Hanyang Chen, Haonan Wen, Songhe Feng.

**Figure 2.** Figure 2: The framework of Under-Cali. For each incoming batch, (1) samples are calibrated by the reliable expert first for preliminary predictions, which are then fed into the uncertainty estimator along with input to compute per-sample uncertainty scores. Guided by these scores, the adaptive routing module directs high-uncertainty samples to the unreliable expert for secondary calibration to obtain final predictio… view at source ↗

**Figure 3.** Figure 3: Parameter Study of Human Activity on tPatchGNN. [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Case study on tPatchGNN [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗

read the original abstract

Irregular multivariate time series forecasting is critical in many real-world applications, where time series are irregularly sampled and exhibit dynamically evolving missingness patterns. Although existing methods perform well in offline settings, they often suffer from significant performance degradation when deployed online due to dynamic shifts in data distribution. Maintaining forecasting capability in such dynamic scenarios typically necessitates online adaptation techniques. Since irregular sampling fundamentally undermines temporal continuity and periodicity, we cannot leverage these widely studied characteristics from regular MTS for online learning. To this end, we study the problem of online IMTS forecasting and propose Under-Cali, an uncertainty-driven dual-expert calibration framework consisting of three core components: an uncertainty estimator, a dual-expert calibration module, and an adaptive routing module. We design an uncertainty estimator that serves as the core control signal to jointly manage inference and adaptation processes. In our framework, the uncertainty estimator first assesses uncertainty for each incoming batch. The adaptive routing module then directs samples with high uncertainty to the unreliable expert for calibration, while low uncertainty samples remain with the reliable expert. Subsequently, the system updates the reliable expert and the uncertainty estimator using well-calibrated reliable samples, and updates the unreliable expert with challenging samples, enabling stable and efficient online learning. Under-Cali keeps the source forecasting model frozen and performs adaptation only through a lightweight, model-agnostic calibration module, enabling efficient adaptation. Extensive experiments on IMTS benchmarks demonstrate consistent improvements with low computational cost. Our code is available at https://github.com/HaonanWen/Under-Cali.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Under-Cali adds a lightweight dual-expert calibration layer driven by uncertainty routing for online IMTS while freezing the base model, but the bootstrap stability of that uncertainty signal is not obviously secured.

read the letter

The core contribution is a practical setup for online adaptation on irregular multivariate time series. Under-Cali freezes the source forecaster and routes batches through an uncertainty estimator to two calibration experts: high-uncertainty samples go to the unreliable expert, low-uncertainty ones stay with the reliable expert. The reliable expert and the estimator are then updated on the well-calibrated samples while the unreliable expert sees the harder ones. This keeps adaptation cheap and model-agnostic.

The framing is new enough in the online IMTS setting. Treating uncertainty as the explicit control signal for both routing and selective updates, rather than relying on temporal continuity that irregular sampling destroys, is a reasonable way to handle evolving missingness. Releasing code helps.

The main soft spot is the initialization and stability of the uncertainty estimator itself. The routing logic assumes the estimator already produces usable scores on the first incoming batches before any updates occur. Under distribution shift in missingness patterns, early routing errors could feed bad samples to both experts and degrade the whole system. The abstract does not spell out pre-training, warm-starting, or regularization steps that would prevent this. Experiments are claimed to show consistent gains at low cost, but without numbers or ablation on the estimator's early behavior it is hard to judge whether the mechanism actually works as described.

This paper is for people already working on online time-series methods who need a lightweight adaptation module. It is coherent on its own terms and deserves a serious referee to examine the full experiments and the uncertainty estimator's behavior under shift.

Referee Report

2 major / 1 minor

Summary. The paper proposes Under-Cali, an uncertainty-driven dual-expert calibration framework for online irregular multivariate time series (IMTS) forecasting. It keeps the source forecasting model frozen and performs adaptation via a lightweight, model-agnostic module with three components: an uncertainty estimator that assesses incoming batches, an adaptive routing module that directs high-uncertainty samples to an unreliable expert and low-uncertainty samples to a reliable expert, and dual-expert updates where reliable samples update the reliable expert plus the estimator while challenging samples update the unreliable expert. The approach claims to enable stable, efficient online learning under dynamic missingness patterns, with experiments on IMTS benchmarks showing consistent improvements at low computational cost.

Significance. If the uncertainty-driven routing and selective updates prove stable without degrading the frozen base model, the framework could provide a practical, model-agnostic solution for online IMTS adaptation in domains with evolving distributions, where standard online learning fails due to irregular sampling. The emphasis on keeping the source model frozen and using only lightweight calibration is a potential strength for deployment efficiency if empirically validated.

major comments (2)

[Framework description (§3)] Framework description (abstract and §3): The central mechanism requires the uncertainty estimator to produce reliable routing signals from the initial batches onward to avoid corrupting the dual experts, yet no initialization, pre-training step, or regularization is described that would secure accurate uncertainty estimates before any updates occur under evolving missingness. This bootstrap issue is load-bearing for the stability claim.
[Experimental section (§4)] Experimental validation (abstract and §4): The claims of 'consistent improvements' and 'low computational cost' on IMTS benchmarks are stated without any reported metrics, baselines, error bars, ablation results, or statistical tests in the provided text, preventing assessment of whether the dual-expert calibration actually outperforms alternatives.

minor comments (1)

[Abstract] The GitHub link for code is provided, which supports reproducibility; ensure the released code includes the exact initialization procedure for the uncertainty estimator once added.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major comment point by point below and will revise the manuscript to incorporate clarifications and additional details where needed.

read point-by-point responses

Referee: [Framework description (§3)] Framework description (abstract and §3): The central mechanism requires the uncertainty estimator to produce reliable routing signals from the initial batches onward to avoid corrupting the dual experts, yet no initialization, pre-training step, or regularization is described that would secure accurate uncertainty estimates before any updates occur under evolving missingness. This bootstrap issue is load-bearing for the stability claim.

Authors: We agree that the bootstrap issue for the uncertainty estimator is important for ensuring reliable routing signals from the outset. The submitted manuscript does not describe an explicit initialization, pre-training, or regularization procedure for the uncertainty estimator. In the revised version, we will add a dedicated subsection in §3 detailing the initialization strategy (e.g., a short warm-up phase on initial batches or pre-training the estimator on source-domain data) to secure accurate estimates before online updates begin under dynamic missingness. revision: yes
Referee: [Experimental section (§4)] Experimental validation (abstract and §4): The claims of 'consistent improvements' and 'low computational cost' on IMTS benchmarks are stated without any reported metrics, baselines, error bars, ablation results, or statistical tests in the provided text, preventing assessment of whether the dual-expert calibration actually outperforms alternatives.

Authors: We acknowledge that the provided text does not include the specific quantitative metrics, baselines, error bars, ablation results, or statistical tests needed to fully evaluate the claims. Although the manuscript references extensive experiments, we will revise §4 to explicitly report all required details (including tables with metrics, comparisons to baselines, error bars, ablations, and statistical significance tests) so that the performance improvements and computational efficiency can be properly assessed. revision: yes

Circularity Check

0 steps flagged

No circularity in framework proposal

full rationale

The paper proposes Under-Cali as a new conceptual framework with an uncertainty estimator, dual-expert calibration module, and adaptive routing module. No equations, parameter-fitting procedures, or self-citations are described that would reduce the claimed online adaptation or performance improvements to quantities defined by their own inputs. The central claim of lightweight, model-agnostic calibration with the base model frozen is presented as an independent design choice rather than a self-referential derivation, rendering the method self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities; assessment is limited to the high-level description of the framework.

pith-pipeline@v0.9.1-grok · 5803 in / 935 out tokens · 36163 ms · 2026-06-29T14:22:10.297148+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

46 extracted references · 11 canonical work pages · 3 internal anchors

[1]

Oren Anava, Elad Hazan, Shie Mannor, and Ohad Shamir. 2013. Online learning for time series prediction. InConference on learning theory. PMLR, 172–184

2013
[2]

Marin Biloš, Johanna Sommer, Syama Sundar Rangapuram, Tim Januschowski, and Stephan Günnemann. 2021. Neural flows: Efficient alternative to neural ODEs.Advances in neural information processing systems34 (2021), 21325–21337

2021
[3]

Zhengping Che, Sanjay Purushotham, Kyunghyun Cho, David Sontag, and Yan Liu. 2018. Recurrent neural networks for multivariate time series with missing values.Scientific reports8, 1 (2018), 6085

2018
[4]

Ranak Roy Chowdhury, Jiacheng Li, Xiyuan Zhang, Dezhi Hong, Rajesh K Gupta, and Jingbo Shang. 2023. Primenet: Pre-training for irregular multivariate time series. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 7184–7192

2023
[5]

Edward De Brouwer, Jaak Simm, Adam Arany, and Yves Moreau. 2019. GRU- ODE-Bayes: Continuous modeling of sporadically-observed time series.Advances in neural information processing systems32 (2019)

2019
[6]

Yarin Gal and Zoubin Ghahramani. 2016. Dropout as a bayesian approximation: Representing model uncertainty in deep learning. Ininternational conference on machine learning. PMLR, 1050–1059

2016
[7]

Jindong Han, Weijia Zhang, Hao Liu, Tao Tao, Naiqiang Tan, and Hui Xiong
[8]

Bigst: Linear complexity spatio-temporal graph neural network for traffic forecasting on large-scale road networks.Proceedings of the VLDB Endowment 17, 5 (2024), 1081–1090

2024
[9]

Max Horn, Michael Moor, Christian Bock, Bastian Rieck, and Karsten Borg- wardt. 2020. Set functions for time series. InInternational Conference on Machine Learning. PMLR, 4353–4363

2020
[10]

AE Johnson, Tom J Pollard, Lu Shen, Li-wei H Lehman, Mengling Feng, Moham- mad Ghassemi, Benjamin Moody, Peter Szolovits, L Anthony Celi, and Roger G Mark. 2016. MIMIC-III, a freely accessible critical care database, Sci.Data3, 1 (2016), 1–9

2016
[11]

HyunGi Kim, Siwon Kim, Jisoo Mok, and Sungroh Yoon. 2025. Battling the non- stationarity in time series forecasting via test-time adaptation. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 39. 17868–17876

2025
[12]

Nikita Kitaev, Łukasz Kaiser, and Anselm Levskaya. 2020. Reformer: The efficient transformer.arXiv preprint arXiv:2001.04451(2020)

work page internal anchor Pith review Pith/arXiv arXiv 2020
[13]

Ying-yee Ava Lau, Zhiwen Shao, and Dit-Yan Yeung. 2025. Fast and Slow Streams for Online Time Series Forecasting Without Information Leakage. InThe Thir- teenth International Conference on Learning Representations

2025
[14]

Boyuan Li, Zhen Liu, Yicheng Luo, and Qianli Ma. 2026. Learning Recursive Multi-Scale Representations for Irregular Multivariate Time Series Forecasting. InInternational Conference on Learning Representations

2026
[15]

Boyuan Li, Yicheng Luo, Zhen Liu, Junhao Zheng, Jianming Lv, and Qianli Ma
[16]

HyperIMTS: Hypergraph Neural Network for Irregular Multivariate Time Series Forecasting.arXiv preprint arXiv:2505.17431(2025)

work page arXiv 2025
[17]

Yong Liu, Tengge Hu, Haoran Zhang, Haixu Wu, Shiyu Wang, Lintao Ma, and Mingsheng Long. 2023. itransformer: Inverted transformers are effective for time series forecasting.arXiv preprint arXiv:2310.06625(2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023
[18]

Zhiding Liu, Mingyue Cheng, Guanhao Zhao, Jiqian Yang, Qi Liu, and Enhong Chen. 2025. Improving Time Series Forecasting via Instance-aware Post-hoc Revision.arXiv preprint arXiv:2505.23583(2025)

work page arXiv 2025
[19]

Yicheng Luo, Zhen Liu, Linghao Wang, Binquan Wu, Junhao Zheng, and Qianli Ma. 2024. Knowledge-empowered dynamic graph network for irregularly sam- pled medical time series.Advances in Neural Information Processing Systems37 (2024), 67172–67199

2024
[20]

Yicheng Luo, Bowen Zhang, Zhen Liu, and Qianli Ma. [n. d.]. Hi-Patch: Hi- erarchical Patch GNN for Irregular Multivariate Time Series. InForty-second International Conference on Machine Learning
[21]

Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research9, Nov (2008), 2579–2605

2008
[22]

2016.Long-term daily and monthly climate records from stations across the contiguous United States (US Historical Climatology Network)

MJ Menne, CN Williams Jr, and RS Vose. 2016.Long-term daily and monthly climate records from stations across the contiguous United States (US Historical Climatology Network). Technical Report. Environmental System Science Data Infrastructure for a Virtual Ecosystem

2016
[23]

Giangiacomo Mercatali, Andre Freitas, and Jie Chen. 2024. Graph neural flows for unveiling systemic interactions among irregularly sampled time series.Advances in Neural Information Processing Systems37 (2024), 57183–57206

2024
[24]

Y Nie. 2022. A Time Series is Worth 64Words: Long-term Forecasting with Transformers.arXiv preprint arXiv:2211.14730(2022)

work page internal anchor Pith review Pith/arXiv arXiv 2022
[25]

Quang Pham, Chenghao Liu, Doyen Sahoo, and Steven CH Hoi. 2022. Learning fast and slow for online time series forecasting.arXiv preprint arXiv:2202.11672 (2022)

work page arXiv 2022
[26]

Yulia Rubanova, Ricky TQ Chen, and David K Duvenaud. 2019. Latent ordinary differential equations for irregularly-sampled time series.Advances in neural information processing systems32 (2019)

2019
[27]

Mona Schirmer, Mazin Eltayeb, Stefan Lessmann, and Maja Rudolph. 2022. Mod- eling irregular time series with continuous recurrent units. InInternational con- ference on machine learning. PMLR, 19388–19405

2022
[28]

Zongjiang Shang, Ling Chen, Binqing Wu, and Dongliang Cui. 2024. Ada- MSHyper: adaptive multi-scale hypergraph transformer for time series forecast- ing.Advances in Neural Information Processing Systems37 (2024), 33310–33337

2024
[29]

Satya Narayan Shukla and Benjamin M Marlin. 2020. A survey on principles, models and methods for learning from irregularly sampled time series.arXiv preprint arXiv:2012.00168(2020)

work page arXiv 2020
[30]

Satya Narayan Shukla and Benjamin M Marlin. 2021. Multi-time attention networks for irregularly sampled time series.arXiv preprint arXiv:2101.10318 (2021)

work page arXiv 2021
[31]

Ikaro Silva, George Moody, Daniel J Scott, Leo A Celi, and Roger G Mark. 2012. Predicting in-hospital mortality of icu patients: The physionet/computing in cardiology challenge 2012. In2012 computing in cardiology. IEEE, 245–248

2012
[32]

Roberto Vio, María Diaz-Trigo, and Paola Andreani. 2013. Irregular time series in astronomy and the use of the Lomb–Scargle periodogram.Astronomy and Computing1 (2013), 5–16

2013
[33]

Qingsong Wen, Weiqi Chen, Liang Sun, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan, et al. 2023. Onenet: Enhancing time series forecasting models under concept drift by online ensembling.Advances in Neural Information Processing Systems36 (2023), 69949–69980

2023
[34]

Vijaya Krishna Yalavarthi, Kiran Madhusudhanan, Randolf Scholz, Nourhan Ahmed, Johannes Burchert, Shayan Jawed, Stefan Born, and Lars Schmidt-Thieme
[35]

InProceed- ings of the AAAI Conference on Artificial Intelligence, Vol

Grafiti: Graphs for forecasting irregularly sampled time series. InProceed- ings of the AAAI Conference on Artificial Intelligence, Vol. 38. 16255–16263
[36]

Zhen-Jie Yao, Jie Bi, and Yi-Xin Chen. 2018. Applying deep learning to individual and community health monitoring data: A survey.International Journal of Automation and Computing15, 6 (2018), 643–655

2018
[37]

Kun Yi, Qi Zhang, Wei Fan, Shoujin Wang, Pengyang Wang, Hui He, Ning An, Defu Lian, Longbing Cao, and Zhendong Niu. 2023. Frequency-domain MLPs are more effective learners in time series forecasting.Advances in Neural Information Processing Systems36 (2023), 76656–76679

2023
[38]

Jiawen Zhang, Shun Zheng, Wei Cao, Jiang Bian, and Jia Li. 2023. Warpformer: A multi-scale modeling approach for irregular clinical time series. InProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3273–3285

2023
[39]

Kexin Zhang, Qingsong Wen, Chaoli Zhang, Rongyao Cai, Ming Jin, Yong Liu, James Y Zhang, Yuxuan Liang, Guansong Pang, Dongjin Song, et al. 2024. Self- supervised learning for time series analysis: Taxonomy, progress, and prospects. IEEE transactions on pattern analysis and machine intelligence46, 10 (2024), 6775– 6794

2024
[40]

Weijia Zhang, Chenlong Yin, Hao Liu, Xiaofang Zhou, and Hui Xiong. 2024. Irregular multivariate time series forecasting: A transformable patching graph neural networks approach. InForty-first International Conference on Machine Learning

2024
[41]

Xiang Zhang, Marko Zeman, Theodoros Tsiligkaridis, and Marinka Zitnik. 2021. Graph-guided network for irregularly sampled multivariate time series.arXiv preprint arXiv:2110.05357(2021)

work page arXiv 2021
[42]

YiFan Zhang, Weiqi Chen, Zhaoyang Zhu, Dalin Qin, Liang Sun, Xue Wang, Qingsong Wen, Zhang Zhang, Liang Wang, and Rong Jin. 2024. Addressing Concept Shift in Online Time Series Forecasting: Detect-then-Adapt.arXiv preprint arXiv:2403.14949(2024)

work page arXiv 2024
[43]

Yunhao Zhang and Junchi Yan. 2023. Crossformer: Transformer utilizing cross- dimension dependency for multivariate time series forecasting. InThe eleventh international conference on learning representations

2023
[44]

Lifan Zhao and Yanyan Shen. 2025. Proactive model adaptation against concept drift for online time series forecasting. InProceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 1. 2020–2031

2025
[45]

Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, and Wancai Zhang. 2021. Informer: Beyond efficient transformer for long se- quence time-series forecasting. InProceedings of the AAAI conference on artificial intelligence, Vol. 35. 11106–11115

2021
[46]

Tian Zhou, Ziqing Ma, Qingsong Wen, Xue Wang, Liang Sun, and Rong Jin. 2022. Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting. InInternational conference on machine learning. PMLR, 27268–27286. A Additional Experiments A.1 Additional Main Results We report additional forecasting results measured by the MAE metric in Ta...

work page arXiv 2022

[1] [1]

Oren Anava, Elad Hazan, Shie Mannor, and Ohad Shamir. 2013. Online learning for time series prediction. InConference on learning theory. PMLR, 172–184

2013

[2] [2]

Marin Biloš, Johanna Sommer, Syama Sundar Rangapuram, Tim Januschowski, and Stephan Günnemann. 2021. Neural flows: Efficient alternative to neural ODEs.Advances in neural information processing systems34 (2021), 21325–21337

2021

[3] [3]

Zhengping Che, Sanjay Purushotham, Kyunghyun Cho, David Sontag, and Yan Liu. 2018. Recurrent neural networks for multivariate time series with missing values.Scientific reports8, 1 (2018), 6085

2018

[4] [4]

Ranak Roy Chowdhury, Jiacheng Li, Xiyuan Zhang, Dezhi Hong, Rajesh K Gupta, and Jingbo Shang. 2023. Primenet: Pre-training for irregular multivariate time series. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 7184–7192

2023

[5] [5]

Edward De Brouwer, Jaak Simm, Adam Arany, and Yves Moreau. 2019. GRU- ODE-Bayes: Continuous modeling of sporadically-observed time series.Advances in neural information processing systems32 (2019)

2019

[6] [6]

Yarin Gal and Zoubin Ghahramani. 2016. Dropout as a bayesian approximation: Representing model uncertainty in deep learning. Ininternational conference on machine learning. PMLR, 1050–1059

2016

[7] [7]

Jindong Han, Weijia Zhang, Hao Liu, Tao Tao, Naiqiang Tan, and Hui Xiong

[8] [8]

Bigst: Linear complexity spatio-temporal graph neural network for traffic forecasting on large-scale road networks.Proceedings of the VLDB Endowment 17, 5 (2024), 1081–1090

2024

[9] [9]

Max Horn, Michael Moor, Christian Bock, Bastian Rieck, and Karsten Borg- wardt. 2020. Set functions for time series. InInternational Conference on Machine Learning. PMLR, 4353–4363

2020

[10] [10]

AE Johnson, Tom J Pollard, Lu Shen, Li-wei H Lehman, Mengling Feng, Moham- mad Ghassemi, Benjamin Moody, Peter Szolovits, L Anthony Celi, and Roger G Mark. 2016. MIMIC-III, a freely accessible critical care database, Sci.Data3, 1 (2016), 1–9

2016

[11] [11]

HyunGi Kim, Siwon Kim, Jisoo Mok, and Sungroh Yoon. 2025. Battling the non- stationarity in time series forecasting via test-time adaptation. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 39. 17868–17876

2025

[12] [12]

Nikita Kitaev, Łukasz Kaiser, and Anselm Levskaya. 2020. Reformer: The efficient transformer.arXiv preprint arXiv:2001.04451(2020)

work page internal anchor Pith review Pith/arXiv arXiv 2020

[13] [13]

Ying-yee Ava Lau, Zhiwen Shao, and Dit-Yan Yeung. 2025. Fast and Slow Streams for Online Time Series Forecasting Without Information Leakage. InThe Thir- teenth International Conference on Learning Representations

2025

[14] [14]

Boyuan Li, Zhen Liu, Yicheng Luo, and Qianli Ma. 2026. Learning Recursive Multi-Scale Representations for Irregular Multivariate Time Series Forecasting. InInternational Conference on Learning Representations

2026

[15] [15]

Boyuan Li, Yicheng Luo, Zhen Liu, Junhao Zheng, Jianming Lv, and Qianli Ma

[16] [16]

HyperIMTS: Hypergraph Neural Network for Irregular Multivariate Time Series Forecasting.arXiv preprint arXiv:2505.17431(2025)

work page arXiv 2025

[17] [17]

Yong Liu, Tengge Hu, Haoran Zhang, Haixu Wu, Shiyu Wang, Lintao Ma, and Mingsheng Long. 2023. itransformer: Inverted transformers are effective for time series forecasting.arXiv preprint arXiv:2310.06625(2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023

[18] [18]

Zhiding Liu, Mingyue Cheng, Guanhao Zhao, Jiqian Yang, Qi Liu, and Enhong Chen. 2025. Improving Time Series Forecasting via Instance-aware Post-hoc Revision.arXiv preprint arXiv:2505.23583(2025)

work page arXiv 2025

[19] [19]

Yicheng Luo, Zhen Liu, Linghao Wang, Binquan Wu, Junhao Zheng, and Qianli Ma. 2024. Knowledge-empowered dynamic graph network for irregularly sam- pled medical time series.Advances in Neural Information Processing Systems37 (2024), 67172–67199

2024

[20] [20]

Yicheng Luo, Bowen Zhang, Zhen Liu, and Qianli Ma. [n. d.]. Hi-Patch: Hi- erarchical Patch GNN for Irregular Multivariate Time Series. InForty-second International Conference on Machine Learning

[21] [21]

Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research9, Nov (2008), 2579–2605

2008

[22] [22]

2016.Long-term daily and monthly climate records from stations across the contiguous United States (US Historical Climatology Network)

MJ Menne, CN Williams Jr, and RS Vose. 2016.Long-term daily and monthly climate records from stations across the contiguous United States (US Historical Climatology Network). Technical Report. Environmental System Science Data Infrastructure for a Virtual Ecosystem

2016

[23] [23]

Giangiacomo Mercatali, Andre Freitas, and Jie Chen. 2024. Graph neural flows for unveiling systemic interactions among irregularly sampled time series.Advances in Neural Information Processing Systems37 (2024), 57183–57206

2024

[24] [24]

Y Nie. 2022. A Time Series is Worth 64Words: Long-term Forecasting with Transformers.arXiv preprint arXiv:2211.14730(2022)

work page internal anchor Pith review Pith/arXiv arXiv 2022

[25] [25]

Quang Pham, Chenghao Liu, Doyen Sahoo, and Steven CH Hoi. 2022. Learning fast and slow for online time series forecasting.arXiv preprint arXiv:2202.11672 (2022)

work page arXiv 2022

[26] [26]

Yulia Rubanova, Ricky TQ Chen, and David K Duvenaud. 2019. Latent ordinary differential equations for irregularly-sampled time series.Advances in neural information processing systems32 (2019)

2019

[27] [27]

Mona Schirmer, Mazin Eltayeb, Stefan Lessmann, and Maja Rudolph. 2022. Mod- eling irregular time series with continuous recurrent units. InInternational con- ference on machine learning. PMLR, 19388–19405

2022

[28] [28]

Zongjiang Shang, Ling Chen, Binqing Wu, and Dongliang Cui. 2024. Ada- MSHyper: adaptive multi-scale hypergraph transformer for time series forecast- ing.Advances in Neural Information Processing Systems37 (2024), 33310–33337

2024

[29] [29]

Satya Narayan Shukla and Benjamin M Marlin. 2020. A survey on principles, models and methods for learning from irregularly sampled time series.arXiv preprint arXiv:2012.00168(2020)

work page arXiv 2020

[30] [30]

Satya Narayan Shukla and Benjamin M Marlin. 2021. Multi-time attention networks for irregularly sampled time series.arXiv preprint arXiv:2101.10318 (2021)

work page arXiv 2021

[31] [31]

Ikaro Silva, George Moody, Daniel J Scott, Leo A Celi, and Roger G Mark. 2012. Predicting in-hospital mortality of icu patients: The physionet/computing in cardiology challenge 2012. In2012 computing in cardiology. IEEE, 245–248

2012

[32] [32]

Roberto Vio, María Diaz-Trigo, and Paola Andreani. 2013. Irregular time series in astronomy and the use of the Lomb–Scargle periodogram.Astronomy and Computing1 (2013), 5–16

2013

[33] [33]

Qingsong Wen, Weiqi Chen, Liang Sun, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan, et al. 2023. Onenet: Enhancing time series forecasting models under concept drift by online ensembling.Advances in Neural Information Processing Systems36 (2023), 69949–69980

2023

[34] [34]

Vijaya Krishna Yalavarthi, Kiran Madhusudhanan, Randolf Scholz, Nourhan Ahmed, Johannes Burchert, Shayan Jawed, Stefan Born, and Lars Schmidt-Thieme

[35] [35]

InProceed- ings of the AAAI Conference on Artificial Intelligence, Vol

Grafiti: Graphs for forecasting irregularly sampled time series. InProceed- ings of the AAAI Conference on Artificial Intelligence, Vol. 38. 16255–16263

[36] [36]

Zhen-Jie Yao, Jie Bi, and Yi-Xin Chen. 2018. Applying deep learning to individual and community health monitoring data: A survey.International Journal of Automation and Computing15, 6 (2018), 643–655

2018

[37] [37]

Kun Yi, Qi Zhang, Wei Fan, Shoujin Wang, Pengyang Wang, Hui He, Ning An, Defu Lian, Longbing Cao, and Zhendong Niu. 2023. Frequency-domain MLPs are more effective learners in time series forecasting.Advances in Neural Information Processing Systems36 (2023), 76656–76679

2023

[38] [38]

Jiawen Zhang, Shun Zheng, Wei Cao, Jiang Bian, and Jia Li. 2023. Warpformer: A multi-scale modeling approach for irregular clinical time series. InProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3273–3285

2023

[39] [39]

Kexin Zhang, Qingsong Wen, Chaoli Zhang, Rongyao Cai, Ming Jin, Yong Liu, James Y Zhang, Yuxuan Liang, Guansong Pang, Dongjin Song, et al. 2024. Self- supervised learning for time series analysis: Taxonomy, progress, and prospects. IEEE transactions on pattern analysis and machine intelligence46, 10 (2024), 6775– 6794

2024

[40] [40]

Weijia Zhang, Chenlong Yin, Hao Liu, Xiaofang Zhou, and Hui Xiong. 2024. Irregular multivariate time series forecasting: A transformable patching graph neural networks approach. InForty-first International Conference on Machine Learning

2024

[41] [41]

Xiang Zhang, Marko Zeman, Theodoros Tsiligkaridis, and Marinka Zitnik. 2021. Graph-guided network for irregularly sampled multivariate time series.arXiv preprint arXiv:2110.05357(2021)

work page arXiv 2021

[42] [42]

YiFan Zhang, Weiqi Chen, Zhaoyang Zhu, Dalin Qin, Liang Sun, Xue Wang, Qingsong Wen, Zhang Zhang, Liang Wang, and Rong Jin. 2024. Addressing Concept Shift in Online Time Series Forecasting: Detect-then-Adapt.arXiv preprint arXiv:2403.14949(2024)

work page arXiv 2024

[43] [43]

Yunhao Zhang and Junchi Yan. 2023. Crossformer: Transformer utilizing cross- dimension dependency for multivariate time series forecasting. InThe eleventh international conference on learning representations

2023

[44] [44]

Lifan Zhao and Yanyan Shen. 2025. Proactive model adaptation against concept drift for online time series forecasting. InProceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 1. 2020–2031

2025

[45] [45]

Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, and Wancai Zhang. 2021. Informer: Beyond efficient transformer for long se- quence time-series forecasting. InProceedings of the AAAI conference on artificial intelligence, Vol. 35. 11106–11115

2021

[46] [46]

Tian Zhou, Ziqing Ma, Qingsong Wen, Xue Wang, Liang Sun, and Rong Jin. 2022. Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting. InInternational conference on machine learning. PMLR, 27268–27286. A Additional Experiments A.1 Additional Main Results We report additional forecasting results measured by the MAE metric in Ta...

work page arXiv 2022