pith. sign in

arxiv: 2605.27269 · v1 · pith:JQRKZDAJnew · submitted 2026-05-26 · 💻 cs.LG · stat.AP

Transfer Learning using 66 Diseases for Disease Forecasting Applications

classification 💻 cs.LG stat.AP
keywords datadiseaseforecastingdifferentmodelsstreamslearningwork
0
0 comments X
read the original abstract

Disease forecasting models typically rely on a single data stream, making models brittle when histories are short or noisy. Recent top-performing models have shown that synthesizing multiple reporting systems for the same disease improves performance. Other recent work takes this idea a step further, using transfer learning to train a forecasting model for one disease using data from a different disease. We expand upon each of these approaches greatly, training machine learning models on data that span 66 infectious diseases and several data streams. We investigate the value of incorporating different data streams for forecasting 20 different disease data streams. We find that incorporating other data streams improves forecasting in the vast majority (84.9%) of time series and model structures considered. However, our work highlights that the quality of the added data matters, where adding data extremely different from the target data stream can sometimes degrade forecast performance. A major contribution of this work is in compiling a publicly-available database of data for use by the infectious disease forecasting community.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.