Forecasting Seasonal Influenza Epidemics with Physics-Informed Neural Networks

Bruno Lepri; Gabriele Santin; Giulia Cencetti; Martina Rama; Michele Tizzoni

arxiv: 2506.03897 · v2 · pith:M3HJ7V43new · submitted 2025-06-04 · ⚛️ physics.soc-ph

Pith reviewed 2026-05-22 00:39 UTC · model grok-4.3

classification ⚛️ physics.soc-ph

keywords: physics-informed neural networks · SIR model · epidemic forecasting · influenza · probabilistic forecasts · parameter inference · Markov chain Monte Carlo · hybrid modeling

The pith

A neural network that embeds the SIR epidemic model structure, trained only on synthetic data, infers transmission parameters from limited noisy observations and produces accurate probabilistic forecasts for seasonal influenza.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces SIR-INN, a hybrid model that incorporates the classical Susceptible-Infectious-Recovered compartmental structure directly into a neural network. The model is trained once on synthetic epidemic scenarios and then generalizes to new conditions without retraining. From sparse and noisy real-world observations, it uses Markov chain Monte Carlo to infer key transmission parameters and generate short- and long-term probabilistic forecasts. Validation on Italian national influenza surveillance data for the 2023-2024 and 2024-2025 seasons shows performance comparable to current state-of-the-art methods, especially on the Weighted Interval Score, while maintaining credible uncertainty intervals.

Core claim

SIR-INN integrates the mechanistic structure of the classical SIR model into a neural network architecture. Trained once on synthetic epidemic scenarios, the model generalizes across epidemic conditions without retraining. From limited and noisy observations, it infers key transmission parameters via Markov chain Monte Carlo, generating probabilistic short- and long-term forecasts that are validated on national influenza data from Italy in the 2023-2024 and 2024-2025 seasons.

What carries the argument

The SIR-INN hybrid architecture that embeds the SIR compartmental model inside a neural network to allow single training on synthetic data followed by MCMC-based parameter inference on real observations.
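
The "single training on synthetic data" step presupposes a generator of SIR trajectories. The sketch below shows one way such scenarios could be produced. It is not the authors' code: the function names, the normalized population (N = 1), the RK4 integrator, and the sampling ranges for the recovery rate and basic reproduction number are all assumptions made for illustration.

```python
import random


def simulate_sir(beta, gamma, days, i0=1e-4, dt=0.1):
    """Integrate the normalized SIR ODEs (N = 1) with RK4; return daily (S, I, R)."""

    def deriv(s, i, r):
        new_inf = beta * s * i  # (beta / N) * S * I with N = 1
        return (-new_inf, new_inf - gamma * i, gamma * i)

    state = (1.0 - i0, i0, 0.0)
    steps_per_day = round(1 / dt)
    daily = [state]
    for _ in range(days):
        for _ in range(steps_per_day):
            k1 = deriv(*state)
            k2 = deriv(*(x + 0.5 * dt * k for x, k in zip(state, k1)))
            k3 = deriv(*(x + 0.5 * dt * k for x, k in zip(state, k2)))
            k4 = deriv(*(x + dt * k for x, k in zip(state, k3)))
            state = tuple(
                x + dt / 6.0 * (a + 2 * b + 2 * c + d)
                for x, a, b, c, d in zip(state, k1, k2, k3, k4)
            )
        daily.append(state)
    return daily


def sample_scenarios(n, days=150, seed=0):
    """Draw (beta, gamma) from assumed ranges and simulate one trajectory each."""
    rng = random.Random(seed)
    scenarios = []
    for _ in range(n):
        gamma = rng.uniform(0.1, 0.5)  # assumed recovery-rate range (1/day)
        r0 = rng.uniform(1.1, 3.0)     # assumed basic reproduction number range
        beta = r0 * gamma
        scenarios.append(
            {"beta": beta, "gamma": gamma,
             "trajectory": simulate_sir(beta, gamma, days)}
        )
    return scenarios
```

Sampling the reproduction number rather than the transmission rate directly is a design choice of this sketch: it keeps every scenario above the epidemic threshold, so each trajectory contains an actual outbreak.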

If this is right

The model supplies computationally efficient real-time predictions together with uncertainty quantification for epidemic dynamics.
It achieves competitive accuracy across nearly all phases of an outbreak and shows improved performance in the 2024-2025 season.
Credible uncertainty intervals are produced consistently, although coverage metrics indicate room for improvement in calibration.
The single-training generalization property removes the need for repeated retraining when epidemic conditions change.
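
The accuracy and calibration claims above rest on two metrics, the Weighted Interval Score and interval coverage. The sketch below follows the definition of WIS commonly used in the epidemic-forecasting literature. The function names are hypothetical, and the interval levels the paper actually evaluates are not stated in this review, so the levels in the usage note are illustrative only.

```python
def interval_score(lower, upper, y, alpha):
    """Score of a central (1 - alpha) prediction interval for observation y."""
    width = upper - lower
    below = (2.0 / alpha) * max(lower - y, 0.0)  # penalty if y falls below the interval
    above = (2.0 / alpha) * max(y - upper, 0.0)  # penalty if y falls above the interval
    return width + below + above


def weighted_interval_score(median, intervals, y):
    """WIS from a median and a dict {alpha: (lower, upper)} of central intervals."""
    total = 0.5 * abs(y - median)
    for alpha, (lower, upper) in intervals.items():
        total += (alpha / 2.0) * interval_score(lower, upper, y, alpha)
    return total / (len(intervals) + 0.5)


def empirical_coverage(lowers, uppers, observations):
    """Fraction of observations that fall inside their prediction interval."""
    hits = sum(lo <= y <= up for lo, up, y in zip(lowers, uppers, observations))
    return hits / len(observations)
```

For example, `weighted_interval_score(10, {0.5: (8, 12), 0.1: (5, 15)}, 10)` scores a forecast whose median matches the observation, so only the interval widths contribute. Lower WIS is better; a well-calibrated 90% interval should yield an empirical coverage close to 0.9.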

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

The same synthetic-training strategy could be tested on other compartmental structures such as SEIR or models with vital dynamics for different respiratory pathogens.
If the hybrid design scales, national surveillance systems might adopt it to issue earlier alerts without collecting massive new training datasets each season.
The approach invites direct comparison with purely data-driven neural forecasters to quantify how much the embedded SIR structure improves long-horizon reliability.

Load-bearing premise

The classical SIR compartmental structure remains an adequate mechanistic skeleton for real seasonal influenza dynamics when embedded in the neural network and when parameters are inferred from limited noisy national surveillance data.

What would settle it

A side-by-side comparison of SIR-INN's forecast peak timing and magnitude against the observed peak in a subsequent influenza season. A substantial deviation from both the ground truth and established forecasting methods would count against the claim.

Original abstract

Accurate epidemic forecasting is critical for informing public health decisions and timely interventions. While Physics-Informed Neural Networks have shown promise in various scientific domains, their potential application to real-time epidemic forecasting remains underexplored. Here, we present SIR-INN, a hybrid forecasting framework that integrates the mechanistic structure of the classical Susceptible-Infectious-Recovered (SIR) model into a neural network architecture. Trained once on synthetic epidemic scenarios, the model is able to generalize across epidemic conditions without retraining. From limited and noisy observations, SIR-INN infers key transmission parameters via Markov chain Monte Carlo, generating probabilistic short- and long-term forecasts. We validate SIR-INN using national influenza data from the Italian National Institute of Health in the 2023-2024 and 2024-2025 seasons. The model performs competitively with current state-of-the-art approaches, particularly in terms of Weighted Interval Score. It shows accurate predictive performance in nearly all phases of the outbreak, with improved accuracy observed for the 2024-2025 influenza season. Credible uncertainty intervals are consistently maintained, while coverage metrics highlight room for improvement in uncertainty calibration. SIR-INN offers a computationally efficient, transparent, and generalizable solution for epidemic forecasting, appropriately leveraging the framework's hybrid design. Its ability to provide real-time predictions of epidemic dynamics, together with uncertainty quantification, makes it a promising tool for real-world epidemic forecasting.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

The paper's main contribution is a hybrid SIR-INN that trains once on synthetic trajectories, then uses MCMC to infer parameters from real Italian flu data and produce probabilistic forecasts without retraining.

The central point here is a hybrid setup where a neural network learns SIR dynamics from synthetic data in one training pass, then applies MCMC to pull transmission and recovery rates out of limited noisy observations and produce short- and long-term forecasts. The approach is presented as generalizable across epidemic conditions and is evaluated on the 2023-2024 and 2024-2025 Italian seasons, with competitive Weighted Interval Scores and credible intervals maintained in most phases.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces SIR-INN, a hybrid physics-informed neural network that embeds the classical SIR compartmental model. The network is trained once on synthetic SIR trajectories and then, from limited noisy national surveillance observations, infers transmission and recovery rates via MCMC to produce probabilistic short- and long-term forecasts. Validation is performed on Italian influenza incidence data for the 2023-2024 and 2024-2025 seasons, with competitive Weighted Interval Score performance reported relative to existing methods.

Significance. If the central generalization claim holds, the framework would offer a computationally efficient route to mechanistic forecasting with built-in uncertainty quantification that does not require retraining per season. The use of synthetic pre-training followed by MCMC inference on real data is a clear methodological strength when the SIR skeleton is adequate. However, the practical significance is tempered by the risk that any mismatch between classical SIR dynamics and real influenza processes (reporting delays, under-ascertainment, antigenic drift) is absorbed into the inferred parameters rather than diagnosed as model error.

major comments (2)

[Abstract] Abstract and Results section: the headline claim that a single training run on synthetic SIR scenarios enables generalization across real epidemic conditions without retraining rests on the untested premise that the classical SIR ODEs remain an adequate mechanistic skeleton once confronted with national surveillance noise; no sensitivity experiments that inject non-SIR features (time-varying transmission, reporting delays, or multi-strain dynamics) into the test data are reported, leaving the robustness of the inferred parameters open to question.
[Methods] Methods section on MCMC inference: parameter inference is performed on the same limited observations used for forecasting; while the SIR structure is external, the effective transmission rates become fitted quantities whose predictive use therefore carries a circularity burden that is not quantified by any held-out validation or posterior predictive check against independent data streams.
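
The first major comment asks for test data that deliberately violate the SIR assumptions. A minimal sketch of how such data could be generated is given below, under stated assumptions: a sinusoidally varying transmission rate, a fixed reporting delay, and a constant reporting rate (under-ascertainment). All function names and parameter values are hypothetical, and the forward Euler integrator is chosen only for brevity.

```python
import math


def seasonal_beta(beta0, amplitude, period_days):
    """Assumed form of time-varying transmission: beta0 * (1 + a * cos(2*pi*t/T))."""
    return lambda t: beta0 * (1.0 + amplitude * math.cos(2.0 * math.pi * t / period_days))


def simulate_incidence(beta_fn, gamma, days, i0=1e-4, dt=0.05):
    """Daily new infections (population fraction) from SIR with time-varying beta."""
    s, i = 1.0 - i0, i0
    steps_per_day = round(1 / dt)
    incidence = []
    for day in range(days):
        s_start = s
        for k in range(steps_per_day):
            t = day + k * dt
            new_inf = beta_fn(t) * s * i
            s, i = s - dt * new_inf, i + dt * (new_inf - gamma * i)
        incidence.append(s_start - s)  # susceptibles lost during this day
    return incidence


def observe(incidence, reporting_rate=1.0, delay_days=0):
    """Shift the series by a reporting delay and scale it by the reporting rate."""
    delayed = [0.0] * delay_days + incidence[: len(incidence) - delay_days]
    return [reporting_rate * x for x in delayed]
```

Feeding `observe(simulate_incidence(seasonal_beta(0.4, 0.2, 365.0), 0.2, 120), 0.3, 7)` to a model that assumes constant-rate SIR dynamics, and comparing the inferred parameters with the known generating values, would show how much of the mismatch is absorbed into the parameter estimates.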

minor comments (2)

[Figures] Figure captions should explicitly define all metrics (WIS, coverage, etc.) so that tables and figures are self-contained.
[Methods] The description of the neural-network architecture would benefit from a clear statement of the relative weighting between the data-fidelity term and the physics residual term in the loss function.
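
The second minor comment concerns the relative weighting of the data-fidelity and physics-residual terms. The sketch below shows the general shape of such a composite loss. The paper's actual loss is not given in this review, so everything here is an assumption: the weights `w_data` and `w_phys`, the function names, the normalized population (N = 1), and the use of central finite differences as a stand-in for the automatic differentiation a real physics-informed network would use.

```python
def physics_residual(traj, beta, gamma, dt):
    """Mean squared SIR residual over interior points of a list of (S, I, R) tuples."""
    total, count = 0.0, 0
    for k in range(1, len(traj) - 1):
        s, i, r = traj[k]
        # Central differences approximate the time derivatives (autodiff stand-in).
        ds = (traj[k + 1][0] - traj[k - 1][0]) / (2.0 * dt)
        di = (traj[k + 1][1] - traj[k - 1][1]) / (2.0 * dt)
        dr = (traj[k + 1][2] - traj[k - 1][2]) / (2.0 * dt)
        inf = beta * s * i
        total += (ds + inf) ** 2 + (di - inf + gamma * i) ** 2 + (dr - gamma * i) ** 2
        count += 1
    return total / count


def data_misfit(traj, observed_i):
    """Mean squared error between predicted and observed infectious fraction."""
    return sum((p[1] - y) ** 2 for p, y in zip(traj, observed_i)) / len(observed_i)


def pinn_loss(traj, observed_i, beta, gamma, dt, w_data=1.0, w_phys=1.0):
    """Weighted sum of the data-fidelity term and the physics residual term."""
    return (w_data * data_misfit(traj, observed_i)
            + w_phys * physics_residual(traj, beta, gamma, dt))
```

The ratio `w_phys / w_data` controls how strongly the network is pulled toward exact SIR dynamics versus the training data, which is why reporting it matters for reproducibility.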

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which help clarify the scope and limitations of the SIR-INN framework. We respond to each major comment below and outline the revisions we will implement.

Referee: [Abstract] Abstract and Results section: the headline claim that a single training run on synthetic SIR scenarios enables generalization across real epidemic conditions without retraining rests on the untested premise that the classical SIR ODEs remain an adequate mechanistic skeleton once confronted with national surveillance noise; no sensitivity experiments that inject non-SIR features (time-varying transmission, reporting delays, or multi-strain dynamics) into the test data are reported, leaving the robustness of the inferred parameters open to question.

Authors: We agree that controlled sensitivity experiments would strengthen the robustness claims. Our validation already uses real national surveillance data from two influenza seasons, which inherently contain reporting noise, under-ascertainment, and other non-SIR effects, and the model produced competitive forecasts without retraining. To address the specific request, the revised manuscript will include new experiments that inject time-varying transmission and reporting delays into synthetic test trajectories to quantify their effects on inferred parameters and forecast accuracy.
revision: yes

Referee: [Methods] Methods section on MCMC inference: parameter inference is performed on the same limited observations used for forecasting; while the SIR structure is external, the effective transmission rates become fitted quantities whose predictive use therefore carries a circularity burden that is not quantified by any held-out validation or posterior predictive check against independent data streams.

Authors: Inference uses data available up to the forecast origin to predict subsequent incidence, which follows standard real-time forecasting practice. We acknowledge that explicit quantification of any circularity via held-out checks would improve transparency. In the revision we will add posterior predictive checks on held-out segments of the Italian surveillance series and, where feasible, comparisons against independent data streams to evaluate the reliability of the inferred parameters.
revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation uses external SIR structure and real-data validation

The paper trains the SIR-INN hybrid model once on synthetic trajectories generated from the classical SIR equations, then applies the fixed network to real national surveillance data by inferring transmission parameters via MCMC and producing forecasts. Performance is assessed on the 2023-2024 and 2024-2025 Italian influenza seasons against external benchmarks and state-of-the-art methods, with no reduction of the central generalization or forecasting claims to the synthetic training inputs by construction. The mechanistic SIR skeleton is a standard, independently established model rather than a self-defined or self-cited construct, and the MCMC step is ordinary parameter inference for forecasting rather than a fitted input relabeled as a prediction. No self-citation chains, ansatz smuggling, or renaming of known results appear as load-bearing elements.

Axiom & Free-Parameter Ledger

1 free parameter · 2 axioms · 0 invented entities

The central claim rests on the adequacy of the SIR compartmental model for influenza, the representativeness of the synthetic training distribution, and the reliability of national surveillance counts as observations. No new physical entities are postulated.

free parameters (1)

transmission and recovery rates
Inferred via MCMC from limited noisy observations for each forecast window; these are the quantities the model learns to predict future incidence.
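
The inference step described above can be sketched as follows. This is an illustration under explicit assumptions, not the paper's procedure: a random-walk Metropolis sampler is used because the review does not say which MCMC variant the authors employ, a plain Euler SIR solver stands in for the trained network as the forward model, and the Gaussian likelihood, uniform prior box, step size, and starting point are all hypothetical choices.

```python
import math
import random


def sir_prevalence(beta, gamma, days, i0=1e-3, dt=0.25):
    """Stand-in forward model: daily infectious fraction from an Euler SIR solver."""
    s, i = 1.0 - i0, i0
    out = [i]
    for _ in range(days):
        for _ in range(round(1 / dt)):
            new_inf = beta * s * i
            s, i = s - dt * new_inf, i + dt * (new_inf - gamma * i)
        out.append(i)
    return out


def log_posterior(theta, obs, sigma):
    """Gaussian log-likelihood plus an assumed uniform prior on (beta, gamma)."""
    beta, gamma = theta
    if not (0.0 < beta < 2.0 and 0.0 < gamma < 1.0):
        return -math.inf
    pred = sir_prevalence(beta, gamma, len(obs) - 1)
    return -0.5 * sum(((p - y) / sigma) ** 2 for p, y in zip(pred, obs))


def metropolis(obs, sigma, n_steps=2000, step=0.02, seed=0):
    """Random-walk Metropolis over (beta, gamma); returns the full chain."""
    rng = random.Random(seed)
    theta = (0.5, 0.3)  # arbitrary starting point inside the prior box
    logp = log_posterior(theta, obs, sigma)
    chain = []
    for _ in range(n_steps):
        prop = (theta[0] + rng.gauss(0.0, step), theta[1] + rng.gauss(0.0, step))
        logp_prop = log_posterior(prop, obs, sigma)
        # Accept with probability min(1, posterior ratio).
        if rng.random() < math.exp(min(0.0, logp_prop - logp)):
            theta, logp = prop, logp_prop
        chain.append(theta)
    return chain


def forecast_quantiles(chain, target_day, burn_in, quantiles=(0.05, 0.5, 0.95)):
    """Push thinned posterior samples through the model; return forecast quantiles."""
    samples = chain[burn_in::10]
    finals = sorted(sir_prevalence(b, g, target_day)[-1] for b, g in samples)
    return [finals[min(int(q * len(finals)), len(finals) - 1)] for q in quantiles]
```

Only observations up to the forecast origin enter `log_posterior`; the forecast for a later `target_day` is obtained by running the forward model with each retained posterior sample, which is what turns parameter uncertainty into a predictive interval.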

axioms (2)

domain assumption: The SIR compartmental structure is a sufficient mechanistic skeleton for seasonal influenza dynamics.
Invoked by embedding the SIR equations inside the neural network architecture.
domain assumption: Synthetic epidemic trajectories generated from the SIR model are representative of real seasonal influenza conditions.
Required for the one-time training to generalize without retraining.

pith-pipeline@v0.9.0 · 5794 in / 1451 out tokens · 39416 ms · 2026-05-22T00:39:52.214873+00:00 · methodology

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

Each tag describes the relation between the paper passage and the cited Recognition theorem.

IndisputableMonolith/Cost/FunctionalEquation.lean · washburn_uniqueness_aczel · unclear
Paper passage: the classical Susceptible-Infectious-Recovered (SIR) model into a neural network architecture... dS/dt = −(β/N) S I, dI/dt = (β/N) S I − γ I, dR/dt = γ I

IndisputableMonolith/Foundation/RealityFromDistinction.lean · reality_from_one_distinction · unclear
Paper passage: Trained once on synthetic epidemic scenarios... infers key transmission parameters via Markov chain Monte Carlo

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.