pith. sign in

arxiv: 2605.17129 · v1 · pith:ZRRXFDCOnew · submitted 2026-05-16 · 🧬 q-bio.PE · q-bio.QM

Within-host immunology to age-of-infection epidemiology via a virtual cohort

Pith reviewed 2026-05-20 14:24 UTC · model grok-4.3

classification 🧬 q-bio.PE q-bio.QM
keywords within-host modelage-of-infectionsynthetic populationvirtual cohortSARS-CoV-2individual heterogeneityepidemiological modeltransmission dynamics
0
0 comments X

The pith

Individual immune variation in SARS-CoV-2 can be scaled directly into population transmission models via ranked virtual patient profiles.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out a chain of steps that turns numerical results from a model of virus and immune responses inside one person into inputs for a model of how the disease moves through many people. First, the within-host model is run many times to spot which parameters create the widest range of outcomes. Those parameters then define a group of simulated individuals whose response curves are saved over time since infection. The curves are ordered from mildest to most severe, including death, and this ordering supplies the parameters for an age-of-infection SIR model at the population scale. A reader would care because the method supplies a concrete route to include biological differences in epidemic forecasts without requiring full individual-level data.

Core claim

By numerically identifying the within-host parameters that produce the largest variation in model outputs, the authors generate a synthetic population whose time-dependent severity profiles, ranked from mild infection to death, serve as direct inputs to parameterize an age-of-infection structured epidemiological model, thereby establishing a one-directional link from individual heterogeneity to population-level transmission dynamics.

What carries the argument

The virtual cohort, a synthetic population created from the most variable within-host parameters and then ranked by severity as a function of time since infection, which supplies the parameter values for the age-of-infection SIR model.

Load-bearing premise

The parameters that produce the largest variation inside the within-host model are sufficient by themselves to generate synthetic severity profiles that match real-world heterogeneity and can be fed straight into the population model without further calibration against observed data.

What would settle it

Finding that the distribution of mild, severe, and fatal outcomes in the synthetic cohort deviates substantially from the proportions recorded in real SARS-CoV-2 case data would show that the direct parameterization step does not hold.

Figures

Figures reproduced from arXiv: 2605.17129 by Clotilde Djuikem, Julien Arino, Kang-Ling Liao, Morgan Craig, St\'ephanie Portet.

Figure 1
Figure 1. Figure 1: Conceptual overview of the method. Step 0 – Choose a within-host differential equations model describing the course of an infection in hosts and an age-of-infection structured epidemiological model describing the spread of the pathogen in a pop￾ulation. Determine which age-of-infection characteristics need to be extracted to parametrise the model, in order to guide the next steps. Step 1 – Carry out a glob… view at source ↗
Figure 2
Figure 2. Figure 2: Flowchart of the simplified within-host model. The state variables are target cells ( [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Ranked global sensitivity scores of parameters, where the global sensitivity is defined as the sum of [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Peak lung damage and timing (in days) of peak, [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Effects of thresholds ξ h and ξ d . (a) Length of hospital stay as a function of the hospitalisation threshold ξ h when ξ d = 85%. Boxplots show the distribution of hospital stay duration (days). Pink dots indicate the mean duration. Percentages correspond to the fraction of surviving hospitalised patients. (b) Percentage of the types of outcomes (mild, ICU and deaths) depending on the lethality threshold … view at source ↗
Figure 6
Figure 6. Figure 6: Distribution of the infectious period duration [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Estimation of population-level age-of-infection disease-induced death rate from the virtual cohort. [PITH_FULL_IMAGE:figures/full_fig_p014_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Estimated (a) time-to-recovery probability distribution [PITH_FULL_IMAGE:figures/full_fig_p015_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Transmissibility estimated from within-host viral load using four different summary functions, the [PITH_FULL_IMAGE:figures/full_fig_p016_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Comparison of between-host incidence trajectories [PITH_FULL_IMAGE:figures/full_fig_p018_10.png] view at source ↗
read the original abstract

We present a methodology providing a one-directional link from within-host individual heterogeneity to population-level disease transmission dynamics. The methodology works in several steps. A within-host model is investigated numerically to determine pathogen and immunological parameters leading to the largest variation of model responses. These key parameters are used to generate a synthetic population of individuals whose temporal immunological response profiles are recorded. These responses are ranked in terms of the severity of experienced outcomes, from mild infections to death, as a function of time since infection. This is used to parametrise an age-of-infection structured epidemiological model to study the transmission dynamics of the disease at the population level. The approach is illustrated using a within-host model describing SARS-CoV-2 infection and an SIR population-level model.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript presents a methodology to link within-host immunological heterogeneity to population-level transmission dynamics. It identifies key parameters from a within-host SARS-CoV-2 model that maximize response variation, generates a virtual cohort of synthetic individuals with ranked severity profiles over time since infection, and uses these ranks to parametrize an age-of-infection structured SIR epidemiological model.

Significance. If the virtual cohort's severity rankings prove representative of real heterogeneity, the approach could provide a structured, one-directional pipeline for incorporating individual immunological variation into epidemiological models, potentially improving predictions of transmission and outcomes for pathogens like SARS-CoV-2. The framing as a virtual cohort bridge between scales is conceptually clear, but the absence of any empirical validation or comparison limits its current significance.

major comments (3)
  1. [Abstract and Methods] Abstract and Methods: The claim that the virtual cohort 'faithfully transmits heterogeneity' to the epidemiological model rests on the assumption that parameters maximizing numerical variation in the within-host model produce severity profiles representative of real SARS-CoV-2 outcome distributions; no quantitative validation, error analysis, or comparison against empirical cohort data is provided to support this transfer.
  2. [Results] Results: The ranked time-since-infection severity profiles are directly inserted into the age-of-infection SIR transmission rates without calibration against observed distributions of infectiousness or clinical severity; this makes the population-level dynamics dependent on an untested modeling choice rather than data-constrained heterogeneity.
  3. [Methods] The selection of 'key within-host parameters' is described only at a high level as those leading to largest variation; without explicit criteria, sensitivity analysis, or justification that these parameters capture the dominant sources of real-world outcome heterogeneity, the downstream epidemiological conclusions lack a clear foundation.
minor comments (2)
  1. [Methods] Notation for the severity ranking function and its mapping to transmission rates could be made more explicit to improve reproducibility.
  2. [Figures] Figure captions should include more detail on how the virtual cohort is sampled and ranked to aid reader interpretation.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive review and the opportunity to strengthen the manuscript. We have revised the text to clarify the scope of our methodological pipeline, replace imprecise language, expand the description of parameter selection with explicit sensitivity analysis details, and add an explicit discussion of limitations including the absence of empirical validation. Our point-by-point responses follow.

read point-by-point responses
  1. Referee: [Abstract and Methods] The claim that the virtual cohort 'faithfully transmits heterogeneity' to the epidemiological model rests on the assumption that parameters maximizing numerical variation in the within-host model produce severity profiles representative of real SARS-CoV-2 outcome distributions; no quantitative validation, error analysis, or comparison against empirical cohort data is provided to support this transfer.

    Authors: We agree the original phrasing risked overstating biological representativeness. The pipeline is intended to propagate numerical heterogeneity generated by the within-host model; it does not claim that the resulting severity profiles match real-world distributions. We have replaced 'faithfully transmits heterogeneity' with 'transmits the modeled heterogeneity' throughout the abstract and methods. A new limitations paragraph has been added to the Discussion that explicitly states the lack of empirical validation or error analysis against cohort data and identifies this as a target for future work. revision: yes

  2. Referee: [Results] The ranked time-since-infection severity profiles are directly inserted into the age-of-infection SIR transmission rates without calibration against observed distributions of infectiousness or clinical severity; this makes the population-level dynamics dependent on an untested modeling choice rather than data-constrained heterogeneity.

    Authors: The direct use of ranked profiles is a deliberate, uncalibrated modeling choice that allows the pipeline to remain one-directional and data-light at the population scale. We accept that this renders the quantitative epidemiological outputs illustrative of the method rather than calibrated predictions. In revision we have added a sensitivity analysis that re-ranks profiles under alternative severity metrics and reports the resulting variation in R0 and peak incidence; we have also rewritten the Results and Discussion to state clearly that the population-level findings demonstrate the linking procedure and are not intended as calibrated forecasts. revision: partial

  3. Referee: [Methods] The selection of 'key within-host parameters' is described only at a high level as those leading to largest variation; without explicit criteria, sensitivity analysis, or justification that these parameters capture the dominant sources of real-world outcome heterogeneity, the downstream epidemiological conclusions lack a clear foundation.

    Authors: Parameter selection was performed via a variance-based global sensitivity analysis (Sobol indices) on the within-host model outputs, retaining parameters whose total-order index exceeded 0.1 for peak viral load or infection duration. We have expanded the Methods section to report the full sensitivity procedure, the numerical threshold applied, and the resulting parameter set. While we acknowledge that real-world heterogeneity may include factors absent from the within-host model, the selection is now explicitly justified as the dominant sources of variation within the given model. revision: yes

Circularity Check

0 steps flagged

No circularity: forward simulation pipeline from within-host variation to age-of-infection parametrization

full rationale

The paper outlines a sequential methodology consisting of numerical exploration of a within-host model to identify parameters producing largest response variation, generation of a synthetic virtual cohort with recorded temporal profiles, ranking of those profiles by severity as a function of time since infection, and direct insertion of the ranked profiles to parametrize an age-of-infection structured SIR model. This constitutes a one-directional forward chain with no self-definitional steps, no fitted inputs renamed as predictions, and no load-bearing self-citations or imported uniqueness theorems that would reduce the central claim to prior work by the same authors. The derivation remains self-contained as an explicit modeling pipeline rather than any closed loop equivalent to its inputs.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that numerical identification of high-variance parameters in the within-host model produces a representative synthetic population and that severity ranking can be mapped directly onto transmission parameters without further empirical adjustment.

free parameters (1)
  • key within-host parameters
    Parameters selected numerically for producing largest variation in model responses; their specific values are fitted or chosen to maximize heterogeneity before cohort generation.
axioms (1)
  • domain assumption The within-host model responses can be ranked by severity in a manner that directly informs age-of-infection transmission rates.
    Invoked when moving from ranked virtual cohort profiles to parametrization of the epidemiological model.

pith-pipeline@v0.9.0 · 5669 in / 1444 out tokens · 31347 ms · 2026-05-20T14:24:15.939924+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

44 extracted references · 44 canonical work pages

  1. [1]

    Acharya D., Liu G., and Gack M. U. Dysregulation of type I interferon responses in COVID-19.Nature Reviews Immunology, 20(7):397–398, 2020

  2. [2]

    Almocera A. E. S., Nguyen V . K., and Hernandez-Vargas E. A. Multiscale model within-host and between- host for viral infectious diseases.Journal of Mathematical Biology, 77(4):1035–1057, May 2018

  3. [3]

    Almocera A. E. S., Nguyen V . K., and Hernandez-Vargas E. A. Multiscale model within-host and between- host for viral infectious diseases.Journal of Mathematical Biology, 77(4):1035–1057, 2018

  4. [4]

    S., Wandeler G., et al

    Blaser N., Wettstein C., Estill J., Vizcaya L. S., Wandeler G., et al. Impact of viral load and the duration of primary infection on HIV transmission: systematic review and meta-analysis.AIDS, 28(7):1021, 2014

  5. [5]

    M., El Moustaid F., Gajewski Z., Kadelka S., Nikin-Beers R., et al

    Childs L. M., El Moustaid F., Gajewski Z., Kadelka S., Nikin-Beers R., et al. Linked within-host and between-host models and data for infectious diseases: a systematic review.PeerJ, 7:e7057, 2019

  6. [6]

    W., Morgan W

    Curran J. W., Morgan W. M., Hardy A. M., Jaffe H. W., Darrow W. W., and Dowdle W. R. The epidemiol- ogy of AIDS: current status and future prospects.Science, 229(4720):1352–1357, 1985

  7. [7]

    and Martcheva M

    DebRoy S. and Martcheva M. Immuni-epidemiology and HIV-AIDS: a modeling perspective.Mathemat- ical Biology Research Trends, pages 175–192, 2008

  8. [8]

    W., Thompson R

    Doran J. W., Thompson R. N., Yates C. A., and Bowness R. Mathematical methods for scaling from within-host to population-scale in infectious disease systems.Epidemics, 45:100724, 2023. 19

  9. [9]

    M., Gilbertson B

    Edenborough K. M., Gilbertson B. P., and Brown L. E. A mouse model for the study of contact- dependent transmission of influenza A virus and the factors that govern transmissibility.Journal of Vi- rology, 86(23):12544–12551, 2012

  10. [10]

    Apoptosis: a review of programmed cell death.Toxicologic Pathology, 35(4):495–516, 2007

    Elmore S. Apoptosis: a review of programmed cell death.Toxicologic Pathology, 35(4):495–516, 2007

  11. [11]

    A mathematical model for coupling within-host and between-host dynamics in an environmentally-driven infectious disease.Mathematical Biosciences, 241(1):49–55, 2013

    Feng Z., Velasco-Hernandez J., and Tapia-Santos B. A mathematical model for coupling within-host and between-host dynamics in an environmentally-driven infectious disease.Mathematical Biosciences, 241(1):49–55, 2013

  12. [12]

    Feng Z., Velasco-Hernandez J., Tapia-Santos B., and Leite M. C. A. A model for coupling within-host and between-host dynamics in an infectious disease.Nonlinear Dynamics, 68:401–411, 2012

  13. [13]

    A complete categorization of multiscale models of infectious disease systems.Journal of Biological Dynamics, 11(1):378–435, 2017

    Garira W. A complete categorization of multiscale models of infectious disease systems.Journal of Biological Dynamics, 11(1):378–435, 2017

  14. [14]

    Gilchrist M. A. and Coombs D. Evolution of virulence: interdependence, constraints, and selection using nested models.Theoretical Population Biology, 69(2):145–153, 2006

  15. [15]

    B., Cardozo-Ojeda E

    Goyal A., Reeves D. B., Cardozo-Ojeda E. F., Schiffer J. T., and Mayer B. T. Viral load and contact heterogeneity predict SARS-CoV-2 transmission and super-spreading events.ELife, 10:e63537, 2021

  16. [16]

    Quantifying the relationship between within-host dynamics and transmission for viral diseases of livestock.Journal of The Royal Society Interface, 21(211), 2024

    Gubbins S. Quantifying the relationship between within-host dynamics and transmission for viral diseases of livestock.Journal of The Royal Society Interface, 21(211), 2024

  17. [17]

    B., Galinski M

    Gutierrez J. B., Galinski M. R., Cantrell S., and V oit E. O. From within host dynamics to the epidemiology of infectious disease: scientific overview and challenges.Mathematical Biosciences, 270:143–155, 2015

  18. [18]

    and Rohani P

    Handel A. and Rohani P. Crossing the scale from within-host infection dynamics to between-host trans- mission fitness: a discussion of current assumptions and knowledge.Philosophical Transactions of the Royal Society B: Biological Sciences, 370(1675):20140302, 2015

  19. [19]

    Immunoepidemiology–bridging the gap between immunology and epidemiology.TRENDS in Parasitology, 17(2):102–106, 2001

    Hellriegel B. Immunoepidemiology–bridging the gap between immunology and epidemiology.TRENDS in Parasitology, 17(2):102–106, 2001

  20. [21]

    Transmissibility of COVID-19 depends on the viral load around onset in adult and symptomatic patients.PLoS One, 15(12):e0243597, 2020

    Kawasuji H., Takegoshi Y ., Kaneda M., Ueno A., Miyajima Y ., et al. Transmissibility of COVID-19 depends on the viral load around onset in adult and symptomatic patients.PLoS One, 15(12):e0243597, 2020

  21. [22]

    P., Smith R

    Ke R., Martinez P. P., Smith R. L., Gibson L. L., Mirza A., et al. Daily longitudinal sampling of sars-cov-2 infection reveals substantial heterogeneity in infectiousness.Nature Microbiology, 7(5):640–652, 2022

  22. [23]

    D., Ribeiro R

    Ke R., Zitzmann C., Ho D. D., Ribeiro R. M., and Perelson A. S. In vivo kinetics of sars-cov-2 infection and its relationship with a person’s infectiousness.Proceedings of the National Academy of Sciences, 118(49):e2111477118, 2021

  23. [24]

    and Ferguson N

    Lange A. and Ferguson N. M. Antigenic diversity, transmission mechanisms, and the evolution of pathogens.PLoS Computational Biology, 5(10):e1000536, 2009

  24. [25]

    Quantifying the relationship between SARS-CoV-2 viral load and infectiousness.ELife, 10:e69302, 2021

    Marc A., Kerioui M., Blanquart F., Bertrand J., Mitja O., et al. Quantifying the relationship between SARS-CoV-2 viral load and infectiousness.ELife, 10:e69302, 2021. 20

  25. [26]

    Coupling within-host and between-host infectious diseases models.Biomath, 4(2):ID–1510091, 2015

    Martcheva M., Tuncer N., and St Mary C. Coupling within-host and between-host infectious diseases models.Biomath, 4(2):ID–1510091, 2015

  26. [27]

    Linking within-and between-host dynamics in the evolutionary epidemi- ology of infectious diseases.Trends in Ecology & Evolution, 23(9):511–517, 2008

    Mideo N., Alizon S., and Day T. Linking within-and between-host dynamics in the evolutionary epidemi- ology of infectious diseases.Trends in Ecology & Evolution, 23(9):511–517, 2008

  27. [28]

    A., and Velasco-Hernández J

    Núñez-López M., Castro-Echeverría J. A., and Velasco-Hernández J. X. A simple within-host, between- host model for a vector-transmitted disease. November 2022

  28. [29]

    P., Sahu M

    Singh D. P., Sahu M. C., Pagdhune A., Viramgami A., Perumal S., et al. Viral load could be an important determinant for fomites-based transmission of viral infections.Journal of Family Medicine and Primary Care, 10(2):929, 2021

  29. [30]

    T., Alizon S., and Michalakis Y

    Sofonea M. T., Alizon S., and Michalakis Y . From within-host interactions to epidemiological competi- tion: a general model for multiple infections.Philosophical Transactions of the Royal Society B: Biological Sciences, 370(1675):20140303, August 2015

  30. [31]

    A., and Peirce S

    Walpole J., Papin J. A., and Peirce S. M. Multiscale computational models of complex biological systems. Annual Review of Biomedical Engineering, 15(1):137–154, 2013

  31. [32]

    Lessons from a decade of individual-based models for infectious disease transmission: a systematic review (2006-2015).BMC Infectious Diseases, 17(1):1–16, 2017

    Willem L., Verelst F., Bilcke J., Hens N., and Beutels P. Lessons from a decade of individual-based models for infectious disease transmission: a systematic review (2006-2015).BMC Infectious Diseases, 17(1):1–16, 2017

  32. [33]

    A class of SIR epidemic model with saturation incidence and age of infection

    Yang J., Zhang F., and Wang X. A class of SIR epidemic model with saturation incidence and age of infection. InEighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007), volume 1, pages 146–149. IEEE, 2007

  33. [34]

    outbreak

    Yeghiazarian L., Cumberland W. G., and Yang O. O. A stochastic multi-scale model of HIV-1 transmis- sion for decision-making: application to a MSM population.PLoS One, 8(11):e70578, 2013. 21 Within-host immunology to age-of-infection epidemiologyviaa virtual cohort Supplementary material Julien Arino, Morgan Craig, Clotilde Djuikem, Kang-Ling Liao, Stépha...

  34. [35]

    A unifying framework for parallel and distributed processing in R using Futures.The R Journal, 13(2):208–227, 2021

    Bengtsson H. A unifying framework for parallel and distributed processing in R using Futures.The R Journal, 13(2):208–227, 2021

  35. [36]

    R package version 1.2.0

    Carnell R.lhs: Latin Hypercube Samples, 2024. R package version 1.2.0

  36. [37]

    Timing of Antiviral Treatment Ini- tiation is Critical to Reduce SARS-CoV-2 Viral Load.CPT: Pharmacometrics & Systems Pharmacology, 9(9):509–514, August 2020

    Gonçalves A., Bertrand J., Ke R., Comets E., de Lamballerie X., et al. Timing of Antiviral Treatment Ini- tiation is Critical to Reduce SARS-CoV-2 Viral Load.CPT: Pharmacometrics & Systems Pharmacology, 9(9):509–514, August 2020. 13

  37. [38]

    Hernandez-Vargas E. A. and Velasco-Hernandez J. X. In-host Mathematical Modelling of COVID-19 in Humans.Annual Reviews in Control, 50:448–456, 2020

  38. [39]

    R package version 1.30.1

    Iooss B., Da Veiga S., Janon A., and Pujol G.Sensitivity: Global Sensitivity Analysis of Model Outputs and Importance Measures, 2024. R package version 1.30.1

  39. [40]

    L., Aogo R

    Jenner A. L., Aogo R. A., Alfonso S., Crowe V ., Deng X., et al. COVID-19 virtual patient cohort suggests immune mechanisms driving disease outcomes.PLoS Pathogens, 17(7):e1009753, 2021

  40. [41]

    Multistrain disease dynamics

    Martcheva M. Multistrain disease dynamics. InAn Introduction to Mathematical Epidemiology, pages 183–214. Springer, 2015

  41. [42]

    Dynamics of two-strain influenza with isolation and partial cross-immunity.SIAM Journal on Applied Mathematics, 65(3):964–982, 2005

    Nuno M., Feng Z., Martcheva M., and Castillo-Chavez C. Dynamics of two-strain influenza with isolation and partial cross-immunity.SIAM Journal on Applied Mathematics, 65(3):964–982, 2005

  42. [43]

    Owens K., Esmaeili S., and Schiffer J. T. Heterogeneous sars-cov-2 kinetics due to variable timing and intensity of immune responses.JCI Insight, 9(9):e176286, 2024

  43. [44]

    and Watmough J

    van den Driessche P. and Watmough J. Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission.Mathematical Biosciences, 180(1):29–48, 2002

  44. [45]

    A class of SIR epidemic model with saturation incidence and age of infection

    Yang J., Zhang F., and Wang X. A class of SIR epidemic model with saturation incidence and age of infection. InEighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007), volume 1, pages 146–149. IEEE, 2007. 14