Temporal dependence in exposure and hazard-based infectious disease interventions
Pith reviewed 2026-05-18 19:10 UTC · model grok-4.3
The pith
Selection bias emerges over time in RCTs of infectious disease interventions even when all participants start identical.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In randomized controlled trials of infectious disease interventions with homogeneous participants and no individual heterogeneity at baseline, temporal dependence in exposure produces selection bias over time that distorts the hazard ratio. This bias arises because the dependence creates differential cumulative exposure risks between intervention and control arms even when the intervention has no effect, and the magnitude of the distortion can be quantified directly from the exposure process.
What carries the argument
Temporal dependence in exposure, which creates cumulative differences in infection risk across trial arms by linking an individual's exposure status at one time to later periods.
If this is right
- Hazard ratios reported from such trials will deviate from the true intervention effect even under perfect randomization and homogeneity.
- Trial protocols should incorporate exposure monitoring or adjustment methods to reduce the influence of time-linked exposure patterns.
- Interpretation of results must include sensitivity checks for the strength of temporal dependence present in the study population.
- The bias size increases with stronger serial correlation in exposure events and longer trial durations.
Where Pith is reading between the lines
- The same exposure-dependence mechanism could distort estimates in observational cohort studies of infectious disease interventions.
- Combining this source of bias with baseline heterogeneity would likely produce larger net distortions than either source alone.
- Explicit modeling of the exposure process in the statistical analysis offers a route to correct the hazard ratio after data collection.
Load-bearing premise
Exposure to the infectious agent depends on time in a manner that differs systematically between the intervention and control groups as the study continues.
What would settle it
A simulation of a randomized trial with identical participants at the start, time-varying exposure probabilities drawn from a realistic transmission model, and a true hazard ratio of one that nevertheless yields a biased estimate away from one after several time steps.
Figures
read the original abstract
In randomized controlled trials (RCTs) of infectious disease interventions, it is well recognized that unmeasured individual heterogeneity at baseline can induce selection bias over time, thereby complicating the interpretation of the estimated hazard ratio. The present study examines a simplified setting: RCTs consisting of homogeneous participants, with no individual heterogeneity at baseline. However, even in such an apparently ideal setting, selection bias can emerge over time due to temporal dependence in exposure, a realistic feature of infectious disease transmission. In this study, we mathematically characterize the mechanism underlying this bias and quantitatively evaluate its magnitude. Our results show that this bias should be recognized as an issue in both the design and interpretation of RCTs of infectious disease interventions.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper examines selection bias in hazard ratio estimates from RCTs of infectious disease interventions. In a simplified homogeneous population with no baseline individual heterogeneity, it claims that temporal dependence in exposure (a realistic transmission feature) can still induce bias over time. The authors mathematically characterize the mechanism and quantitatively evaluate the bias magnitude under stochastic exposure processes, concluding that the bias warrants attention in RCT design and interpretation.
Significance. If the central derivations hold and the quantitative results demonstrate non-negligible bias under empirically plausible dependence strengths, the work would be a useful contribution to statistical methods in infectious disease epidemiology. It identifies a source of bias distinct from the usual unmeasured heterogeneity and could inform adjustments or sensitivity analyses in hazard-based analyses. The strength of the contribution rests on whether the magnitude evaluation uses realistic parameter values rather than arbitrary ones.
major comments (2)
- [§3.2, Eq. (8)] §3.2, Eq. (8): the derived expression for the observed hazard ratio under autocorrelated exposure shows bias emerging from the dependence parameter ρ, but the subsequent quantitative evaluation in §4 plugs in values of ρ without reference to empirical contact or exposure data; this makes it unclear whether the reported bias magnitudes are detectable under realistic transmission settings where dependence may be weaker.
- [Table 2] Table 2, rows for dependence strength 0.6–0.9: the simulated bias in the estimated HR reaches 15–25% by t=6 months, but no sensitivity analysis is provided for lower, empirically motivated values of temporal dependence (e.g., from household transmission studies); this weakens the claim that the bias 'should be recognized as an issue' for typical RCT designs.
minor comments (2)
- [§2] The notation for the exposure process (e.g., the definition of the autocorrelation function) is introduced in §2 but not consistently carried through the results; a brief reminder in §4 would improve readability.
- [Figure 3] Figure 3 caption does not specify the number of simulation replicates or the random seed used; this should be added for reproducibility.
Simulated Author's Rebuttal
We thank the referee for the constructive comments, which help clarify how our results connect to empirical transmission settings. We agree that additional context on realistic values of temporal dependence strengthens the practical implications for RCT design. We address each major comment below and will incorporate revisions to improve the manuscript.
read point-by-point responses
-
Referee: [§3.2, Eq. (8)] §3.2, Eq. (8): the derived expression for the observed hazard ratio under autocorrelated exposure shows bias emerging from the dependence parameter ρ, but the subsequent quantitative evaluation in §4 plugs in values of ρ without reference to empirical contact or exposure data; this makes it unclear whether the reported bias magnitudes are detectable under realistic transmission settings where dependence may be weaker.
Authors: We thank the referee for this observation. The values of ρ in Section 4 were chosen illustratively to show how bias scales with the autocorrelation parameter, as the closed-form expression in Eq. (8) is valid for any ρ in (-1,1). The derivation itself does not depend on specific empirical magnitudes. That said, we acknowledge the value of grounding the quantitative results in contact data. In revision we will add a short discussion citing household transmission studies that report positive serial dependence in exposure (arising from repeated contacts within clusters) and will include sensitivity results for weaker ρ to demonstrate when the bias remains detectable. revision: yes
-
Referee: [Table 2] Table 2, rows for dependence strength 0.6–0.9: the simulated bias in the estimated HR reaches 15–25% by t=6 months, but no sensitivity analysis is provided for lower, empirically motivated values of temporal dependence (e.g., from household transmission studies); this weakens the claim that the bias 'should be recognized as an issue' for typical RCT designs.
Authors: The referee correctly identifies that Table 2 emphasizes higher dependence strengths. We will revise the table (or add a supplementary panel) to include results for ρ = 0.2–0.5, drawing on ranges suggested by household contact studies. This will allow direct assessment of bias magnitude under more conservative dependence assumptions. Even at these lower values the cumulative selection effect produces non-negligible bias by six months in our simulations, supporting the broader claim, but the added analyses will make the recommendation more robust for typical RCT durations. revision: yes
Circularity Check
No significant circularity; direct mathematical characterization of bias mechanism
full rationale
The paper derives the emergence of selection bias in RCTs from temporal dependence in exposure within a homogeneous population (no baseline heterogeneity). This is presented as a first-principles stochastic process model whose outputs are not shown to reduce to fitted inputs or self-referential definitions. No load-bearing self-citations, uniqueness theorems, or ansatzes imported from prior author work are invoked to force the central result. Quantitative magnitude is evaluated by plugging chosen dependence parameters into the derived expressions, but these are not framed as predictions that are statistically forced by the fitting process itself. The derivation chain remains self-contained and independent of the target bias magnitude.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption RCTs consist of homogeneous participants with no individual heterogeneity at baseline
- domain assumption Exposure has temporal dependence as a realistic feature of infectious disease transmission
Reference graph
Works this paper leans on
-
[1]
Cox, D. R. (1972). Regression models and life-tables. Journal of the Royal Statistical Society Series B 34 (2), 187–220. 14
work page 1972
-
[2]
Falsey, A., M. Sobieszczyk, I. Hirsch, and et al. (2021). Phase 3 safety and e fficacy of azd1222 (chadox1 ncov-19) covid-19 vaccine. The New England Journal of Medicine 385 (25), 2348–2360
work page 2021
-
[3]
Halloran, M., I. Longini, and C. Struchiner (1996). Estimability and interp retation of vaccine efficacy using frailty mixing models. American Journal of Epidemiology 144 (1), 83–97
work page 1996
-
[4]
Halloran, M., I. Longini, and C. Struchiner (2010). Design and Analysis of Vaccine Studies . Springer
work page 2010
-
[5]
He, X., E. Laum, P. Wu, and et al. (2020). Temporal dynamics in viral shedding an d transmissibility of covid-19. Nature Medicine 26 (5), 672–675
work page 2020
-
[6]
Kahn, R., M. Hitchings, S. Bellan, and et al. (2018). Impact of stochasticall y generated hetero- geneity in hazard rates on individually randomized vaccine efficacy trial s. Clinical Trials 15 (2), 207–211
work page 2018
-
[7]
Lipsitch, M., F. Krammer, G. Regev-Yochay, and et al. (2022). Sars-cov- 2 breakthrough infections in vaccinated individuals: measurement, causes and impact. Nature Reviews Immunology 22 (1), 57–65
work page 2022
-
[8]
Nishi, A., G. Dewey, A. Endo, and et al. (2020). Network interventions for managi ng the covid-19 pandemic and sustaining economy. Proceedings of the National Academy of Sciences 117 (48), 30285–30294. O’Hagan, J., M. Lipsitch, and M. Hernan (2014). Estimating the per-exposure effect of infectious disease interventions. Epidemiology 25 (1), 134–138
work page 2020
-
[9]
Stensrud, M. and L. Smith (2023). Identification of vaccine effects when e xposure status is unknown. Epidemiology 34 (2), 216–224. U.S. Food and Drug Administration (2021). Eemergency use authorization (eu a) for an unapproved product review memorandum. 15 ”Supplement to ’Temporal exposure dependence bias in vaccine efficacy trials’” Hiroyasu Ando ∗ Departme...
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.