Evaluating the impact of longitudinal treatment strategies in the presence of informative monitoring and time-dependent confounding
Pith reviewed 2026-05-10 15:55 UTC · model grok-4.3
The pith
Incorporating monitoring indicators as time-dependent confounders removes bias from estimates of longitudinal treatment effects on time-to-event outcomes in electronic health records.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that the causal effects of longitudinal treatment strategies on survival outcomes can be consistently estimated by adapting inverse probability weighting, G-computation, and longitudinal targeted maximum likelihood estimation to include monitoring indicator variables as additional time-dependent confounders. This adaptation handles both static strategies such as always treat or never treat and more general strategies that allow variation in treatment initiation and duration, provided the monitoring process is captured by these indicators and the usual causal assumptions continue to hold after their inclusion.
What carries the argument
Monitoring indicator variables, included as time-dependent confounders that affect both the treatment process and the outcome.
If this is right
- Ignoring monitoring indicators produces biased estimates of treatment effects on survival.
- The three adapted methods recover unbiased estimates when monitoring indicators are treated as time-dependent confounders.
- The approach works for both static treatment rules and rules that allow flexible timing of treatment start and stop.
- In the intensive care example, the methods support comparison of early and delayed ventilation strategies on mortality risk.
- Simulation evidence confirms that bias from informative monitoring is removed by the adaptation.
Where Pith is reading between the lines
- The same monitoring-indicator adjustment may apply to other irregularly observed data sources such as insurance claims or wearable sensor records.
- Future studies using electronic health records could routinely record or model monitoring patterns to enable this correction.
- Researchers should verify that positivity holds for the combined set of treatment and monitoring variables after inclusion.
Load-bearing premise
That including the monitoring indicators fully accounts for the dependence between monitoring frequency and health status, with no remaining unmeasured factors linking monitoring to treatment or outcome.
What would settle it
A simulation in which the true treatment effect is known but the adapted methods still show bias after adding the monitoring indicators, or a dataset where positivity fails for certain monitoring patterns leading to unstable estimates.
Figures
read the original abstract
Routinely collected data from electronic health records (EHR) provide opportunities to study effects of longitudinal treatment strategies in real-world clinical settings. A challenge presented by EHR data is that frequency of covariate monitoring differs by patient, covariate type and over time, and may be informative about a patient's health status. Many causal inference methods assume measurements of covariates are observed at a common set of regular time points. In this paper we describe and evaluate methods for estimating causal effects of longitudinal treatments on time-to-event outcomes in the presence of informative monitoring of time-dependent confounders. We show how methods based on inverse probability weighting, G-computation and longitudinal targeted maximum likelihood estimation (TMLE) can be adapted to allow for informative monitoring by incorporating monitoring indicator variables as additional time-dependent confounders. We evaluate these methods using a simulation study, comparing against more simple approaches that ignore monitoring variables. We demonstrate that ignoring monitoring can result in biased estimates of treatment effects. The methods are illustrated through an investigation into the effect of early versus delayed initiation of invasive mechanical ventilation on mortality of intensive care patients using routinely-collected data from an intensive care unit. We consider static treatment strategies such as `always treat' and `never treat' but also generalise to treatment strategies that allow for flexibility in the exact initiation time and duration of treatment.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes adapting inverse probability weighting, G-computation, and longitudinal targeted maximum likelihood estimation to estimate causal effects of longitudinal treatment strategies on time-to-event outcomes when time-dependent confounders are subject to informative monitoring in electronic health record data. By treating binary monitoring indicators as additional time-dependent confounders, the adapted methods aim to correct for dependence between monitoring frequency and patient health status. Simulations demonstrate bias in naive approaches that ignore monitoring and recovery of unbiased estimates when indicators are included; the methods are illustrated with an application to early versus delayed mechanical ventilation in ICU data, considering both static and flexible treatment strategies.
Significance. If the adaptation correctly identifies the target causal parameter under the stated assumptions, the work would be significant for applied causal inference in routinely collected clinical data, where informative monitoring is ubiquitous. The simulation provides a direct external check against known truth, and the generalization to flexible treatment strategies broadens applicability. This could improve reliability of observational studies on treatment timing and duration in critical care and similar settings.
major comments (2)
- [Methods] Methods section (description of adapted g-formula/IPW/TMLE): Treating the monitoring indicator M_t as an ordinary time-dependent confounder to be conditioned on does not automatically identify the causal effect when L_t is only observed conditional on M_t=1. The manuscript does not specify the handling of missing L_t values (e.g., last-observation-carried-forward, imputation, or complete-case), leaving open the possibility that E[L_t | history, M_t=0] remains unidentified and that residual bias persists after weighting or G-computation on M alone. This assumption is load-bearing for the central claim that the adapted estimators recover the truth.
- [Simulation study] Simulation study (data-generating process and results tables): The reported recovery of unbiased estimates when including M_t assumes the simulation's missingness mechanism matches the real-data process exactly. If the simulation generates L_t unconditionally and then sets M_t, rather than generating L_t only when M_t=1, the simulation may understate bias from the missing-not-at-random structure typical in EHR data, weakening the evidence that the adaptation fully corrects for informative monitoring.
minor comments (2)
- [Notation and data structure] The notation for the observed data structure (history of A, L, M, Y) could be clarified with an explicit diagram or table showing which variables are observed at each time point under different M_t values.
- [Application] In the ICU application, the definition of the 'flexible' treatment strategies (allowing variation in initiation time and duration) should include explicit rules for how monitoring affects eligibility or censoring.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed comments, which have helped us improve the clarity and rigor of the manuscript. We address each major comment point by point below.
read point-by-point responses
-
Referee: [Methods] Methods section (description of adapted g-formula/IPW/TMLE): Treating the monitoring indicator M_t as an ordinary time-dependent confounder to be conditioned on does not automatically identify the causal effect when L_t is only observed conditional on M_t=1. The manuscript does not specify the handling of missing L_t values (e.g., last-observation-carried-forward, imputation, or complete-case), leaving open the possibility that E[L_t | history, M_t=0] remains unidentified and that residual bias persists after weighting or G-computation on M alone. This assumption is load-bearing for the central claim that the adapted estimators recover the truth.
Authors: We appreciate the referee highlighting the need for explicit clarification on this point. In the adapted estimators, M_t is incorporated into the observed history for all models (treatment, monitoring, covariate, and outcome). For g-computation and TMLE, the conditional distributions are estimated from the observed data (i.e., L_t is modeled only on instances where M_t=1), and predictions for L_t at times with M_t=0 are obtained by evaluating the fitted model at the observed history including M_t=0. This approach assumes that, conditional on the full observed history and M_t, the monitoring mechanism accounts for the dependence, with no additional unmeasured factors linking monitoring to the outcome. We have revised the Methods section to explicitly describe this handling of unobserved L_t values and to state the identifying assumptions more clearly, including a note that the framework relies on correct specification of the monitoring process. revision: yes
-
Referee: [Simulation study] Simulation study (data-generating process and results tables): The reported recovery of unbiased estimates when including M_t assumes the simulation's missingness mechanism matches the real-data process exactly. If the simulation generates L_t unconditionally and then sets M_t, rather than generating L_t only when M_t=1, the simulation may understate bias from the missing-not-at-random structure typical in EHR data, weakening the evidence that the adaptation fully corrects for informative monitoring.
Authors: We agree that the simulation must faithfully represent the informative monitoring process. Our data-generating mechanism generates the full underlying covariate trajectory L_t at every time point (reflecting that patient health status evolves continuously), then draws M_t conditionally on current and past L_t values (inducing MNAR missingness for the observed data), and applies the estimators to the resulting partially observed dataset while including M_t in all models. This is the standard and appropriate way to simulate MNAR monitoring in longitudinal settings, as the latent health process exists regardless of whether it is measured. We have expanded the simulation description and supplementary material with the exact DGP equations to make this structure explicit and to emphasize how it mirrors EHR data. revision: partial
Circularity Check
No significant circularity; simulation provides external validation
full rationale
The paper adapts IPW, G-computation and longitudinal TMLE by treating monitoring indicators M_t as additional time-dependent confounders in the adjustment set. This is a modeling choice justified by the data-generating process description rather than by definition or self-citation. The central claim is evaluated via simulation against known ground truth and via real-data illustration; neither step reduces to a fitted quantity renamed as prediction nor to a load-bearing self-citation chain. No equations or derivations are shown to be tautological with their inputs.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Standard identifiability assumptions for longitudinal causal inference hold after including monitoring indicators.
Reference graph
Works this paper leans on
-
[1]
P. J. Clare, T. A. Dobbins, and R. P. Mattick, Causal models adjusting for time-varying confounding—a systematic review of the literature.International Journal of Epidemiology, 2019. 48(1): p. 254–265
work page 2019
-
[2]
R. M. Daniel, S. N. Cousens, B. L. De Stavola, et al., Methods for dealing with time-dependent con- founding.Statistics in Medicine, 2013. 32(9): p. 1584–1618
work page 2013
-
[3]
J. M. Robins, M. ´A. Hern´ an, and B. Brumback, Marginal structural models and causal inference in epidemiology.Epidemiology, 2000. 11(5): p. 550–560
work page 2000
-
[4]
M. ´A. Hern´ an, B. Brumback, and J. M. Robins, Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men.Epidemiology, 2000. 11(5): p. 561–570
work page 2000
-
[5]
J. Robins, A new approach to causal inference in mortality studies with a sustained exposure period- application to control of the healthy worker survivor effect.Mathematical Modelling, 1986. 7(9-12). Publisher: Pergamon: p. 1393–1512
work page 1986
-
[6]
R. M. Daniel, B. L. De Stavola, and S. N. Cousens, Gformula: Estimating Causal Effects in the Presence of Time-Varying Confounding or Mediation using the G-Computation Formula.The Stata Journal: Promoting communications on statistics and Stata, 2011. 11(4): p. 479–517
work page 2011
-
[7]
M. J. v. d. Laan and D. Rubin, Targeted Maximum Likelihood Learning.The International Journal of Biostatistics, 2006. 2(1)
work page 2006
-
[8]
M. J. v. d. Laan and S. Gruber, Targeted Minimum Loss Based Estimation of Causal Effects of Multiple Time Point Interventions.The International Journal of Biostatistics, 2012. 8(1)
work page 2012
-
[9]
M. J. v. d. Laan and S. Rose, Targeted Learning: Causal Inference for Observational and Experimental Data. Springer Science & Business Media, 2011. 678 pp
work page 2011
- [10]
-
[11]
R. Neugebauer, J. A. Schmittdiel, A. S. Adams, et al., Identification of the Joint Effect of a Dynamic Treatment Intervention and a Stochastic Monitoring Intervention Under the No Direct Effect Assump- tion.Journal of Causal Inference, 2017. 5(1)
work page 2017
- [12]
-
[13]
M. A. Hern´ an, M. McAdams, N. McGrath, et al., Observation plans in longitudinal studies with time- varying treatments.Statistical Methods in Medical Research, 2009. 18(1): p. 27–52
work page 2009
-
[14]
K. N. Wanis, A. L. Sarvet, L. Wen, et al., Grace periods in comparative effectiveness studies of sustained treatments.Journal of the Royal Statistical Society. Series A, (Statistics in Society), 2024. 187(3): p. 796–810
work page 2024
-
[15]
I. Torjesen, Covid-19: When to start invasive ventilation is“the million dollar question”.BMJ (Clinical research ed.), 2021. 372: p. 121–121
work page 2021
-
[16]
B. K. Patel, J. P. Kress, and J. B. Hall, Alternatives to Invasive Ventilation in the COVID-19 Pandemic. JAMA - Journal of the American Medical Association, 2020. 324(1): p. 43–44
work page 2020
-
[17]
P. M. Lepper and R. M. Muellenbach, Mechanical ventilation in early COVID-19 ARDS.EClini- calMedicine, 2020. 28
work page 2020
-
[18]
E. Papoutsi, V. G. Giannakoulis, E. Xourgia, et al., Effect of timing of intubation on clinical outcomes of critically ill patients with COVID-19: a systematic review and meta-analysis of non-randomized cohort studies. en.Critical Care, 2021. 25(1): p. 121
work page 2021
-
[19]
K. N. Kangelaris, L. B. Ware, C. Y. Wang, et al., Timing of Intubation and Clinical Outcomes in Adults With Acute Respiratory Distress Syndrome. eng.Critical Care Medicine, 2016. 44(1): p. 120–129
work page 2016
-
[20]
P. R. Bauer, O. Gajic, R. Nanchal, et al., Association between timing of intubation and outcome in critically ill patients: A secondary analysis of the ICON audit. eng.Journal of Critical Care, 2017. 42: p. 1–5
work page 2017
-
[21]
N. A. Xixi, P. Kremmydas, E. Xourgia, et al., Association between timing of intubation and clinical outcomes of critically ill patients: A meta-analysis. eng.Journal of Critical Care, 2022. 71: p. 154062
work page 2022
-
[22]
J. Gonz´ alez, I. D. Ben´ ıtez, D. de Gonzalo-Calvo, et al., Impact of time to intubation on mortality and pulmonary sequelae in critically ill patients with COVID-19: a prospective cohort study. eng.Critical Care (London, England), 2022. 26(1): p. 18
work page 2022
-
[23]
K. N. Wanis, A. L. Madenci, S. Hao, et al., Emulating Target Trials Comparing Early and Delayed Intubation Strategies.Chest, 2023. 164(4): p. 885–891. [24]url:https://hic.nihr.ac.uk/critical+care
work page 2023
-
[24]
S. Harris, S. Shi, D. Brealey, et al., Critical Care Health Informatics Collaborative (CCHIC): Data, tools and methods for reproducible research: A multi-centre UK intensive care database.International Journal of Medical Informatics, 2018. 112. Publisher: Elsevier Ireland Ltd: p. 82–89. 32
work page 2018
-
[25]
M. Petersen, J. Schwab, S. Gruber, et al., Targeted Maximum Likelihood Estimation for Dynamic and Static Longitudinal Marginal Structural Working Models.Journal of Causal Inference, 2014. 2(2): p. 147–185
work page 2014
-
[26]
L. Wen, J. G. Young, J. M. Robins, et al., Parametric g-formula implementations for causal survival analyses. en.Biometrics, 2021. 77(2). eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1111/biom.13321: p. 740–753
-
[27]
H. Bang and J. M. Robins, Doubly robust estimation in missing data and causal inference models. Biometrics, 2005. 61(4): p. 962–973
work page 2005
-
[28]
T. P. Morris, I. R. White, and M. J. Crowther, Using simulation studies to evaluate statistical methods. Statistics in Medicine, 2019. 38(11): p. 2074–2102
work page 2019
-
[29]
R. V. Phillips, M. J. van der Laan, H. Lee, et al., Practical considerations for specifying a super learner. International Journal of Epidemiology, 2023. 52(4): p. 1276–1285
work page 2023
-
[30]
R. H. Keogh, S. R. Seaman, J. M. Gran, et al., Simulating longitudinal data from marginal structural models using the additive hazard model.Biometrical Journal, 2021: p. 1–16
work page 2021
-
[31]
M. A. Hern´ an and J. M. Robins, Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available.American Journal of Epidemiology, 2016. 183(8): p. 758–764
work page 2016
-
[32]
J. G. Young, M. J. Stensrud, E. J. Tchetgen Tchetgen, et al., A causal framework for classical statistical estimands in failure-time settings with competing events.Statistics in Medicine, 2020. 39(8): p. 1199– 1236. 33
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.