Estimation of time-varying treatment effects using marginal structural models dependent on partial treatment history

Masataka Taguri; Nodoka Seya; Takeo Ishii

arxiv: 2412.08042 · v3 · submitted 2024-12-11 · 📊 stat.ME

Estimation of time-varying treatment effects using marginal structural models dependent on partial treatment history

Nodoka Seya , Masataka Taguri , Takeo Ishii This is my paper

Pith reviewed 2026-05-23 07:44 UTC · model grok-4.3

classification 📊 stat.ME

keywords marginal structural modelsinverse probability weightingtime-varying treatment effectspartial treatment historyclosed testing procedurestime-varying confounding

0 comments

The pith

New inverse probability weights and closed testing let marginal structural models depend on partial treatment history for time-varying effects.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops methods to estimate time-varying treatment effects more efficiently by letting marginal structural models depend on only a partial history of treatments instead of the full sequence. Existing inverse probability weighting accumulates weights over all time points, causing inefficiency, and full-history models can be misspecified. The authors introduce new weights specific to partial histories and a closed testing procedure to select how far back the dependence goes. These changes aim to produce consistent and more efficient estimators under the model's assumptions, as shown in simulations outperforming prior approaches and in an application to hemodialysis data.

Core claim

The central claim is that new IP-weights for MSMs dependent on partial treatment history, together with closed testing procedures for selecting the partial history, provide improved estimators for time-varying treatment effects. The methods are shown to outperform existing ones in simulation studies for both estimation performance and history selection, with theoretical properties derived under known weights and extensions discussed for estimated weights, and demonstrated on real hemodialysis patient data.

What carries the argument

New inverse probability weights for marginal structural models that depend on partial treatment history, paired with closed testing procedures to determine the appropriate history length.

If this is right

The new weights reduce inefficiency from cumulating all time points in the full history.
The closed testing procedure selects the partial history length to limit misspecification bias.
Estimators achieve better performance than existing methods in simulations for both effect estimation and history selection.
The approach applies directly to real longitudinal data such as hemodialysis patient records.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The selection procedure could be applied in other longitudinal studies to simplify models without losing causal information.
If the extra assumptions can be checked from data as noted, practitioners might verify them before using the new weights.
Extensions to estimated weights could make the methods usable in observational settings where true weights are unknown.

Load-bearing premise

The theoretical consistency and efficiency of the new weights and testing procedure depend on additional assumptions beyond standard identifiability assumptions that may not always hold.

What would settle it

A simulation where the additional assumptions are violated and the proposed estimators show higher bias or lower efficiency than standard full-history methods.

Figures

Figures reproduced from arXiv: 2412.08042 by Masataka Taguri, Nodoka Seya, Takeo Ishii.

**Figure 1.** Figure 1: Plots of the selection probability of ∈ {1, 2, 3, 4} corresponding to the main effect model over 1000 simulation runs based on the data generation process described in Section 5.1 with (0, 1, 2, 1, 0, 1, 2, 3) = (0, 0, 1, 1, 0, 1, 2, 0), (a) setting 1 = 2.5 and changing 1 ∈ {0.25, 0.50, 0.75, 1.00, 1.25, 1.50, 1.75, 2.00} and (b) setting 1 = 1.5 and changing 1 ∈ {0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0}. I… view at source ↗

**Figure 2.** Figure 2: Box-plots of estimates of () over 1000 simulation runs of the first scenario (0, 1, 2, 1, 0, 1, 2, 3) = (0, 0, 1, 4, 0, 1, 2, 1) for the normal outcome. The horizontal line is drawn at true value () = 4. Twenty-two methods for estimating () with combinations of selection methods and IP-weights are compared. Six gray blocks represent selection methods, where QICw, cQICw, ztest05, ztest20, pztest05, pztest2… view at source ↗

**Figure 3.** Figure 3: Box-plots of estimates of () over 1000 simulation runs for the time-to-event outcome. The horizontal line is drawn at true value () = −0.87. Sixteen methods for estimating () with combinations of selection methods and IP-weights are compared. Four gray blocks represent selection methods, where ztest05, ztest20, pztest05, pztest20 is ˜0.05, ˜0.20, ˆ0.05, ˆ0.20, respectively. For ∈ {˜ 0.05, ˜ 0.20, ˆ 0.05, ˆ… view at source ↗

read the original abstract

Inverse probability (IP) weighting of marginal structural models (MSMs) can provide consistent estimators of time-varying treatment effects under correct model specifications and identifiability assumptions, even in the presence of time-varying confounding. However, this method has two problems: (i) inefficiency due to IP-weights cumulating all time points and (ii) bias and inefficiency due to the MSM misspecification. To address these problems, we propose (i) new IP-weights for estimating parameters of the MSM that depends on partial treatment history and (ii) closed testing procedures for selecting partial treatment history (how far back in time the MSM depends on past treatments). We derive the theoretical properties of our proposed methods under known IP-weights and discuss their extension to estimated IP-weights. Although some of our theoretical results are derived under additional assumptions beyond standard identifiability assumptions, some of which can be checked empirically from the data. In simulation studies, our proposed methods outperformed existing methods both in terms of performance in estimating time-varying treatment effects and in selecting partial treatment history. Our proposed methods have also been applied to real data of hemodialysis patients with reasonable results.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives new IP weights for partial-history MSMs plus a closed testing procedure to pick history length, but the consistency claims rest on extra assumptions whose violation is not clearly probed in the simulations.

read the letter

The paper introduces IP weights built specifically for marginal structural models that condition on only a slice of treatment history, along with a closed testing procedure to decide how much history to keep. This directly targets the two problems stated in the abstract: weight instability from cumulating over every time point and bias from forcing the MSM to depend on the full history when that may not be needed.

Referee Report

2 major / 2 minor

Summary. The paper proposes new inverse probability (IP) weights for marginal structural models (MSMs) that depend only on partial treatment history, along with closed testing procedures to select that history, in order to improve efficiency and reduce bias from full-history weighting and MSM misspecification when estimating time-varying treatment effects under time-varying confounding. Theoretical properties are derived under known weights (with extension to estimated weights) subject to additional assumptions beyond standard identifiability; simulations are reported to show outperformance versus existing methods in both estimation accuracy and history selection, with an application to hemodialysis data.

Significance. If the additional assumptions hold in practice and the reported simulation advantages are robust, the methods could yield more efficient estimators and better-calibrated model selection for longitudinal causal inference, addressing two recognized limitations of standard IP-weighted MSMs.

major comments (2)

[Simulation studies] Simulation studies section: the data-generating processes are not described as including cases that violate the additional assumptions required for the consistency and efficiency claims of the new weights and closed testing procedure. Because the central claim is outperformance in simulations, absence of such stress tests leaves open whether the reported gains persist or whether type-I error control for the testing procedure degrades when the assumptions fail.
[Theoretical results] Theoretical results section: the extension of the closed testing procedure to estimated IP-weights is stated to follow from the known-weights case, but no explicit bound or simulation evidence is given on how estimation error in the weights propagates to the family-wise error rate of the closed test under the additional assumptions.

minor comments (2)

[Abstract and introduction] The abstract states that some additional assumptions 'can be checked empirically from the data'; an explicit list of these assumptions together with the corresponding diagnostic procedures would improve readability.
[Notation and model] Notation for the partial treatment history (e.g., the truncation lag) is introduced without an early concrete numerical example; adding one would clarify the MSM specification.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback on our manuscript. We respond to each major comment below, indicating the revisions we will make to address the concerns.

read point-by-point responses

Referee: [Simulation studies] Simulation studies section: the data-generating processes are not described as including cases that violate the additional assumptions required for the consistency and efficiency claims of the new weights and closed testing procedure. Because the central claim is outperformance in simulations, absence of such stress tests leaves open whether the reported gains persist or whether type-I error control for the testing procedure degrades when the assumptions fail.

Authors: Our simulation studies are performed under the additional assumptions because the theoretical properties of the proposed weights and closed testing procedure are established under these conditions. However, we agree that examining performance when the assumptions are violated would provide valuable insight into the robustness of the methods. In the revised manuscript, we will expand the simulation section to include data-generating processes that violate the additional assumptions and report the resulting estimation accuracy and type-I error rates for the closed testing procedure. revision: yes
Referee: [Theoretical results] Theoretical results section: the extension of the closed testing procedure to estimated IP-weights is stated to follow from the known-weights case, but no explicit bound or simulation evidence is given on how estimation error in the weights propagates to the family-wise error rate of the closed test under the additional assumptions.

Authors: The manuscript notes that the results for estimated weights follow from the known-weights case under the additional assumptions, but we did not include explicit bounds or simulation studies specifically addressing the propagation of weight estimation error to the family-wise error rate. We will revise the theoretical results section to incorporate simulation evidence showing the effect of estimated weights on the closed testing procedure's error control under the assumptions. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper derives new IP-weights for MSMs depending on partial treatment history and closed testing procedures for history selection, then states theoretical properties under known weights (with extension to estimated weights) and reports simulation outperformance. No quoted equations or steps reduce a claimed prediction or result to a fitted parameter or self-citation by construction; the additional assumptions are explicitly flagged as beyond standard identifiability and the simulation claims rest on independent empirical comparison rather than tautological redefinition of inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claims rest on standard causal identifiability assumptions plus additional assumptions for the theoretical properties of the new weights and testing procedure; no free parameters or invented entities are explicitly described in the abstract.

axioms (2)

domain assumption Standard identifiability assumptions for consistent estimation of time-varying treatment effects via IP-weighted MSMs
Stated as required for the method to provide consistent estimators even with time-varying confounding.
ad hoc to paper Additional assumptions beyond standard identifiability for the theoretical properties of the proposed weights and closed testing
Explicitly noted in the abstract as required for some theoretical results, with some checkable empirically.

pith-pipeline@v0.9.0 · 5737 in / 1368 out tokens · 31077 ms · 2026-05-23T07:44:31.384387+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages

[1]

Margina l mean models for dynamic regimes

Murphy SA, van der Laan MJ, Robins JM, Group CPPR. Margina l mean models for dynamic regimes. Journal of the American Statistical Association. 2001;96(456):1410-23

work page 2001
[2]

Marginal Structural Models versus Structura l nested Models as Tools for Causal 27 inference

Robins JM. Marginal Structural Models versus Structura l nested Models as Tools for Causal 27 inference. In: Halloran ME, Berry D, editors. Statistical M odels in Epidemiology, the Environment, and Clinical Trials. New Y ork, NY : Springer New Y ork; 2000. p. 95-133

work page 2000
[3]

Causal in ference in longitudinal studies with history-restricted marginal structural models

Neugebauer R, van der Laan MJ, Joﬀe MM, Tager IB. Causal in ference in longitudinal studies with history-restricted marginal structural models. Electron ic Journal of Statistics. 2007;1:119-54

work page 2007
[4]

An information criterion for marginal structural models

Platt RW, Brookhart MA, Cole SR, Westreich D, Schisterma n EF. An information criterion for marginal structural models. Statistics in Medicine. 2013; 32(8):1383-93

work page 2013
[5]

Comments on ‘An information crite rion for marginal structural models’ by R

Taguri M, Matsuyama Y . Comments on ‘An information crite rion for marginal structural models’ by R. W. Platt, M. A. Brookhart, S. R. Cole, D. Westreich, and E . F. Schisterman. Statistics in Medicine. 2013;32(20):3590-1

work page 2013
[6]

A /u1D436/u1D45Dcriterion for semiparametric causal inference

Baba T, Kanemori T, Ninomiya Y . A /u1D436/u1D45Dcriterion for semiparametric causal inference. Biometrik a. 2017;104(4):845-61

work page 2017
[7]

Marginal structural mo dels and causal inference in epidemi- ology

Robins JM, Hernan MA, Brumback B. Marginal structural mo dels and causal inference in epidemi- ology. Epidemiology. 2000;11(5):550-60

work page 2000
[8]

Marginal structural mo dels to estimate the joint causal eﬀect of nonrandomized treatments

Hernan MA, Brumback B, Robins JM. Marginal structural mo dels to estimate the joint causal eﬀect of nonrandomized treatments. Journal of the American Stati stical Association. 2001;96(454):440-8

work page 2001
[9]

A test for the correct speciﬁcation of marginal structural models

Sall A, Aube K, Trudel X, Brisson C, Talbot D. A test for the correct speciﬁcation of marginal structural models. Statistics in Medicine. 2019;38(17):3 168-83

work page 2019
[10]

A ca utionary note concerning the use of stabilized weights in marginal structural models

Talbot D, Atherton J, Rossi AM, Bacon SL, Lefebvre G. A ca utionary note concerning the use of stabilized weights in marginal structural models. Statist ics in Medicine. 2015;34(5):812-23

work page 2015
[11]

In: Causality and Structural Models in Social S cience and Economics

Pearl J. In: Causality and Structural Models in Social S cience and Economics. Cambridge University Press; 2009. p. 133-72. 28

work page 2009
[12]

Marginal structural m odels to estimate the causal eﬀect of zidovudine on the survival of HIV-positive men

Hernan MA, Brumback B, Robins JM. Marginal structural m odels to estimate the causal eﬀect of zidovudine on the survival of HIV-positive men. Epidemiolo gy. 2000;11(5):561-70

work page 2000
[13]

Simulation from a known Cox MSM using standard parametric models for the g-formula

Y oung JG, Tchetgen Tchetgen EJ. Simulation from a known Cox MSM using standard parametric models for the g-formula. Statistics in Medicine. 2014;33( 6):1001-14

work page 2014
[14]

Allopurinol, febuxostat, and nonuse of xanthine oxidoreductase inhibitor treatment in patient s receiving Hemodialysis: A Longitudinal Analysis

Ishii T, Seya N, Taguri M, Wakui H, Y oshimura A, Tamura K. Allopurinol, febuxostat, and nonuse of xanthine oxidoreductase inhibitor treatment in patient s receiving Hemodialysis: A Longitudinal Analysis. Kidney Medicine. 2024;6(11):100896

work page 2024
[15]

MICE: Multivaria te imputation by chained equations in R

Van Buuren S, Groothuis-Oudshoorn K. MICE: Multivaria te imputation by chained equations in R. Journal of Statistical Software. 2011;45:1-67

work page 2011
[16]

Post-selection inference [Journal Article]

Kuchibhotla AK, Kolassa JE, Kuﬀner TA. Post-selection inference [Journal Article]. Annual Review of Statistics and Its Application. 2022;9(Volume 9, 2022): 505-27

work page 2022
[17]

Targeted maximum likelihood estimation for dynamic and static longitudinalmarginal structural working models

Petersen M, Schwab J, Gruber S, Blaser N, Schomaker M, va n der Laan M. Targeted maximum likelihood estimation for dynamic and static longitudinalmarginal structural working models. Journal of Causal Inference. 2014;2(2):147-85

work page 2014
[18]

Robust estimation in sequentially ignorabl e missing data and causal inference models

Robins JM. Robust estimation in sequentially ignorabl e missing data and causal inference models. Proceedings of the American Statistical Association Section on Bayesian Statistical Science. 2000:6- 10

work page 2000
[19]

Commentary on ’Using inverse weighting and p redictive inference to estimate the eﬀects of time-varying treatments on the discrete-time hazard’

Robins JM. Commentary on ’Using inverse weighting and p redictive inference to estimate the eﬀects of time-varying treatments on the discrete-time hazard’. S tatistics in Medicine. 2002;21(12):1663- 80. 29

work page 2002
[20]

Multiply robust estimators of causal eﬀects for survival outcomes

Wen L, Hernan MA, Robins JM. Multiply robust estimators of causal eﬀects for survival outcomes. Scandinavian Journal of Statistics. 2022;49(3):1304-28

work page 2022
[21]

Robust estimation of inverse probability weights for marginal structural models

Imai K, Ratkovic M. Robust estimation of inverse probability weights for marginal structural models. Journal of the American Statistical Association. 2015;110 (511):1013-23. 30 A Identiﬁability assumptions A.1 Identiﬁability assumptions of E[ /u1D44C¯/u1D44E] for ¯/u1D44E∈ ¯A (A1) consistency If ¯/u1D434= ¯/u1D44E,then /u1D44C= /u1D44C¯/u1D44E, for ¯/u1D4...

work page 2015

[1] [1]

Margina l mean models for dynamic regimes

Murphy SA, van der Laan MJ, Robins JM, Group CPPR. Margina l mean models for dynamic regimes. Journal of the American Statistical Association. 2001;96(456):1410-23

work page 2001

[2] [2]

Marginal Structural Models versus Structura l nested Models as Tools for Causal 27 inference

Robins JM. Marginal Structural Models versus Structura l nested Models as Tools for Causal 27 inference. In: Halloran ME, Berry D, editors. Statistical M odels in Epidemiology, the Environment, and Clinical Trials. New Y ork, NY : Springer New Y ork; 2000. p. 95-133

work page 2000

[3] [3]

Causal in ference in longitudinal studies with history-restricted marginal structural models

Neugebauer R, van der Laan MJ, Joﬀe MM, Tager IB. Causal in ference in longitudinal studies with history-restricted marginal structural models. Electron ic Journal of Statistics. 2007;1:119-54

work page 2007

[4] [4]

An information criterion for marginal structural models

Platt RW, Brookhart MA, Cole SR, Westreich D, Schisterma n EF. An information criterion for marginal structural models. Statistics in Medicine. 2013; 32(8):1383-93

work page 2013

[5] [5]

Comments on ‘An information crite rion for marginal structural models’ by R

Taguri M, Matsuyama Y . Comments on ‘An information crite rion for marginal structural models’ by R. W. Platt, M. A. Brookhart, S. R. Cole, D. Westreich, and E . F. Schisterman. Statistics in Medicine. 2013;32(20):3590-1

work page 2013

[6] [6]

A /u1D436/u1D45Dcriterion for semiparametric causal inference

Baba T, Kanemori T, Ninomiya Y . A /u1D436/u1D45Dcriterion for semiparametric causal inference. Biometrik a. 2017;104(4):845-61

work page 2017

[7] [7]

Marginal structural mo dels and causal inference in epidemi- ology

Robins JM, Hernan MA, Brumback B. Marginal structural mo dels and causal inference in epidemi- ology. Epidemiology. 2000;11(5):550-60

work page 2000

[8] [8]

Marginal structural mo dels to estimate the joint causal eﬀect of nonrandomized treatments

Hernan MA, Brumback B, Robins JM. Marginal structural mo dels to estimate the joint causal eﬀect of nonrandomized treatments. Journal of the American Stati stical Association. 2001;96(454):440-8

work page 2001

[9] [9]

A test for the correct speciﬁcation of marginal structural models

Sall A, Aube K, Trudel X, Brisson C, Talbot D. A test for the correct speciﬁcation of marginal structural models. Statistics in Medicine. 2019;38(17):3 168-83

work page 2019

[10] [10]

A ca utionary note concerning the use of stabilized weights in marginal structural models

Talbot D, Atherton J, Rossi AM, Bacon SL, Lefebvre G. A ca utionary note concerning the use of stabilized weights in marginal structural models. Statist ics in Medicine. 2015;34(5):812-23

work page 2015

[11] [11]

In: Causality and Structural Models in Social S cience and Economics

Pearl J. In: Causality and Structural Models in Social S cience and Economics. Cambridge University Press; 2009. p. 133-72. 28

work page 2009

[12] [12]

Marginal structural m odels to estimate the causal eﬀect of zidovudine on the survival of HIV-positive men

Hernan MA, Brumback B, Robins JM. Marginal structural m odels to estimate the causal eﬀect of zidovudine on the survival of HIV-positive men. Epidemiolo gy. 2000;11(5):561-70

work page 2000

[13] [13]

Simulation from a known Cox MSM using standard parametric models for the g-formula

Y oung JG, Tchetgen Tchetgen EJ. Simulation from a known Cox MSM using standard parametric models for the g-formula. Statistics in Medicine. 2014;33( 6):1001-14

work page 2014

[14] [14]

Allopurinol, febuxostat, and nonuse of xanthine oxidoreductase inhibitor treatment in patient s receiving Hemodialysis: A Longitudinal Analysis

Ishii T, Seya N, Taguri M, Wakui H, Y oshimura A, Tamura K. Allopurinol, febuxostat, and nonuse of xanthine oxidoreductase inhibitor treatment in patient s receiving Hemodialysis: A Longitudinal Analysis. Kidney Medicine. 2024;6(11):100896

work page 2024

[15] [15]

MICE: Multivaria te imputation by chained equations in R

Van Buuren S, Groothuis-Oudshoorn K. MICE: Multivaria te imputation by chained equations in R. Journal of Statistical Software. 2011;45:1-67

work page 2011

[16] [16]

Post-selection inference [Journal Article]

Kuchibhotla AK, Kolassa JE, Kuﬀner TA. Post-selection inference [Journal Article]. Annual Review of Statistics and Its Application. 2022;9(Volume 9, 2022): 505-27

work page 2022

[17] [17]

Targeted maximum likelihood estimation for dynamic and static longitudinalmarginal structural working models

Petersen M, Schwab J, Gruber S, Blaser N, Schomaker M, va n der Laan M. Targeted maximum likelihood estimation for dynamic and static longitudinalmarginal structural working models. Journal of Causal Inference. 2014;2(2):147-85

work page 2014

[18] [18]

Robust estimation in sequentially ignorabl e missing data and causal inference models

Robins JM. Robust estimation in sequentially ignorabl e missing data and causal inference models. Proceedings of the American Statistical Association Section on Bayesian Statistical Science. 2000:6- 10

work page 2000

[19] [19]

Commentary on ’Using inverse weighting and p redictive inference to estimate the eﬀects of time-varying treatments on the discrete-time hazard’

Robins JM. Commentary on ’Using inverse weighting and p redictive inference to estimate the eﬀects of time-varying treatments on the discrete-time hazard’. S tatistics in Medicine. 2002;21(12):1663- 80. 29

work page 2002

[20] [20]

Multiply robust estimators of causal eﬀects for survival outcomes

Wen L, Hernan MA, Robins JM. Multiply robust estimators of causal eﬀects for survival outcomes. Scandinavian Journal of Statistics. 2022;49(3):1304-28

work page 2022

[21] [21]

Robust estimation of inverse probability weights for marginal structural models

Imai K, Ratkovic M. Robust estimation of inverse probability weights for marginal structural models. Journal of the American Statistical Association. 2015;110 (511):1013-23. 30 A Identiﬁability assumptions A.1 Identiﬁability assumptions of E[ /u1D44C¯/u1D44E] for ¯/u1D44E∈ ¯A (A1) consistency If ¯/u1D434= ¯/u1D44E,then /u1D44C= /u1D44C¯/u1D44E, for ¯/u1D4...

work page 2015