pith. sign in

arxiv: 2606.04134 · v1 · pith:I7WANIDEnew · submitted 2026-06-02 · 📊 stat.ME

Optimal Treatment Policy Estimation for Recurrent Events with a Competing Terminal Event: An Instrumented Difference-in-Differences Approach

Pith reviewed 2026-06-28 08:33 UTC · model grok-4.3

classification 📊 stat.ME
keywords optimal treatment policyrecurrent eventscompeting terminal eventinstrumented difference-in-differencesmultiply robust estimatoradministrative health dataType 2 diabetes
0
0 comments X

The pith

An instrumented difference-in-differences approach estimates optimal treatment policies for recurrent events while accounting for a competing terminal event.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The authors develop a method to learn optimal treatment policies from administrative health data for chronic conditions involving repeated events like hospitalizations that can end with a terminal event such as death. By using policy changes as instruments in a difference-in-differences design, the framework accommodates unmeasured differences between populations that persist over time. This setup is more suitable for real-world data than traditional instrumental variable or difference-in-differences methods. The resulting multiply robust estimator remains consistent as long as certain combinations of the models for treatment assignment, outcome, or other components are correctly specified. They prove its large-sample properties and test it in simulations before using it on Medicare records to optimize diabetes care strategies.

Core claim

Two distinct inverse probability weighted identifications are derived for the optimal policy under the instrumented difference-in-differences design, yielding a multiply robust estimator that is consistent if any one of several subsets of the nuisance models is correctly specified, with consistency and asymptotic normality established by large-sample theory.

What carries the argument

The multiply robust estimator based on inverse probability weighted identifications from the instrumented difference-in-differences design.

If this is right

  • The framework enables estimation of policies that minimize recurrent events without increasing the risk of the terminal event.
  • Administrative health data can be used for policy learning despite unmeasured confounding.
  • The estimator achieves consistency under weaker conditions than standard methods.
  • Superior finite-sample performance is shown compared to existing methods in simulations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar methods could be developed for other types of outcomes in longitudinal health studies.
  • Application to additional datasets from policy variations in different countries could validate the approach.
  • The method suggests that optimal policies should always be evaluated jointly for event reduction and survival impact.

Load-bearing premise

The assumptions of the instrumented difference-in-differences design are satisfied, particularly that policy changes create valid treatment variation and unmeasured population differences remain constant over time.

What would settle it

If simulations are run where the policy-induced variation is correlated with time-varying unmeasured factors, and the estimator fails to recover the true optimal policy, the central claim would be falsified.

Figures

Figures reproduced from arXiv: 2606.04134 by Ashkan Ertefaie, James Flory, Ritoban Kundu, Sean Hennessy.

Figure 1
Figure 1. Figure 1: Percentage of Correct Decisions (PCD) for estimated optimal treatment policies [PITH_FULL_IMAGE:figures/full_fig_p023_1.png] view at source ↗
read the original abstract

Learning reproducible and generalizable optimal treatment policies for chronic diseases requires large, representative populations with long-term follow-up. Administrative health data provide a natural starting point, but their use is often limited by unmeasured confounding. We address this by proposing a novel framework based on Instrumented Difference-in-Differences (iDID) to estimate optimal policies for recurrent event outcomes subject to a terminating event. The iDID design is particularly useful in this setting because it leverages policy-induced treatment variation while allowing for persistent unmeasured differences across populations, relying on assumptions that are more plausible for administrative health data than those required by conventional IV or DID approaches. A key feature of our approach is that it explicitly addresses the fundamental challenge of avoiding policies that trivially reduce recurrent adverse events by increasing mortality. We derive two distinct Inverse Probability Weighted identifications and develop a multiply robust estimator that achieves consistency if any one of several subsets of nuisance models is correctly specified. We establish the estimator's consistency and asymptotic normality through large-sample theory and demonstrate its superior finite-sample performance over existing methods via simulation. Finally, we apply this framework to a national Medicare dataset to optimize first-line Type 2 Diabetes strategies, specifically targeting the minimization of disease-related hospitalizations while accounting for survival.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 3 minor

Summary. The manuscript proposes an instrumented difference-in-differences (iDID) framework to estimate optimal treatment policies for recurrent-event outcomes subject to a competing terminal event. It derives two distinct inverse-probability-weighted identification expressions, constructs a multiply-robust estimator whose consistency holds if any one of several nuisance-model subsets is correct, proves consistency and asymptotic normality, reports superior finite-sample performance in simulations relative to existing methods, and applies the estimator to national Medicare data to optimize first-line Type 2 diabetes regimens while penalizing policies that increase mortality.

Significance. If the identification results and asymptotic theory hold, the work supplies a practically relevant tool for policy learning from administrative health data that routinely exhibit unmeasured confounding and right-censoring by death. The explicit handling of the terminal event to avoid trivial mortality-increasing policies, the multiply-robust property, and the provision of large-sample theory constitute clear strengths. The simulation comparisons and real-data illustration further support potential utility in chronic-disease settings.

minor comments (3)
  1. The abstract states that two IPW identifications are derived but does not indicate whether the two forms are algebraically equivalent or whether one is preferred under particular data configurations; a brief comparison in the main text would clarify this.
  2. Notation for the recurrent-event and terminal-event processes is introduced without an explicit table of symbols; adding such a table would improve readability for readers unfamiliar with competing-risks notation.
  3. The simulation section reports superior performance but does not state the exact sample sizes, number of Monte Carlo replications, or the precise nuisance-model misspecification patterns used; these details should be added for reproducibility.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the supportive summary, recognition of the method's strengths in handling unmeasured confounding and the terminal event, and the recommendation for minor revision. No specific major comments were provided in the report.

Circularity Check

0 steps flagged

No circularity: derivations rely on external policy variation and stated assumptions

full rationale

The paper derives two IPW identifications and a multiply-robust estimator for optimal policies under iDID, proving consistency and asymptotic normality via large-sample theory plus simulation validation. No equations or steps reduce the target policy or estimator to a fitted input by construction, nor do self-citations serve as load-bearing justifications for uniqueness or ansatzes. The central identification uses policy-induced treatment variation as an external source, with assumptions presented as domain-plausible rather than internally derived, rendering the chain self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based solely on the abstract, the central claim rests on the plausibility of iDID assumptions for administrative data and the multiply robust property under correct specification of at least one nuisance model subset; no free parameters, invented entities, or additional axioms are detailed.

axioms (1)
  • domain assumption iDID assumptions are more plausible for administrative health data than conventional IV or DID approaches
    Explicitly stated as a key feature enabling the framework.

pith-pipeline@v0.9.1-grok · 5761 in / 1334 out tokens · 27739 ms · 2026-06-28T08:33:13.888701+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

92 extracted references · 2 linked inside Pith

  1. [1]

    Biometrika , pages=

    A semiparametric instrumented difference-in-differences approach to policy learning , author=. Biometrika , pages=. 2025 , publisher=

  2. [2]

    Journal of Machine Learning Research , volume=

    Efficient and robust transfer learning of optimal individualized treatment regimes with right-censored survival data , author=. Journal of Machine Learning Research , volume=

  3. [3]

    Sharp instruments for classifying compliers and generalizing causal effects , author=

  4. [4]

    2008 , publisher=

    Introduction to empirical processes and semiparametric inference , author=. 2008 , publisher=

  5. [5]

    Annals of statistics , volume=

    Statistical inference for the mean outcome under a possibly non-unique optimal treatment strategy , author=. Annals of statistics , volume=

  6. [6]

    2019 , publisher=

    Dynamic treatment regimes: Statistical methods for precision medicine , author=. 2019 , publisher=

  7. [7]

    Econometrica , volume=

    Policy learning with observational data , author=. Econometrica , volume=. 2021 , publisher=

  8. [8]

    Journal of the American Statistical Association , volume=

    Off-policy confidence interval estimation with confounded markov decision process , author=. Journal of the American Statistical Association , volume=. 2024 , publisher=

  9. [9]

    International Conference on Machine Learning , pages=

    Doubly robust distributionally robust off-policy evaluation and learning , author=. International Conference on Machine Learning , pages=. 2022 , organization=

  10. [10]

    Annual review of statistics and its application , volume=

    Precision medicine , author=. Annual review of statistics and its application , volume=. 2019 , publisher=

  11. [11]

    Advances in neural information processing systems , volume=

    Balanced policy evaluation and learning , author=. Advances in neural information processing systems , volume=

  12. [12]

    arXiv preprint arXiv:1611.02314 , year=

    Robust hybrid learning for estimating personalized dynamic treatment regimens , author=. arXiv preprint arXiv:1611.02314 , year=

  13. [13]

    Journal of the American Statistical Association , year=

    Matched learning for optimizing individualized treatment strategies using electronic health records , author=. Journal of the American Statistical Association , year=

  14. [14]

    Advances in neural information processing systems , volume=

    Confounding-robust policy improvement , author=. Advances in neural information processing systems , volume=

  15. [15]

    2015 , publisher=

    Adaptive treatment strategies in practice: planning trials and analyzing data for personalized medicine , author=. 2015 , publisher=

  16. [16]

    Journal of the American Statistical Association , volume=

    Quantile-optimal treatment regimes , author=. Journal of the American Statistical Association , volume=. 2018 , publisher=

  17. [17]

    SIAM Journal on Optimization , volume=

    Estimation of individualized decision rules based on an optimized covariate-dependent equivalent of random outcomes , author=. SIAM Journal on Optimization , volume=. 2019 , publisher=

  18. [18]

    2010 , publisher=

    Dynamic regime marginal structural mean models for estimation of optimal dynamic treatment regimes, part I: main content , author=. 2010 , publisher=

  19. [19]

    Proceedings of the Second Seattle Symposium in Biostatistics: analysis of correlated data , pages=

    Optimal structural nested models for optimal sequential decisions , author=. Proceedings of the Second Seattle Symposium in Biostatistics: analysis of correlated data , pages=. 2004 , organization=

  20. [20]

    Annals of statistics , volume=

    Performance guarantees for individualized treatment rules , author=. Annals of statistics , volume=

  21. [21]

    Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=

    Optimal dynamic treatment regimes , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2003 , publisher=

  22. [22]

    Statistical science: a review journal of the Institute of Mathematical Statistics , volume=

    Q-and A-learning methods for estimating optimal dynamic treatment regimes , author=. Statistical science: a review journal of the Institute of Mathematical Statistics , volume=

  23. [23]

    Annals of statistics , volume=

    High-dimensional A-learning for optimal dynamic treatment regimes , author=. Annals of statistics , volume=

  24. [24]

    Stat , volume=

    Estimating optimal treatment regimes from a classification perspective , author=. Stat , volume=. 2012 , publisher=

  25. [25]

    Biometrics , volume=

    A robust method for estimating optimal treatment regimes , author=. Biometrics , volume=. 2012 , publisher=

  26. [26]

    Journal of the American Statistical Association , volume=

    Estimating individualized treatment rules using outcome weighted learning , author=. Journal of the American Statistical Association , volume=. 2012 , publisher=

  27. [27]

    The international journal of biostatistics , volume=

    Statistical issues and limitations in personalized medicine research with clinical trials , author=. The international journal of biostatistics , volume=

  28. [28]

    Biometrika , volume=

    Tree-based methods for individualized treatment regimes , author=. Biometrika , volume=. 2015 , publisher=

  29. [29]

    Biometrics , volume=

    Using decision lists to construct interpretable and parsimonious treatment regimes , author=. Biometrics , volume=. 2015 , publisher=

  30. [30]

    Journal of the American Statistical Association , volume=

    Interpretable dynamic treatment regimes , author=. Journal of the American Statistical Association , volume=. 2018 , publisher=

  31. [31]

    Advances in Neural Information Processing Systems , volume=

    Semi-parametric efficient policy learning with continuous actions , author=. Advances in Neural Information Processing Systems , volume=

  32. [32]

    Journal of Business & Economic Statistics , volume=

    Double debiased machine learning nonparametric inference with continuous treatments , author=. Journal of Business & Economic Statistics , volume=. 2026 , publisher=

  33. [33]

    Journal of the American Statistical Association , volume=

    Causal inference with general treatment regimes: Generalizing the propensity score , author=. Journal of the American Statistical Association , volume=. 2004 , publisher=

  34. [34]

    arXiv preprint arXiv:1702.02896 , year=

    Efficient policy learning , author=. arXiv preprint arXiv:1702.02896 , year=

  35. [35]

    Adaptive treatment strategies in practice: Planning trials and analyzing data for personalized medicine , pages=

    Chapter 15: Estimation of dynamic treatment regimes for complex outcomes: balancing benefits and risks , author=. Adaptive treatment strategies in practice: Planning trials and analyzing data for personalized medicine , pages=. 2015 , publisher=

  36. [36]

    Journal of the American Statistical Association , volume=

    Interactive Q-learning for quantiles , author=. Journal of the American Statistical Association , volume=. 2017 , publisher=

  37. [37]

    Biometrika , volume=

    Interactive model building for Q-learning , author=. Biometrika , volume=. 2014 , publisher=

  38. [38]

    Annals of statistics , volume=

    Q-learning with censored data , author=. Annals of statistics , volume=

  39. [39]

    Electronic journal of statistics , volume=

    Tree based weighted learning for estimating individualized treatment rules with censored data , author=. Electronic journal of statistics , volume=

  40. [40]

    Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=

    On estimation of optimal treatment regimes for maximizing t-year survival probability , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2017 , publisher=

  41. [41]

    Lifetime data analysis , volume=

    Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective , author=. Lifetime data analysis , volume=. 2017 , publisher=

  42. [42]

    Journal of the American Statistical Association , volume=

    Transformation-invariant learning of optimal individualized decision rules with time-to-event outcomes , author=. Journal of the American Statistical Association , volume=. 2023 , publisher=

  43. [43]

    arXiv preprint arXiv:2502.10658 , year=

    C-learning in estimation of optimal individualized treatment regimes for recurrent disease , author=. arXiv preprint arXiv:2502.10658 , year=

  44. [44]

    Journal of the American Statistical Association , volume=

    A semiparametric instrumental variable approach to optimal treatment regimes under endogeneity , author=. Journal of the American Statistical Association , volume=. 2021 , publisher=

  45. [45]

    Journal of the American Statistical Association , volume=

    Proximal learning for individualized treatment regimes under unmeasured confounding , author=. Journal of the American Statistical Association , volume=. 2024 , publisher=

  46. [46]

    Journal of the American Statistical Association , volume=

    Selecting and ranking individualized treatment rules with unmeasured confounding , author=. Journal of the American Statistical Association , volume=. 2021 , publisher=

  47. [47]

    2007 , publisher=

    The statistical analysis of recurrent events , author=. 2007 , publisher=

  48. [48]

    Biometrika , volume=

    A consistent estimator for the distribution of quality adjusted survival time , author=. Biometrika , volume=. 1997 , publisher=

  49. [49]

    Biometrics , pages=

    Estimating medical costs from incomplete follow-up data , author=. Biometrics , pages=. 1997 , publisher=

  50. [50]

    Biometrics , volume=

    Nonparametric analysis of recurrent events and death , author=. Biometrics , volume=. 2000 , publisher=

  51. [51]

    Journal of the American Statistical Association , volume=

    Robust estimation of mean functions and treatment effects for recurrent events under event-dependent censoring and termination: application to skeletal complications in cancer metastatic to bone , author=. Journal of the American Statistical Association , volume=. 2009 , publisher=

  52. [52]

    Lifetime data analysis , volume=

    Estimating treatment effects on the marginal recurrent event mean in the presence of a terminating event , author=. Lifetime data analysis , volume=. 2010 , publisher=

  53. [53]

    Journal of the American Statistical Association , volume=

    Estimating the mean of an increasing stochastic process at a censored stopping time , author=. Journal of the American Statistical Association , volume=. 2000 , publisher=

  54. [54]

    Janvin et al

    Causal inference with recurrent and competing events: M. Janvin et al. , author=. Lifetime data analysis , volume=. 2024 , publisher=

  55. [55]

    Statistics in medicine , volume=

    Doubly robust estimation and causal inference for recurrent event data , author=. Statistics in medicine , volume=. 2020 , publisher=

  56. [56]

    arXiv preprint arXiv:2306.16571 , year=

    Causal inference for the expected number of recurrent events in the presence of a terminal event , author=. arXiv preprint arXiv:2306.16571 , year=

  57. [57]

    1995 , publisher=

    Identification and estimation of local average treatment effects , author=. 1995 , publisher=

  58. [58]

    Journal of the American statistical Association , volume=

    Identification of causal effects using instrumental variables , author=. Journal of the American statistical Association , volume=. 1996 , publisher=

  59. [59]

    Journal of the American Statistical Association , volume=

    Regression and weighting methods for causal inference using instrumental variables , author=. Journal of the American Statistical Association , volume=. 2006 , publisher=

  60. [60]

    Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=

    Doubly robust estimation of the local average treatment effect curve , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2015 , publisher=

  61. [61]

    Epidemiology , volume=

    Instruments for causal inference: an epidemiologist's dream? , author=. Epidemiology , volume=. 2006 , publisher=

  62. [62]

    Political Analysis , volume=

    Beyond LATE: Estimation of the average treatment effect with an instrumental variable , author=. Political Analysis , volume=. 2013 , publisher=

  63. [63]

    1993 , publisher=

    Minimum wages and employment: A case study of the fast food industry in New Jersey and Pennsylvania , author=. 1993 , publisher=

  64. [64]

    The review of economic studies , volume=

    Semiparametric difference-in-differences estimators , author=. The review of economic studies , volume=. 2005 , publisher=

  65. [65]

    Journal of econometrics , volume=

    Doubly robust difference-in-differences estimators , author=. Journal of econometrics , volume=. 2020 , publisher=

  66. [66]

    Foundations and trends in econometrics , volume=

    The estimation of causal effects by difference-in-difference methods , author=. Foundations and trends in econometrics , volume=. 2011 , publisher=

  67. [67]

    Journal of econometrics , volume=

    What’s trending in difference-in-differences? A synthesis of the recent econometrics literature , author=. Journal of econometrics , volume=. 2023 , publisher=

  68. [68]

    Econometrica , volume=

    Identification and inference in nonlinear difference-in-differences models , author=. Econometrica , volume=. 2006 , publisher=

  69. [69]

    Journal of the American Statistical Association , volume=

    A negative correlation strategy for bracketing in difference-in-differences , author=. Journal of the American Statistical Association , volume=. 2024 , publisher=

  70. [70]

    arXiv preprint arXiv:1901.01869 , year=

    Patterns of effects and sensitivity analysis for differences-in-differences , author=. arXiv preprint arXiv:1901.01869 , year=

  71. [71]

    Statistical science: a review journal of the Institute of Mathematical Statistics , volume=

    On negative outcome control of unobserved confounding as a generalization of difference-in-differences , author=. Statistical science: a review journal of the Institute of Mathematical Statistics , volume=

  72. [72]

    ARXIV , year=

    Semiparametric bespoke instrumental variables , author=. ARXIV , year=

  73. [73]

    arXiv e-prints , pages=

    A universal difference-in-differences approach for causal inference , author=. arXiv e-prints , pages=

  74. [74]

    Biometrics , volume=

    Instrumented difference-in-differences , author=. Biometrics , volume=. 2023 , publisher=

  75. [75]

    Electronic Journal of Statistics , volume=

    Structural mean models for instrumented difference-in-differences , author=. Electronic Journal of Statistics , volume=. 2024 , publisher=

  76. [76]

    Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=

    Estimating optimal treatment rules with an instrumental variable: A partial identification learning approach , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2021 , publisher=

  77. [77]

    Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=

    Bounded, efficient and multiply robust estimation of average treatment effects using instrumental variables , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2018 , publisher=

  78. [78]

    Journal of Economic perspectives , volume=

    The state of applied econometrics: Causality and policy evaluation , author=. Journal of Economic perspectives , volume=. 2017 , publisher=

  79. [79]

    Epidemiology , volume=

    Marginal structural models and causal inference in epidemiology , author=. Epidemiology , volume=. 2000 , publisher=

  80. [80]

    The annals of statistics , pages=

    A survey of product-integration with a view toward application in survival analysis , author=. The annals of statistics , pages=. 1990 , publisher=

Showing first 80 references.