On causal inference with marked point process data
Pith reviewed 2026-05-10 14:22 UTC · model grok-4.3
The pith
Dynamic treatment regimes for marked point process data are identified using martingale analogues of consistency, exchangeability, and positivity.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We define dynamic treatment regimes and associated potential outcomes for data described by marked point processes (MPPs). These definitions motivate MPP analogues of the commonly used consistency, exchangeability, and positivity conditions that are sufficient for identifying effects in MPP data structures. The conditions are formulated based on martingale theory, which allows us to derive explicit identifying assumptions for data described by stochastic processes. The definitions and conditions align with well-established discrete-time results in important special cases. After formulating a set of identification conditions, we derive and characterize marginal g-formulas.
What carries the argument
Martingale-theoretic analogues of the consistency, exchangeability, and positivity conditions for potential outcomes under dynamic treatment regimes in marked point processes.
Load-bearing premise
The martingale-theoretic formulation of the consistency, exchangeability, and positivity conditions is sufficient to identify causal effects for the defined potential outcomes in continuous-time marked point process data.
What would settle it
A data-generating process satisfying the martingale versions of consistency, exchangeability, and positivity yet producing a g-formula value that differs from the true marginal distribution of the potential outcome under the regime would disprove the identification result.
read the original abstract
We define dynamic treatment regimes and associated potential outcomes for data described by marked point processes (MPPs). These definitions motivate MPP analogues of the commonly used consistency, exchangeability, and positivity conditions that are sufficient for identifying effects in MPP data structures. The conditions are formulated based on martingale theory, which allows us to derive explicit identifying assumptions for data described by stochastic processes. The definitions and conditions align with well-established discrete-time results in important special cases. Thus, this work bridges the large literatures on survival (event history) analysis with counting processes in continuous time and causal inference with variables in discrete-time. After formulating a set of identification conditions, we derive and characterize marginal g-formulas. The g-formulas are generally different from those studied in related works, though they coincide in important special cases. We relate our findings to previous work on causal inference with (counting) processes, the classical survival literature, and the discrete-time causal inference literature.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper defines dynamic treatment regimes and associated potential outcomes for data described by marked point processes (MPPs). It formulates martingale-theoretic analogues of the consistency, exchangeability, and positivity conditions as sufficient for identifying causal effects in MPP data structures. Marginal g-formulas are derived under these conditions; the framework is shown to align with discrete-time causal inference results in special cases while differing from some related continuous-time approaches in general, thereby bridging survival analysis with counting processes and discrete-time causal inference.
Significance. If the identification results hold, this work provides a significant extension of causal inference methods to continuous-time marked point process data. The martingale formulation supplies the necessary predictability and compensator structure to operationalize the identifying assumptions without internal inconsistencies. The explicit reduction to known discrete-time and classical survival results in special cases, combined with the derivation of generally distinct g-formulas, strengthens the contribution and offers a rigorous bridge between the survival/event-history and causal-inference literatures.
minor comments (1)
- The abstract states that the g-formulas are 'generally different from those studied in related works' yet coincide in special cases; a brief, explicit contrast (e.g., one sentence referencing the key structural difference) would improve immediate clarity for readers familiar with the cited literatures.
Simulated Author's Rebuttal
We thank the referee for the positive and accurate summary of our manuscript, as well as for the recommendation of minor revision. We are pleased that the work is viewed as providing a rigorous bridge between the marked point process literature, survival analysis, and discrete-time causal inference. The referee's assessment of the significance of the martingale-based identification conditions and g-formulas is appreciated.
Circularity Check
No significant circularity; derivation is self-contained
full rationale
The paper defines dynamic treatment regimes and potential outcomes for marked point processes, then formulates martingale-based analogues of standard identifying assumptions (consistency, exchangeability, positivity) drawn from established discrete-time causal inference and survival analysis. It derives marginal g-formulas that are shown to coincide with known results in special cases, without any step where a fitted parameter is relabeled as a prediction, a result is defined in terms of itself, or a load-bearing uniqueness claim reduces to an unverified self-citation. The central identification results rest on external martingale theory and prior non-overlapping literature rather than internal construction.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Martingale theory supplies the correct framework for stating identifying assumptions for data generated by marked point processes.
- domain assumption The defined potential outcomes and dynamic regimes coincide with standard discrete-time definitions in important special cases.
Reference graph
Works this paper leans on
-
[1]
Odd Aalen, Ørnulf Borgan, and Hakon Gjessing.Survival and Event His- tory Analysis: A Process Point of View (Statistics for Biology and Health). Springer, 2008
work page 2008
-
[2]
Per Andersen, Ørnulf Borgan, Richard Gill, and Niels Keiding.Statistical Models Based on Counting Processes. Springer Series in Statistics. Springer- Verlag, New York, 1993
work page 1993
-
[3]
Ryan M. Andrews and Vanessa Didelez. Insights into the cross-world indepen- dence assumption of causal mediation analysis.Epidemiology, 32(2):209–219, December 2020
work page 2020
-
[4]
Pierre Br´ emaud.Point Processes and Queues. Springer Series in Statistics. Springer, New York, NY, 1981 edition, September 1981. 33
work page 1981
-
[5]
Probability and Its Applications
Samuel N Cohen and Robert J Elliott.Stochastic Calculus and Applications. Probability and Its Applications. Springer, New York, NY, 2 edition, November 2015
work page 2015
-
[6]
Daniel Commenges and Anne G´ egout-Petit. A general dynamical statistical model with causal interpretation.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 71(3):719–736, 2009
work page 2009
-
[7]
Richard J. Cook and Jerald F. Lawless.The Statistical Analysis of Recurrent Events. Springer New York, 2007
work page 2007
-
[8]
A. Philip Dawid. Decision-theoretic foundations for statistical causality.Jour- nal of Causal Inference, 9(1):39–77, January 2021
work page 2021
-
[9]
Philip Dawid and Vanessa Didelez
A. Philip Dawid and Vanessa Didelez. Identifying the consequences of dy- namic treatment strategies: A decision-theoretic overview.Statistics Surveys, 4(none), January 2010
work page 2010
-
[10]
Vanessa Didelez. Graphical models for marked point processes based on local independence.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 70(2):245–264, February 2008
work page 2008
-
[11]
Thomas R. Fleming and David P. Harrington.Counting Processes and Survival Analysis. John Wiley & Sons, Inc., September 2005
work page 2005
-
[12]
Gill.Lectures on survival analysis, page 115–241
Richard D. Gill.Lectures on survival analysis, page 115–241. Springer Berlin Heidelberg, 1994
work page 1994
-
[13]
Richard D. Gill and Soren Johansen. A survey of product-integration with a view toward application in survival analysis.The Annals of Statistics, 18(4):1129–1170, December 1990
work page 1990
-
[14]
Richard D. Gill and James M. Robins. Causal inference for complex longitu- dinal data: The continuous case.The Annals of Statistics, 29(6), December 2001
work page 2001
-
[15]
Richard D. Gill and James M. Robins. Causal inference for complex longi- tudinal data: The continuous time g-computation formula.arXiv preprint, 2004
work page 2004
-
[16]
CRC Press, Boca Raton, FL, September 1992
Sheng-Wu He, Jia-Gang Wang, and Jia-An Yan.Semimartingale theory and stochastic calculus. CRC Press, Boca Raton, FL, September 1992
work page 1992
-
[17]
M. A. Hern´ an, B. Brumback, and J. M. Robins. Marginal structural models to estimate the causal effect of zidovudine on the survival of hiv-positive men, 2000
work page 2000
-
[18]
Miguel A. Hern´ an and James M. Robins.Causal Inference. CRC Press, Boca Raton, FL, February 2021
work page 2021
- [19]
-
[20]
Point process theory and applications marked point and piecewise deterministic processes
Martin Jacobsen. Point process theory and applications marked point and piecewise deterministic processes. InProbability and Its Applications, pages 3–7. Birkh¨ auser-Verlag, Boston, 2006
work page 2006
-
[21]
Jean Jacod. Multivariate point processes: Predictable projection, radon- nikodym derivatives, representation of martingales.Probability Theory and Related Fields, 1975
work page 1975
-
[22]
Jean Jacod and Albert N. Shiryaev.Limit Theorems for Stochastic Processes, volume 288 ofGrundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin, second edition, 34 2003
work page 2003
-
[23]
Brent A. Johnson and Anastasios A. Tsiatis. Semiparametric inference in observational duration-response studies, with duration possibly right-censored. Biometrika, 92(3):605–618, 2005
work page 2005
-
[24]
Springer International Publishing, 2021
Olav Kallenberg.Foundations of Modern Probability. Springer International Publishing, 2021
work page 2021
-
[25]
Kosorok.Introduction to Empirical Processes and Semiparametric Inference
Michael R. Kosorok.Introduction to Empirical Processes and Semiparametric Inference. Springer New York, 2008
work page 2008
-
[26]
Probability and Its Applications
G¨ unter Last and Andreas Brandt.Marked point processes on the real line. Probability and Its Applications. Springer, New York, NY, 1995 edition, Au- gust 1995
work page 1995
-
[27]
Lok.Statistical Modelling of Causal Effects in Time
Judith J. Lok.Statistical Modelling of Causal Effects in Time. PhD the- sis, Vrije Universiteit Amsterdam, 2001. Naam instelling promotie: VU Vrije Universiteit Naam instelling onderzoek: VU Vrije Universiteit
work page 2001
-
[28]
Judith J. Lok. Statistical modeling of causal effects in continuous time.The Annals of Statistics, 36(3), June 2008
work page 2008
-
[29]
Judith J. Lok, Richard D. Gill, Aad van der Vaart, and James M. Robins. Estimating the causal effect of a time-varying treatment on time-to-event using structural nested failure time models.Statistica Neerlandica, 58(3):271–295, August 2004
work page 2004
-
[30]
Werner Nagel and Rolf Steyer.Probability and Conditional Expectation: Fun- damentals for the Empirical Sciences. Wiley, March 2017
work page 2017
-
[31]
Johan Sebastial Ohlendorff, Kjetil Røysland, Anders Munch, and Thomas Gerds. Identification and estimation of causal effects under treatment-assigned- at-visit interventions in continuous time.In preparation, page 823–843, August 2026
work page 2026
-
[32]
Cambridge Univer- sity Press, New York, NY, USA, 2nd edition, 2009
Judea Pearl.Causality: Models, Reasoning and Inference. Cambridge Univer- sity Press, New York, NY, USA, 2nd edition, 2009
work page 2009
-
[33]
Protter.Stochastic Integration and Differential Equations
P. Protter.Stochastic Integration and Differential Equations. Springer, 2005
work page 2005
-
[34]
Thomas S. Richardson and James M. Robins. Single World Interven- tion Graphs (SWIGs): A Unification of the Counterfactual and Graph- ical Approaches to Causality. Technical Report 128, Center for Statis- tics and the Social Sciences, University of Washington, 2013. Avail- able online at: https://csss.uw.edu/research/working-papers/single-world- interventi...
work page 2013
-
[35]
J. M. Robins.Marginal Structural Models versus Structural Nested Models as Tools for Causal Inference, pages 95–133. Springer New York, 2000
work page 2000
-
[36]
J. M. Robins and A. Rotnitzky.Recovery of Information and Adjustment for Dependent Censoring Using Surrogate Markers, pages 297–331. Birkh¨ auser Boston, 1992
work page 1992
-
[37]
James Robins. A new approach to causal inference in mortality studies with sustained exposure periods – application to control of the healthy worker sur- vivor effect.Mathematical Modeling, 7(9):1393–1512, 1986
work page 1986
-
[38]
James M. Robins. Addendum to “a new approach to causal inference in mor- tality studies with a sustained exposure period—application to control of the healthy worker survivor effect”.Computers & Mathematics with Applications, 14(9-12):923–945, 1987. 35
work page 1987
-
[39]
James M. Robins. The analysis of randomized and nonrandomized aids treat- ment trials using a new approach to causal inference in longitudinal studies. In Lee Sechrest, Howard Freeman, and Albert Mulley, editors,Health Service Research Methodology: A Focus on AIDS, page 113–159. U.S. Public Health Service, National Center for Health Services Research, Was...
work page 1989
-
[40]
James M. Robins. Causal inference from complex longitudinal data. In Maia Berkane, editor,Latent Variable Modeling and Applications to Causality, pages 69–117, New York, NY, 1997. Springer New York
work page 1997
-
[41]
James M. Robins. Optimal structural nested models for optimal sequential decisions. InProceedings of the Second Seattle Symposium in Biostatistics, pages 189–326. Springer, 2004
work page 2004
-
[42]
James M. Robins and Dianne M. Finkelstein. Correcting for noncompliance and dependent censoring in an aids clinical trial with inverse probability of cen- soring weighted (ipcw) log-rank tests.Biometrics, 56(3):779–788, September 2000
work page 2000
-
[43]
James M Robins, Miguel A Hern´ an, and Uwe Siebert. Effects of multiple interventions.Comparative quantification of health risks: global and regional burden of disease attributable to selected major risk factors, 1:2191–2230, 2004
work page 2004
-
[44]
James M. Robins and Thomas S. Richardson. Alternative graphical causal models and the identification of direct effects. InCausality and Psychopathol- ogy: Finding the Determinants of Disorders and their Cures. Oxford University Press, February 2011
work page 2011
-
[45]
P. R. Rosenbaum and D. B. Rubin. The central role of the propensity score in observational studies for causal effects.Biometrika, 70(1):41–55, 1983
work page 1983
-
[46]
D. B. Rubin. Estimating causal effects of treatments in randomized and non- randomized studies.Journal of Educational Psychology, 66(5):688–701, 1974
work page 1974
-
[47]
P˚ al Ryalen, Mats Stensrud, Sophie Foss˚ a, and Kjetil Røysland. Causal infer- ence in continuous time: An example on prostate cancer therapy.Biostatistics, 19(4):600–613, 2018
work page 2018
-
[48]
P˚ al C. Ryalen, Mats J. Stensrud, and Kjetil Røysland. The additive hazard es- timator is consistent for continuous-time marginal structural models.Lifetime Data Analysis, 25(4):611–638, February 2019
work page 2019
-
[49]
Helene C. Rytgaard, Thomas A. Gerds, and Mark J. van der Laan. Continuous- time targeted minimum loss-based estimation of intervention-specific mean outcomes.The Annals of Statistics, 50(5), October 2022
work page 2022
-
[50]
A martingale approach to continuous-time marginal struc- tural models.Bernoulli, 2011
Kjetil Røysland. A martingale approach to continuous-time marginal struc- tural models.Bernoulli, 2011
work page 2011
-
[51]
Kjetil Røysland. Counterfactual analyses with graphical models based on local independence.The Annals of Statistics, 40(4):2162–2194, 2012
work page 2012
-
[52]
Kjetil Røysland, P˚ al Ryalen, Mari Nyg˚ ard, and Vanessa Didelez. Graphical criteria for the identification of marginal causal effects in continuous-time sur- vival and event-history analyses, 2022
work page 2022
-
[53]
Aaron L Sarvet and Mats J Stensrud. The natural value of treatment and its importance for causal inference.Annual Review of Statistics and Its Applica- tion, 13, 2025
work page 2025
-
[54]
Satten, Somnath Datta, and James Robins
Glen A. Satten, Somnath Datta, and James Robins. Estimating the marginal survival function in the presence of time dependent covariates.Statistics & Probability Letters, 54(4):397–403, October 2001. 36
work page 2001
-
[55]
Scheike and Torben Martinussen.Dynamic Regression Models for Survival Data
Thomas H. Scheike and Torben Martinussen.Dynamic Regression Models for Survival Data. Springer, NY, 2006
work page 2006
-
[56]
Albert N. Shiryaev and Jan Kallsen. The cumulant process and Esscher’s change of measure.Finance and Stochastics, 6(4):397–428, 2002
work page 2002
-
[57]
Alexander Sokol and Niels Richard Hansen. Exponential martingales and changes of measure for counting processes.Stochastic Analysis and Appli- cations, 33(5):823–843, August 2015
work page 2015
- [58]
-
[59]
A. W. van der Vaart.Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 1998
work page 1998
-
[60]
Aad W. van der Vaart. On robins’ formula.Statistics & Decisions. International Mathematical Journal for Stochastic Methods and Models, 22(3):171–200, 2004
work page 2004
-
[61]
Cambridge University Press, February 1991
David Williams.Probability with Martingales. Cambridge University Press, February 1991
work page 1991
-
[62]
Shu Yang. Semiparametric estimation of structural nested mean models with irregularly spaced longitudinal observations.Biometrics, 78(3):937–949, April 2021
work page 2021
-
[63]
Andrew Ying. Causality for complex continuous-time functional longitudinal studies with dynamic treatment regimes, 2024
work page 2024
-
[64]
Causality for functional longitudinal data
Andrew Ying. Causality for functional longitudinal data. In Francesco Lo- catello and Vanessa Didelez, editors,Proceedings of the Third Conference on Causal Learning and Reasoning, volume 236 ofProceedings of Machine Learn- ing Research, page 665–687. PMLR, April 1–3 2024
work page 2024
-
[65]
Jessica G. Young. Story-led causal inference.Epidemiology, 35(3):289–294, April 2024
work page 2024
-
[66]
Jessica G Young, Miguel A Hern´ an, and James M Robins. Identification, estimation and approximation of risk under interventions that depend on the natural value of treatment using observational data.Epidemiologic Methods, 3(1):1–19, Jan 2014
work page 2014
-
[67]
Zhengguo Yu and Mark J. van der Laan. Construction of counterfactuals and the g-computation formula. department of biostatistics.University of California, 2002
work page 2002
-
[68]
Mingyuan Zhang, Marshall M. Joffe, and Dylan S. Small. Causal inference for continuous-time processes when covariates are observed only at discrete times. The Annals of Statistics, 39(1), February 2011. Appendix Overview Appendix A reviews concepts from stochastic process theory and describes the canonical space of point process realizations. Appendix B p...
work page 2011
-
[69]
= ( ˜T ′ 1, ˜X ′ 1).(66) Suppose then thatτ ′J > T ′ 1, and note thatC 1 = Ω′ by Algorithm 1. There are two cases to check: •Case I: The first observed event is a treatment event (of some typej∈J), and the regime is followed.By Algorithm 1, the first observed event is a treatment event if, for somej∈J,T j 1 =∧ h∈J T h 1 ∧T \J 1 . The regime is followed wh...
-
[70]
by [33, II 8 Theorem 37]), Wt = I(τ a > t) P0<s≤t(1−Λ as)
In particular, it is a well-defined semimartingale, and (11) is well-defined, with the solution given in Proposition 2, (e.g. by [33, II 8 Theorem 37]), Wt = I(τ a > t) P0<s≤t(1−Λ as) . This is also a local martingale, as highlighted in Lemma 7. Since the only contri- bution to the product are at the times{θ k}, which are discontinuity points of Λ a, we g...
-
[71]
Using these properties, the proof of (42) follow the lines of the proof of Theorem 1
Specifically, under the conditions (36), and (i), (ii) and (iii) of Theorem 3, the processE(K J) satisfies properties parallel to those established forW=E(K) in Lemma 7 and Lemma 8, withK J,N J,Λ J, andτ J playing the roles ofK a,N a,Λ a, andτ a, respectively. Using these properties, the proof of (42) follow the lines of the proof of Theorem 1. As in Lemm...
-
[72]
□ AppendixF.Notation General mathematical notation
The derivation of the distributional representation on the right-hand side of (43) follows step for step the proof of Lemma 1 and is therefore omitted. □ AppendixF.Notation General mathematical notation. (Ω,F, P) The underlying abstract probability space wherePis a fixed, but arbitrary, probability measure x∧y,x∨yThe minimum ofxandy, and the maximum ofxan...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.