pith. sign in

arxiv: 2604.12977 · v1 · submitted 2026-04-14 · 📊 stat.ME

On causal inference with marked point process data

Pith reviewed 2026-05-10 14:22 UTC · model grok-4.3

classification 📊 stat.ME
keywords causal inferencemarked point processesdynamic treatment regimesmartingale theoryg-formulasurvival analysiscounting processespotential outcomes
0
0 comments X

The pith

Dynamic treatment regimes for marked point process data are identified using martingale analogues of consistency, exchangeability, and positivity.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper defines dynamic treatment regimes and their associated potential outcomes when data arrives as marked point processes that record timed events and marks in continuous time. It develops counterparts to the familiar consistency, exchangeability, and positivity assumptions, but expressed through martingale theory so that they apply directly to the stochastic-process structure of the data. These conditions suffice to identify causal effects and yield marginal g-formulas that recover the usual discrete-time expressions in special cases yet differ in general continuous-time settings. The construction links the counting-process methods of survival analysis to discrete-time causal inference. Readers working with irregular event data would therefore gain a principled way to estimate effects of time-varying interventions without forcing the data into fixed-time grids.

Core claim

We define dynamic treatment regimes and associated potential outcomes for data described by marked point processes (MPPs). These definitions motivate MPP analogues of the commonly used consistency, exchangeability, and positivity conditions that are sufficient for identifying effects in MPP data structures. The conditions are formulated based on martingale theory, which allows us to derive explicit identifying assumptions for data described by stochastic processes. The definitions and conditions align with well-established discrete-time results in important special cases. After formulating a set of identification conditions, we derive and characterize marginal g-formulas.

What carries the argument

Martingale-theoretic analogues of the consistency, exchangeability, and positivity conditions for potential outcomes under dynamic treatment regimes in marked point processes.

Load-bearing premise

The martingale-theoretic formulation of the consistency, exchangeability, and positivity conditions is sufficient to identify causal effects for the defined potential outcomes in continuous-time marked point process data.

What would settle it

A data-generating process satisfying the martingale versions of consistency, exchangeability, and positivity yet producing a g-formula value that differs from the true marginal distribution of the potential outcome under the regime would disprove the identification result.

read the original abstract

We define dynamic treatment regimes and associated potential outcomes for data described by marked point processes (MPPs). These definitions motivate MPP analogues of the commonly used consistency, exchangeability, and positivity conditions that are sufficient for identifying effects in MPP data structures. The conditions are formulated based on martingale theory, which allows us to derive explicit identifying assumptions for data described by stochastic processes. The definitions and conditions align with well-established discrete-time results in important special cases. Thus, this work bridges the large literatures on survival (event history) analysis with counting processes in continuous time and causal inference with variables in discrete-time. After formulating a set of identification conditions, we derive and characterize marginal g-formulas. The g-formulas are generally different from those studied in related works, though they coincide in important special cases. We relate our findings to previous work on causal inference with (counting) processes, the classical survival literature, and the discrete-time causal inference literature.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 1 minor

Summary. The paper defines dynamic treatment regimes and associated potential outcomes for data described by marked point processes (MPPs). It formulates martingale-theoretic analogues of the consistency, exchangeability, and positivity conditions as sufficient for identifying causal effects in MPP data structures. Marginal g-formulas are derived under these conditions; the framework is shown to align with discrete-time causal inference results in special cases while differing from some related continuous-time approaches in general, thereby bridging survival analysis with counting processes and discrete-time causal inference.

Significance. If the identification results hold, this work provides a significant extension of causal inference methods to continuous-time marked point process data. The martingale formulation supplies the necessary predictability and compensator structure to operationalize the identifying assumptions without internal inconsistencies. The explicit reduction to known discrete-time and classical survival results in special cases, combined with the derivation of generally distinct g-formulas, strengthens the contribution and offers a rigorous bridge between the survival/event-history and causal-inference literatures.

minor comments (1)
  1. The abstract states that the g-formulas are 'generally different from those studied in related works' yet coincide in special cases; a brief, explicit contrast (e.g., one sentence referencing the key structural difference) would improve immediate clarity for readers familiar with the cited literatures.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive and accurate summary of our manuscript, as well as for the recommendation of minor revision. We are pleased that the work is viewed as providing a rigorous bridge between the marked point process literature, survival analysis, and discrete-time causal inference. The referee's assessment of the significance of the martingale-based identification conditions and g-formulas is appreciated.

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained

full rationale

The paper defines dynamic treatment regimes and potential outcomes for marked point processes, then formulates martingale-based analogues of standard identifying assumptions (consistency, exchangeability, positivity) drawn from established discrete-time causal inference and survival analysis. It derives marginal g-formulas that are shown to coincide with known results in special cases, without any step where a fitted parameter is relabeled as a prediction, a result is defined in terms of itself, or a load-bearing uniqueness claim reduces to an unverified self-citation. The central identification results rest on external martingale theory and prior non-overlapping literature rather than internal construction.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claims rest on standard causal assumptions adapted to stochastic processes and on martingale theory; no free parameters or new entities are introduced in the abstract.

axioms (2)
  • domain assumption Martingale theory supplies the correct framework for stating identifying assumptions for data generated by marked point processes.
    Invoked to derive explicit versions of consistency, exchangeability, and positivity for continuous-time processes.
  • domain assumption The defined potential outcomes and dynamic regimes coincide with standard discrete-time definitions in important special cases.
    Used to claim alignment with existing causal inference results.

pith-pipeline@v0.9.0 · 5470 in / 1211 out tokens · 31286 ms · 2026-05-10T14:22:45.611458+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

72 extracted references · 72 canonical work pages

  1. [1]

    Springer, 2008

    Odd Aalen, Ørnulf Borgan, and Hakon Gjessing.Survival and Event His- tory Analysis: A Process Point of View (Statistics for Biology and Health). Springer, 2008

  2. [2]

    Springer Series in Statistics

    Per Andersen, Ørnulf Borgan, Richard Gill, and Niels Keiding.Statistical Models Based on Counting Processes. Springer Series in Statistics. Springer- Verlag, New York, 1993

  3. [3]

    Andrews and Vanessa Didelez

    Ryan M. Andrews and Vanessa Didelez. Insights into the cross-world indepen- dence assumption of causal mediation analysis.Epidemiology, 32(2):209–219, December 2020

  4. [4]

    Springer Series in Statistics

    Pierre Br´ emaud.Point Processes and Queues. Springer Series in Statistics. Springer, New York, NY, 1981 edition, September 1981. 33

  5. [5]

    Probability and Its Applications

    Samuel N Cohen and Robert J Elliott.Stochastic Calculus and Applications. Probability and Its Applications. Springer, New York, NY, 2 edition, November 2015

  6. [6]

    A general dynamical statistical model with causal interpretation.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 71(3):719–736, 2009

    Daniel Commenges and Anne G´ egout-Petit. A general dynamical statistical model with causal interpretation.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 71(3):719–736, 2009

  7. [7]

    Cook and Jerald F

    Richard J. Cook and Jerald F. Lawless.The Statistical Analysis of Recurrent Events. Springer New York, 2007

  8. [8]

    Philip Dawid

    A. Philip Dawid. Decision-theoretic foundations for statistical causality.Jour- nal of Causal Inference, 9(1):39–77, January 2021

  9. [9]

    Philip Dawid and Vanessa Didelez

    A. Philip Dawid and Vanessa Didelez. Identifying the consequences of dy- namic treatment strategies: A decision-theoretic overview.Statistics Surveys, 4(none), January 2010

  10. [10]

    Graphical models for marked point processes based on local independence.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 70(2):245–264, February 2008

    Vanessa Didelez. Graphical models for marked point processes based on local independence.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 70(2):245–264, February 2008

  11. [11]

    Fleming and David P

    Thomas R. Fleming and David P. Harrington.Counting Processes and Survival Analysis. John Wiley & Sons, Inc., September 2005

  12. [12]

    Gill.Lectures on survival analysis, page 115–241

    Richard D. Gill.Lectures on survival analysis, page 115–241. Springer Berlin Heidelberg, 1994

  13. [13]

    Gill and Soren Johansen

    Richard D. Gill and Soren Johansen. A survey of product-integration with a view toward application in survival analysis.The Annals of Statistics, 18(4):1129–1170, December 1990

  14. [14]

    Gill and James M

    Richard D. Gill and James M. Robins. Causal inference for complex longitu- dinal data: The continuous case.The Annals of Statistics, 29(6), December 2001

  15. [15]

    Gill and James M

    Richard D. Gill and James M. Robins. Causal inference for complex longi- tudinal data: The continuous time g-computation formula.arXiv preprint, 2004

  16. [16]

    CRC Press, Boca Raton, FL, September 1992

    Sheng-Wu He, Jia-Gang Wang, and Jia-An Yan.Semimartingale theory and stochastic calculus. CRC Press, Boca Raton, FL, September 1992

  17. [17]

    M. A. Hern´ an, B. Brumback, and J. M. Robins. Marginal structural models to estimate the causal effect of zidovudine on the survival of hiv-positive men, 2000

  18. [18]

    Hern´ an and James M

    Miguel A. Hern´ an and James M. Robins.Causal Inference. CRC Press, Boca Raton, FL, February 2021

  19. [19]

    Hu and J

    L. Hu and J. W. Hogan. Causal comparative effectiveness analysis of dynamic continuous-time treatment initiation rules with sparsely measured outcomes and death.Biometrics, 75(2):695–707, 2019

  20. [20]

    Point process theory and applications marked point and piecewise deterministic processes

    Martin Jacobsen. Point process theory and applications marked point and piecewise deterministic processes. InProbability and Its Applications, pages 3–7. Birkh¨ auser-Verlag, Boston, 2006

  21. [21]

    Multivariate point processes: Predictable projection, radon- nikodym derivatives, representation of martingales.Probability Theory and Related Fields, 1975

    Jean Jacod. Multivariate point processes: Predictable projection, radon- nikodym derivatives, representation of martingales.Probability Theory and Related Fields, 1975

  22. [22]

    Shiryaev.Limit Theorems for Stochastic Processes, volume 288 ofGrundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]

    Jean Jacod and Albert N. Shiryaev.Limit Theorems for Stochastic Processes, volume 288 ofGrundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin, second edition, 34 2003

  23. [23]

    Johnson and Anastasios A

    Brent A. Johnson and Anastasios A. Tsiatis. Semiparametric inference in observational duration-response studies, with duration possibly right-censored. Biometrika, 92(3):605–618, 2005

  24. [24]

    Springer International Publishing, 2021

    Olav Kallenberg.Foundations of Modern Probability. Springer International Publishing, 2021

  25. [25]

    Kosorok.Introduction to Empirical Processes and Semiparametric Inference

    Michael R. Kosorok.Introduction to Empirical Processes and Semiparametric Inference. Springer New York, 2008

  26. [26]

    Probability and Its Applications

    G¨ unter Last and Andreas Brandt.Marked point processes on the real line. Probability and Its Applications. Springer, New York, NY, 1995 edition, Au- gust 1995

  27. [27]

    Lok.Statistical Modelling of Causal Effects in Time

    Judith J. Lok.Statistical Modelling of Causal Effects in Time. PhD the- sis, Vrije Universiteit Amsterdam, 2001. Naam instelling promotie: VU Vrije Universiteit Naam instelling onderzoek: VU Vrije Universiteit

  28. [28]

    Judith J. Lok. Statistical modeling of causal effects in continuous time.The Annals of Statistics, 36(3), June 2008

  29. [29]

    Lok, Richard D

    Judith J. Lok, Richard D. Gill, Aad van der Vaart, and James M. Robins. Estimating the causal effect of a time-varying treatment on time-to-event using structural nested failure time models.Statistica Neerlandica, 58(3):271–295, August 2004

  30. [30]

    Wiley, March 2017

    Werner Nagel and Rolf Steyer.Probability and Conditional Expectation: Fun- damentals for the Empirical Sciences. Wiley, March 2017

  31. [31]

    Identification and estimation of causal effects under treatment-assigned- at-visit interventions in continuous time.In preparation, page 823–843, August 2026

    Johan Sebastial Ohlendorff, Kjetil Røysland, Anders Munch, and Thomas Gerds. Identification and estimation of causal effects under treatment-assigned- at-visit interventions in continuous time.In preparation, page 823–843, August 2026

  32. [32]

    Cambridge Univer- sity Press, New York, NY, USA, 2nd edition, 2009

    Judea Pearl.Causality: Models, Reasoning and Inference. Cambridge Univer- sity Press, New York, NY, USA, 2nd edition, 2009

  33. [33]

    Protter.Stochastic Integration and Differential Equations

    P. Protter.Stochastic Integration and Differential Equations. Springer, 2005

  34. [34]

    Richardson and James M

    Thomas S. Richardson and James M. Robins. Single World Interven- tion Graphs (SWIGs): A Unification of the Counterfactual and Graph- ical Approaches to Causality. Technical Report 128, Center for Statis- tics and the Social Sciences, University of Washington, 2013. Avail- able online at: https://csss.uw.edu/research/working-papers/single-world- interventi...

  35. [35]

    J. M. Robins.Marginal Structural Models versus Structural Nested Models as Tools for Causal Inference, pages 95–133. Springer New York, 2000

  36. [36]

    J. M. Robins and A. Rotnitzky.Recovery of Information and Adjustment for Dependent Censoring Using Surrogate Markers, pages 297–331. Birkh¨ auser Boston, 1992

  37. [37]

    James Robins. A new approach to causal inference in mortality studies with sustained exposure periods – application to control of the healthy worker sur- vivor effect.Mathematical Modeling, 7(9):1393–1512, 1986

  38. [38]

    a new approach to causal inference in mor- tality studies with a sustained exposure period—application to control of the healthy worker survivor effect

    James M. Robins. Addendum to “a new approach to causal inference in mor- tality studies with a sustained exposure period—application to control of the healthy worker survivor effect”.Computers & Mathematics with Applications, 14(9-12):923–945, 1987. 35

  39. [39]

    James M. Robins. The analysis of randomized and nonrandomized aids treat- ment trials using a new approach to causal inference in longitudinal studies. In Lee Sechrest, Howard Freeman, and Albert Mulley, editors,Health Service Research Methodology: A Focus on AIDS, page 113–159. U.S. Public Health Service, National Center for Health Services Research, Was...

  40. [40]

    James M. Robins. Causal inference from complex longitudinal data. In Maia Berkane, editor,Latent Variable Modeling and Applications to Causality, pages 69–117, New York, NY, 1997. Springer New York

  41. [41]

    James M. Robins. Optimal structural nested models for optimal sequential decisions. InProceedings of the Second Seattle Symposium in Biostatistics, pages 189–326. Springer, 2004

  42. [42]

    Robins and Dianne M

    James M. Robins and Dianne M. Finkelstein. Correcting for noncompliance and dependent censoring in an aids clinical trial with inverse probability of cen- soring weighted (ipcw) log-rank tests.Biometrics, 56(3):779–788, September 2000

  43. [43]

    Effects of multiple interventions.Comparative quantification of health risks: global and regional burden of disease attributable to selected major risk factors, 1:2191–2230, 2004

    James M Robins, Miguel A Hern´ an, and Uwe Siebert. Effects of multiple interventions.Comparative quantification of health risks: global and regional burden of disease attributable to selected major risk factors, 1:2191–2230, 2004

  44. [44]

    Robins and Thomas S

    James M. Robins and Thomas S. Richardson. Alternative graphical causal models and the identification of direct effects. InCausality and Psychopathol- ogy: Finding the Determinants of Disorders and their Cures. Oxford University Press, February 2011

  45. [45]

    P. R. Rosenbaum and D. B. Rubin. The central role of the propensity score in observational studies for causal effects.Biometrika, 70(1):41–55, 1983

  46. [46]

    D. B. Rubin. Estimating causal effects of treatments in randomized and non- randomized studies.Journal of Educational Psychology, 66(5):688–701, 1974

  47. [47]

    Causal infer- ence in continuous time: An example on prostate cancer therapy.Biostatistics, 19(4):600–613, 2018

    P˚ al Ryalen, Mats Stensrud, Sophie Foss˚ a, and Kjetil Røysland. Causal infer- ence in continuous time: An example on prostate cancer therapy.Biostatistics, 19(4):600–613, 2018

  48. [48]

    Ryalen, Mats J

    P˚ al C. Ryalen, Mats J. Stensrud, and Kjetil Røysland. The additive hazard es- timator is consistent for continuous-time marginal structural models.Lifetime Data Analysis, 25(4):611–638, February 2019

  49. [49]

    Rytgaard, Thomas A

    Helene C. Rytgaard, Thomas A. Gerds, and Mark J. van der Laan. Continuous- time targeted minimum loss-based estimation of intervention-specific mean outcomes.The Annals of Statistics, 50(5), October 2022

  50. [50]

    A martingale approach to continuous-time marginal struc- tural models.Bernoulli, 2011

    Kjetil Røysland. A martingale approach to continuous-time marginal struc- tural models.Bernoulli, 2011

  51. [51]

    Counterfactual analyses with graphical models based on local independence.The Annals of Statistics, 40(4):2162–2194, 2012

    Kjetil Røysland. Counterfactual analyses with graphical models based on local independence.The Annals of Statistics, 40(4):2162–2194, 2012

  52. [52]

    Graphical criteria for the identification of marginal causal effects in continuous-time sur- vival and event-history analyses, 2022

    Kjetil Røysland, P˚ al Ryalen, Mari Nyg˚ ard, and Vanessa Didelez. Graphical criteria for the identification of marginal causal effects in continuous-time sur- vival and event-history analyses, 2022

  53. [53]

    The natural value of treatment and its importance for causal inference.Annual Review of Statistics and Its Applica- tion, 13, 2025

    Aaron L Sarvet and Mats J Stensrud. The natural value of treatment and its importance for causal inference.Annual Review of Statistics and Its Applica- tion, 13, 2025

  54. [54]

    Satten, Somnath Datta, and James Robins

    Glen A. Satten, Somnath Datta, and James Robins. Estimating the marginal survival function in the presence of time dependent covariates.Statistics & Probability Letters, 54(4):397–403, October 2001. 36

  55. [55]

    Scheike and Torben Martinussen.Dynamic Regression Models for Survival Data

    Thomas H. Scheike and Torben Martinussen.Dynamic Regression Models for Survival Data. Springer, NY, 2006

  56. [56]

    Shiryaev and Jan Kallsen

    Albert N. Shiryaev and Jan Kallsen. The cumulant process and Esscher’s change of measure.Finance and Stochastics, 6(4):397–428, 2002

  57. [57]

    Exponential martingales and changes of measure for counting processes.Stochastic Analysis and Appli- cations, 33(5):823–843, August 2015

    Alexander Sokol and Niels Richard Hansen. Exponential martingales and changes of measure for counting processes.Stochastic Analysis and Appli- cations, 33(5):823–843, August 2015

  58. [58]

    Crawford

    Jinghao Sun and Forrest W. Crawford. Causal identification for continuous- time stochastic processes, 2022

  59. [59]

    A. W. van der Vaart.Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 1998

  60. [60]

    van der Vaart

    Aad W. van der Vaart. On robins’ formula.Statistics & Decisions. International Mathematical Journal for Stochastic Methods and Models, 22(3):171–200, 2004

  61. [61]

    Cambridge University Press, February 1991

    David Williams.Probability with Martingales. Cambridge University Press, February 1991

  62. [62]

    Semiparametric estimation of structural nested mean models with irregularly spaced longitudinal observations.Biometrics, 78(3):937–949, April 2021

    Shu Yang. Semiparametric estimation of structural nested mean models with irregularly spaced longitudinal observations.Biometrics, 78(3):937–949, April 2021

  63. [63]

    Causality for complex continuous-time functional longitudinal studies with dynamic treatment regimes, 2024

    Andrew Ying. Causality for complex continuous-time functional longitudinal studies with dynamic treatment regimes, 2024

  64. [64]

    Causality for functional longitudinal data

    Andrew Ying. Causality for functional longitudinal data. In Francesco Lo- catello and Vanessa Didelez, editors,Proceedings of the Third Conference on Causal Learning and Reasoning, volume 236 ofProceedings of Machine Learn- ing Research, page 665–687. PMLR, April 1–3 2024

  65. [65]

    Jessica G. Young. Story-led causal inference.Epidemiology, 35(3):289–294, April 2024

  66. [66]

    Jessica G Young, Miguel A Hern´ an, and James M Robins. Identification, estimation and approximation of risk under interventions that depend on the natural value of treatment using observational data.Epidemiologic Methods, 3(1):1–19, Jan 2014

  67. [67]

    van der Laan

    Zhengguo Yu and Mark J. van der Laan. Construction of counterfactuals and the g-computation formula. department of biostatistics.University of California, 2002

  68. [68]

    Joffe, and Dylan S

    Mingyuan Zhang, Marshall M. Joffe, and Dylan S. Small. Causal inference for continuous-time processes when covariates are observed only at discrete times. The Annals of Statistics, 39(1), February 2011. Appendix Overview Appendix A reviews concepts from stochastic process theory and describes the canonical space of point process realizations. Appendix B p...

  69. [69]

    = ( ˜T ′ 1, ˜X ′ 1).(66) Suppose then thatτ ′J > T ′ 1, and note thatC 1 = Ω′ by Algorithm 1. There are two cases to check: •Case I: The first observed event is a treatment event (of some typej∈J), and the regime is followed.By Algorithm 1, the first observed event is a treatment event if, for somej∈J,T j 1 =∧ h∈J T h 1 ∧T \J 1 . The regime is followed wh...

  70. [70]

    by [33, II 8 Theorem 37]), Wt = I(τ a > t) P0<s≤t(1−Λ as)

    In particular, it is a well-defined semimartingale, and (11) is well-defined, with the solution given in Proposition 2, (e.g. by [33, II 8 Theorem 37]), Wt = I(τ a > t) P0<s≤t(1−Λ as) . This is also a local martingale, as highlighted in Lemma 7. Since the only contri- bution to the product are at the times{θ k}, which are discontinuity points of Λ a, we g...

  71. [71]

    Using these properties, the proof of (42) follow the lines of the proof of Theorem 1

    Specifically, under the conditions (36), and (i), (ii) and (iii) of Theorem 3, the processE(K J) satisfies properties parallel to those established forW=E(K) in Lemma 7 and Lemma 8, withK J,N J,Λ J, andτ J playing the roles ofK a,N a,Λ a, andτ a, respectively. Using these properties, the proof of (42) follow the lines of the proof of Theorem 1. As in Lemm...

  72. [72]

    □ AppendixF.Notation General mathematical notation

    The derivation of the distributional representation on the right-hand side of (43) follows step for step the proof of Lemma 1 and is therefore omitted. □ AppendixF.Notation General mathematical notation. (Ω,F, P) The underlying abstract probability space wherePis a fixed, but arbitrary, probability measure x∧y,x∨yThe minimum ofxandy, and the maximum ofxan...