pith. sign in

arxiv: 2412.06628 · v4 · submitted 2024-12-09 · 📊 stat.ME · math.ST· stat.TH

Partial identification of principal causal effects under violations of principal ignorability

Pith reviewed 2026-05-23 07:36 UTC · model grok-4.3

classification 📊 stat.ME math.STstat.TH
keywords principal stratificationprincipal ignorabilitypartial identificationcausal effectsprincipal strataparametric modelsassociation parameterscausal inference
0
0 comments X

The pith

Even with known principal strata distributions and correctly specified parametric outcome models, principal causal effects remain only partially identified without the principal ignorability assumption.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines the principal ignorability assumption within principal stratification for causal effects involving post-treatment variables. It demonstrates that a common strategy of jointly modeling the outcome and principal strata parametrically, without principal ignorability, produces only partial identification of the causal effects. This partial identification persists even when the joint distribution of principal strata is known and the outcome models are simple and correctly specified. Principal ignorability instead delivers point identification, while weaker assumptions can generate informative partial identification regions. The authors further establish that association parameters for the strata distribution are identifiable only under violations of principal ignorability and strong parametric constraints, with extensions to semiparametric and nonparametric Bayesian models.

Core claim

Even if the joint distribution of principal strata is known, the strategy of using parametric models to jointly model the outcome and principal strata without requiring the principal ignorability assumption necessarily leads to only partial identification of causal effects, even under very simple and correctly specified outcome models. While principal ignorability leads to point identification in this setting, alternative weaker assumptions can lead to informative partial identification regions. Association parameters that govern the joint distribution of principal strata are identifiable only if the principal ignorability assumption is violated, and only under strong parametric constraints.

What carries the argument

Principal stratification framework classifying units into strata by potential post-treatment variables, with principal ignorability as the conditional independence assumption between strata membership and outcomes given covariates, and parametric joint models for the outcome and strata.

If this is right

  • Parametric joint models without principal ignorability produce partial identification regions rather than point estimates for principal causal effects.
  • Weaker assumptions beyond principal ignorability can still yield informative bounds on the causal effects.
  • Association parameters for the joint distribution of principal strata become identifiable only when principal ignorability is violated.
  • Due to the partial identifiability of the causal effects, the association parameters themselves require strong parametric constraints to be identified.
  • The partial identification result continues to hold under semiparametric and nonparametric Bayesian extensions of the joint models.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Analyses relying on joint parametric modeling should routinely report partial identification intervals when principal ignorability cannot be justified.
  • The necessity of parametric constraints for identifying strata association parameters suggests that fully nonparametric approaches will inherit the same partial identification issue.
  • These results point toward developing sensitivity analyses that vary the strength of departure from principal ignorability while tracking changes in the identification region.
  • In practice, the findings imply that researchers may need to collect richer covariate information or auxiliary data to tighten the partial identification bounds.

Load-bearing premise

The parametric forms chosen for the joint outcome-strata model are correctly specified and impose no additional identifying restrictions beyond that structure.

What would settle it

A finite-sample or asymptotic calculation in which the width of the partial identification region for a principal causal effect fails to collapse to a single point as the sample size grows, even when the strata distribution is treated as known and the outcome model is correctly specified.

read the original abstract

Principal stratification is a general framework for studying causal mechanisms involving post-treatment variables. When estimating principal causal effects, the principal ignorability assumption is commonly invoked, which we study in detail in this manuscript. Our first key contribution is studying a commonly used strategy of using parametric models to jointly model the outcome and principal strata without requiring the principal ignorability assumption. We show that even if the joint distribution of principal strata is known, this strategy necessarily leads to only partial identification of causal effects, even under very simple and correctly specified outcome models. While principal ignorability leads to point identification in this setting, we discuss alternative, weaker assumptions and show how they can lead to informative partial identification regions. An additional contribution is that we provide theoretical support to strategies used in the literature for identifying association parameters that govern the joint distribution of principal strata. We prove that this is possible, but only if the principal ignorability assumption is violated. Additionally, due to partial identifiability of causal effects even when these association parameters are known, we show that these association parameters are only identifiable under strong parametric constraints. Lastly, we extend these results to more flexible semiparametric and nonparametric Bayesian models.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript examines violations of the principal ignorability (PI) assumption in principal stratification. It demonstrates that jointly modeling outcomes and principal strata via parametric models yields only partial identification of principal causal effects, even when the strata distribution is known and the outcome models are correctly specified and simple. PI itself produces point identification in the same setting. The paper explores weaker identifying assumptions that can produce informative partial-identification regions, supplies theoretical justification for estimating association parameters that govern the joint strata distribution (possible only when PI is violated), shows that such parameters remain only partially identifiable without strong parametric restrictions, and extends the analysis to semiparametric and nonparametric Bayesian models.

Significance. If the derivations are correct, the paper supplies a clear theoretical explanation for why common joint-modeling strategies in principal stratification cannot deliver point identification without PI, even under favorable conditions. It also supplies formal support for practices already used in the literature for recovering strata-association parameters and delineates the additional parametric constraints required. The extension to flexible models broadens the practical relevance of the partial-identification results.

major comments (2)
  1. [§3] The central claim that partial identification persists even when the joint distribution of principal strata is treated as known rests on the mixture structure induced by the latent strata; the manuscript should explicitly display the relevant likelihood or moment equations (likely in §3 or §4) that demonstrate why the causal-effect parameters remain unidentified without additional restrictions.
  2. [§5] The result that association parameters for the strata distribution are identifiable only under violation of PI and only under strong parametric constraints is load-bearing for the later semiparametric extension; the proof should be checked for any hidden identifying restrictions that might inadvertently restore point identification.
minor comments (2)
  1. Notation for the principal strata and the association parameters should be introduced once and used consistently; currently the abstract and later sections appear to employ slightly different symbols for the same quantities.
  2. The manuscript would benefit from a small simulation study (even a simple binary-outcome example) that numerically illustrates the width of the partial-identification intervals under the parametric models discussed in §3.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their positive assessment and recommendation of minor revision. We address each major comment below and will incorporate clarifications to improve the manuscript.

read point-by-point responses
  1. Referee: [§3] The central claim that partial identification persists even when the joint distribution of principal strata is treated as known rests on the mixture structure induced by the latent strata; the manuscript should explicitly display the relevant likelihood or moment equations (likely in §3 or §4) that demonstrate why the causal-effect parameters remain unidentified without additional restrictions.

    Authors: We agree that explicitly presenting the likelihood or moment equations would clarify the mixture structure and the source of partial identification. In the revised manuscript, we will add these equations in Section 3, showing how the observed-data likelihood is a mixture over the latent strata even when the strata distribution is known, and why the principal causal effect parameters remain unidentified without further restrictions. revision: yes

  2. Referee: [§5] The result that association parameters for the strata distribution are identifiable only under violation of PI and only under strong parametric constraints is load-bearing for the later semiparametric extension; the proof should be checked for any hidden identifying restrictions that might inadvertently restore point identification.

    Authors: We have re-examined the proof in Section 5. The identification of the association parameters requires both the violation of principal ignorability and the strong parametric constraints on the outcome models; the derivation contains no hidden restrictions that would restore point identification. In the revision, we will add a short clarifying remark after the proof to explicitly state that the result relies only on the assumptions listed in the section. revision: partial

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper's central results establish partial identification of principal causal effects from the mixture structure of latent strata under correctly specified parametric joint models, even when the strata distribution is treated as known. This follows directly from the identification framework and mixture representation without reducing target quantities to fitted parameters by construction or relying on load-bearing self-citations. The derivations for weaker assumptions yielding informative bounds and the conditions under which association parameters are identifiable are presented as consequences of the model structure itself, with no evidence that any key step equates to its inputs via self-definition, renaming, or imported uniqueness theorems. The work is self-contained against external benchmarks of mixture identifiability.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The work rests on the standard principal stratification framework and parametric modeling assumptions; no new entities are introduced and free parameters are the usual model coefficients estimated from data.

free parameters (1)
  • parameters of parametric outcome and strata models
    Coefficients in the joint parametric distributions are estimated from observed data and enter the identification analysis.
axioms (1)
  • standard math Standard axioms of probability and the potential-outcomes framework for principal stratification
    The entire analysis is conducted inside the principal stratification model that assumes well-defined potential outcomes and strata membership.

pith-pipeline@v0.9.0 · 5735 in / 1277 out tokens · 25956 ms · 2026-05-23T07:36:38.754984+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

43 extracted references · 43 canonical work pages

  1. [1]

    barticle [author] Angrist , Joshua D. J. D. , Imbens , Guido W. G. W. Rubin , Donald B. D. B. ( 1996 ). Identification of Causal Effects Using Instrumental Variables . Journal of the American Statistical Association 91 444–455 . https://doi.org/10.2307/2291629 barticle

  2. [2]

    , Wu , Minxuan M

    barticle [author] Antonelli , Joseph J. , Wu , Minxuan M. , Mealli , Fabrizia F. , Beck , Brenden B. Mattei , Alessandra A. ( 2023 ). Principal stratification with continuous treatments and continuous post-treatment variables . arXiv preprint arXiv:2309.14486 . barticle

  3. [3]

    , Mattei , Alessandra A

    barticle [author] Baccini , Michela M. , Mattei , Alessandra A. Mealli , Fabrizia F. ( 2017 ). Bayesian inference for causal mechanisms with application to a randomized study for postoperative pain control . Biostatistics 18 605–617 . https://doi.org/10.1093/biostatistics/kxx010 barticle

  4. [4]

    Grilli , Leonardo L

    barticle [author] Bartolucci , Francesco F. Grilli , Leonardo L. ( 2011 ). Modeling Partial Compliance Through Copulas in a Principal Stratification Framework . Journal of the American Statistical Association 106 469–479 . https://doi.org/10.1198/jasa.2011.ap09094 barticle

  5. [5]

    truncated

    barticle [author] Bia , M. M. , Mattei , A. A. Mercatanti , A. A. ( 2022 ). Assessing Causal Effects in a longitudinal observational study with “truncated” outcomes due to unemployment and nonignorable missing data . Journal of Business & Economic Statistics 40 718--729 . barticle

  6. [6]

    , Molenberghs , Geert G

    bbook [author] Burzykowski , Tomasz T. , Molenberghs , Geert G. Buyse , Marc E M. E. ( 2005 ). The Evaluation of Surrogate Endpoints . Springer Nature . https://doi.org/10.1007/b138566 bbook

  7. [7]

    Small , D S D

    barticle [author] Cheng , J J. Small , D S D. S. ( 2006 ). Bounds on causal effects in three-arm trials with non-compliance . Journal of the Royal Statistical Society: Series B (Statistical Methodology) 68 815--836 . barticle

  8. [8]

    barticle [author] Chipman , Hugh A. H. A. , George , Edward I. E. I. McCulloch , Robert E. R. E. ( 2010 ). BART: Bayesian additive regression trees . The Annals of Applied Statistics 4 266–298 . https://doi.org/10.1214/09-aoas285 barticle

  9. [9]

    , Geng , Zhi Z

    barticle [author] Ding , Peng P. , Geng , Zhi Z. , Yan , Wei W. Zhou , Xiao-Hua X.-H. ( 2011 ). Identifiability and estimation of causal effects by principal stratification with outcomes truncated by death . Journal of the American Statistical Association 106 1578--1591 . barticle

  10. [10]

    Lu , Jiannan J

    barticle [author] Ding , Peng P. Lu , Jiannan J. ( 2016 ). Principal stratification analysis using principal scores . Journal of the Royal Statistical Society: Series B (Statistical Methodology) 79 757–777 . https://doi.org/10.1111/rssb.12191 barticle

  11. [11]

    Feldman , D D

    barticle [author] Efron , B B. Feldman , D D. ( 1991 ). Compliance as an Explanatory Variable in Clinical Trials . Journal of the American Statistical Association 86 9–17 . https://doi.org/10.1080/01621459.1991.10474996 barticle

  12. [12]

    barticle [author] Frangakis , Constantine E. C. E. Rubin , Donald B. D. B. ( 2002 ). Principal Stratification in Causal Inference . Biometrics 58 21–29 . https://doi.org/10.1111/j.0006-341x.2002.00021.x barticle

  13. [13]

    ( 2010 )

    barticle [author] Gustafson , Paul P. ( 2010 ). Bayesian Inference for Partially Identified Models . The International Journal of Biostatistics 6 . https://doi.org/10.2202/1557-4679.1206 barticle

  14. [14]

    Richard P

    barticle [author] Hahn , P. Richard P. R. , Murray , Jared S. J. S. Carvalho , Carlos M. C. M. ( 2020 ). Bayesian Regression Tree Models for Causal Inference: Regularization, Confounding, and Heterogeneous Effects . Bayesian Analysis . https://doi.org/10.1214/19-ba1195 barticle

  15. [15]

    barticle [author] Hammer , Scott M. S. M. , Katzenstein , David A. D. A. , Hughes , Michael D. M. D. , Gundacker , Holly H. , Schooley , Robert T. R. T. , Haubrich , Richard H. R. H. , Henry , W. Keith W. K. , Lederman , Michael M. M. M. , Phair , John P. J. P. , Niu , Manette M. , Hirsch , Martin S. M. S. Merigan , Thomas C. T. C. ( 1996 ). A Trial Compa...

  16. [16]

    ( 2011 )

    bbook [author] Hayashi , Fumio F. ( 2011 ). Econometrics . Princeton University Press . bbook

  17. [17]

    barticle [author] Hill , Jennifer L. J. L. ( 2011 ). Bayesian Nonparametric Modeling for Causal Inference . Journal of Computational and Graphical Statistics 20 217–240 . https://doi.org/10.1198/jcgs.2010.08162 barticle

  18. [18]

    , Imbens , Guido W

    barticle [author] Hirano , Keisuke K. , Imbens , Guido W. G. W. , Rubin , Donald B. D. B. Zhou , Xiao-Hua X.-H. ( 2000 ). Assessing the effect of an influenza vaccine in an encouragement design . Biostatistics 1 69–88 . https://doi.org/10.1093/biostatistics/1.1.69 barticle

  19. [19]

    truncation-by-death

    barticle [author] Imai , Kosuke K. ( 2008 ). Sharp bounds on the causal effects in randomized experiments with “truncation-by-death” . Statistics & probability letters 78 144--149 . barticle

  20. [20]

    barticle [author] Imbens , Guido W. G. W. Rubin , Donald B. D. B. ( 1997 ). Bayesian inference for causal effects in randomized experiments with noncompliance . The Annals of Statistics 25 305–327 . https://doi.org/10.1214/aos/1034276631 barticle

  21. [21]

    Ding , Peng P

    barticle [author] Jiang , Zhichao Z. Ding , Peng P. ( 2021 ). Identification of causal effects within principal strata using auxiliary variables . Statistical Science 36 493--508 . barticle

  22. [22]

    Rubin , Donald B D

    barticle [author] Jin , Hui H. Rubin , Donald B D. B. ( 2008 ). Principal Stratification for Causal Inference With Extended Partial Compliance . Journal of the American Statistical Association 103 101–111 . https://doi.org/10.1198/016214507000000347 barticle

  23. [23]

    Stuart , Elizabeth A

    barticle [author] Jo , Booil B. Stuart , Elizabeth A. E. A. ( 2009 ). On the use of propensity scores in principal causal effect estimation . Statistics in Medicine 28 2857–2875 . https://doi.org/10.1002/sim.3669 barticle

  24. [24]

    , Daniels , Michael J M

    barticle [author] Kim , Chanmin C. , Daniels , Michael J M. J. , Hogan , Joseph W J. W. , Choirat , Christine C. Zigler , Corwin M C. M. ( 2019 ). Bayesian methods for multiple mediators: Relating principal stratification and causal mediation in the analysis of power plant emission controls . The annals of applied statistics 13 1927 . barticle

  25. [25]

    Zigler , Corwin C

    barticle [author] Kim , Chanmin C. Zigler , Corwin C. ( 2024 ). Bayesian Nonparametric Trees for Principal Causal Effects . arXiv preprint arXiv:2403.13256 . barticle

  26. [26]

    barticle [author] Lee , David S D. S. ( 2009 ). Training, wages, and sample selection: Estimating sharp bounds on treatment effects . The Review of Economic Studies 76 1071--1102 . barticle

  27. [27]

    barticle [author] Linero , Antonio R A. R. Antonelli , Joseph L J. L. ( 2023 ). The how and why of Bayesian nonparametric causal inference . Wiley Interdisciplinary Reviews: Computational Statistics 15 e1583 . barticle

  28. [28]

    , Jiang , Zhichao Z

    barticle [author] Lu , Sizhu S. , Jiang , Zhichao Z. Ding , Peng P. ( 2023 ). Principal Stratification with Continuous Post-Treatment Variables: Nonparametric Identification and Semiparametric Estimation . arXiv preprint arXiv:2309.12425 . barticle

  29. [29]

    bincollection [author] Mattei , A. A. , Forastiere , L. L. Mealli , F. F. ( 2023 ). Assessing Principal Causal Effects Using Principal Score Methods . In Handbook of Matching and Weighting Adjustments for Causal Inference 17 , 313--348 . Chapman and Hall/CRC . bincollection

  30. [30]

    Mattei , Alessandra A

    barticle [author] Mealli , Fabrizia F. Mattei , Alessandra A. ( 2012 ). A refreshing account of principal stratification . The international journal of biostatistics 8 . barticle

  31. [31]

    Pacini , Barbara B

    barticle [author] Mealli , Fabrizia F. Pacini , Barbara B. ( 2013 ). Using secondary outcomes to sharpen inference in randomized experiments with noncompliance . Journal of the American Statistical Association 108 1120--1131 . barticle

  32. [32]

    , Pacini , Barbara B

    barticle [author] Mealli , Fabrizia F. , Pacini , Barbara B. Stanghellini , Elena E. ( 2016 ). Identification of principal causal effects using additional outcomes in concentration graphs . Journal of Educational and Behavioral Statistics 41 463--480 . barticle

  33. [33]

    barticle [author] Nguyen , Trang Quynh T. Q. , Stuart , Elizabeth A E. A. , Scharfstein , Daniel O D. O. Ogburn , Elizabeth L E. L. ( 2024 ). Sensitivity analysis for principal ignorability violation in estimating complier and noncomplier average causal effects . Statistics in Medicine . https://doi.org/10.1002/sim.10153 barticle

  34. [34]

    barticle [author] Rubin , Donald B D. B. ( 1974 ). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology 66 688 . barticle

  35. [35]

    barticle [author] Rubin , Donald B. D. B. ( 1980 ). Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment . Journal of the American Statistical Association 75 591 . https://doi.org/10.2307/2287653 barticle

  36. [36]

    barticle [author] Rubin , Donald B. D. B. ( 2006 ). The designversus the analysis of observational studies for causal effects: parallels with the design of randomized trials . Statistics in Medicine 26 20–36 . https://doi.org/10.1002/sim.2739 barticle

  37. [37]

    barticle [author] Schwartz , Scott L. S. L. , Li , Fan F. Mealli , Fabrizia F. ( 2011 ). A Bayesian Semiparametric Approach to Intermediate Variables in Causal Inference . Journal of the American Statistical Association 106 1331–1344 . https://doi.org/10.1198/jasa.2011.ap10425 barticle

  38. [38]

    barticle [author] VanderWeele , Tyler J T. J. ( 2008 ). Simple relations between principal stratification and direct and indirect effects . Statistics & Probability Letters 78 2957--2962 . barticle

  39. [39]

    , Zhang , Yufen Y

    barticle [author] Wang , Craig C. , Zhang , Yufen Y. , Mealli , Fabrizia F. Bornkamp , Björn B. ( 2022 ). Sensitivity analyses for the principal ignorability assumption using multiple imputation . Pharmaceutical Statistics 22 64–78 . https://doi.org/10.1002/pst.2260 barticle

  40. [40]

    Small , Dylan S D

    barticle [author] Yang , Fan F. Small , Dylan S D. S. ( 2016 ). Using post-outcome measurement information in censoring-by-death problems . Journal of the Royal Statistical Society Series B: Statistical Methodology 78 299--318 . barticle

  41. [41]

    barticle [author] Zhang , Junni L J. L. Rubin , Donald B D. B. ( 2003 ). Estimation of causal effects via principal stratification when some outcomes are truncated by “death” . Journal of Educational and Behavioral Statistics 28 353--368 . barticle

  42. [42]

    Yang , Shu S

    barticle [author] Zhang , Yichi Y. Yang , Shu S. ( 2025 ). Semiparametric localized principal stratification analysis with continuous strata . Journal of the Royal Statistical Society Series B (Statistical Methodology) . https://doi.org/10.1093/jrsssb/qkaf034 barticle

  43. [43]

    barticle [author] Zigler , Corwin M C. M. Belin , Thomas R T. R. ( 2012 ). A Bayesian approach to improved estimation of causal effect predictiveness for a principal surrogate endpoint . Biometrics 68 922--932 . barticle