Partial identification of principal causal effects under violations of principal ignorability
Pith reviewed 2026-05-23 07:36 UTC · model grok-4.3
The pith
Even with known principal strata distributions and correctly specified parametric outcome models, principal causal effects remain only partially identified without the principal ignorability assumption.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Even if the joint distribution of principal strata is known, the strategy of using parametric models to jointly model the outcome and principal strata without requiring the principal ignorability assumption necessarily leads to only partial identification of causal effects, even under very simple and correctly specified outcome models. While principal ignorability leads to point identification in this setting, alternative weaker assumptions can lead to informative partial identification regions. Association parameters that govern the joint distribution of principal strata are identifiable only if the principal ignorability assumption is violated, and only under strong parametric constraints.
What carries the argument
Principal stratification framework classifying units into strata by potential post-treatment variables, with principal ignorability as the conditional independence assumption between strata membership and outcomes given covariates, and parametric joint models for the outcome and strata.
If this is right
- Parametric joint models without principal ignorability produce partial identification regions rather than point estimates for principal causal effects.
- Weaker assumptions beyond principal ignorability can still yield informative bounds on the causal effects.
- Association parameters for the joint distribution of principal strata become identifiable only when principal ignorability is violated.
- Due to the partial identifiability of the causal effects, the association parameters themselves require strong parametric constraints to be identified.
- The partial identification result continues to hold under semiparametric and nonparametric Bayesian extensions of the joint models.
Where Pith is reading between the lines
- Analyses relying on joint parametric modeling should routinely report partial identification intervals when principal ignorability cannot be justified.
- The necessity of parametric constraints for identifying strata association parameters suggests that fully nonparametric approaches will inherit the same partial identification issue.
- These results point toward developing sensitivity analyses that vary the strength of departure from principal ignorability while tracking changes in the identification region.
- In practice, the findings imply that researchers may need to collect richer covariate information or auxiliary data to tighten the partial identification bounds.
Load-bearing premise
The parametric forms chosen for the joint outcome-strata model are correctly specified and impose no additional identifying restrictions beyond that structure.
What would settle it
A finite-sample or asymptotic calculation in which the width of the partial identification region for a principal causal effect fails to collapse to a single point as the sample size grows, even when the strata distribution is treated as known and the outcome model is correctly specified.
read the original abstract
Principal stratification is a general framework for studying causal mechanisms involving post-treatment variables. When estimating principal causal effects, the principal ignorability assumption is commonly invoked, which we study in detail in this manuscript. Our first key contribution is studying a commonly used strategy of using parametric models to jointly model the outcome and principal strata without requiring the principal ignorability assumption. We show that even if the joint distribution of principal strata is known, this strategy necessarily leads to only partial identification of causal effects, even under very simple and correctly specified outcome models. While principal ignorability leads to point identification in this setting, we discuss alternative, weaker assumptions and show how they can lead to informative partial identification regions. An additional contribution is that we provide theoretical support to strategies used in the literature for identifying association parameters that govern the joint distribution of principal strata. We prove that this is possible, but only if the principal ignorability assumption is violated. Additionally, due to partial identifiability of causal effects even when these association parameters are known, we show that these association parameters are only identifiable under strong parametric constraints. Lastly, we extend these results to more flexible semiparametric and nonparametric Bayesian models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript examines violations of the principal ignorability (PI) assumption in principal stratification. It demonstrates that jointly modeling outcomes and principal strata via parametric models yields only partial identification of principal causal effects, even when the strata distribution is known and the outcome models are correctly specified and simple. PI itself produces point identification in the same setting. The paper explores weaker identifying assumptions that can produce informative partial-identification regions, supplies theoretical justification for estimating association parameters that govern the joint strata distribution (possible only when PI is violated), shows that such parameters remain only partially identifiable without strong parametric restrictions, and extends the analysis to semiparametric and nonparametric Bayesian models.
Significance. If the derivations are correct, the paper supplies a clear theoretical explanation for why common joint-modeling strategies in principal stratification cannot deliver point identification without PI, even under favorable conditions. It also supplies formal support for practices already used in the literature for recovering strata-association parameters and delineates the additional parametric constraints required. The extension to flexible models broadens the practical relevance of the partial-identification results.
major comments (2)
- [§3] The central claim that partial identification persists even when the joint distribution of principal strata is treated as known rests on the mixture structure induced by the latent strata; the manuscript should explicitly display the relevant likelihood or moment equations (likely in §3 or §4) that demonstrate why the causal-effect parameters remain unidentified without additional restrictions.
- [§5] The result that association parameters for the strata distribution are identifiable only under violation of PI and only under strong parametric constraints is load-bearing for the later semiparametric extension; the proof should be checked for any hidden identifying restrictions that might inadvertently restore point identification.
minor comments (2)
- Notation for the principal strata and the association parameters should be introduced once and used consistently; currently the abstract and later sections appear to employ slightly different symbols for the same quantities.
- The manuscript would benefit from a small simulation study (even a simple binary-outcome example) that numerically illustrates the width of the partial-identification intervals under the parametric models discussed in §3.
Simulated Author's Rebuttal
We thank the referee for their positive assessment and recommendation of minor revision. We address each major comment below and will incorporate clarifications to improve the manuscript.
read point-by-point responses
-
Referee: [§3] The central claim that partial identification persists even when the joint distribution of principal strata is treated as known rests on the mixture structure induced by the latent strata; the manuscript should explicitly display the relevant likelihood or moment equations (likely in §3 or §4) that demonstrate why the causal-effect parameters remain unidentified without additional restrictions.
Authors: We agree that explicitly presenting the likelihood or moment equations would clarify the mixture structure and the source of partial identification. In the revised manuscript, we will add these equations in Section 3, showing how the observed-data likelihood is a mixture over the latent strata even when the strata distribution is known, and why the principal causal effect parameters remain unidentified without further restrictions. revision: yes
-
Referee: [§5] The result that association parameters for the strata distribution are identifiable only under violation of PI and only under strong parametric constraints is load-bearing for the later semiparametric extension; the proof should be checked for any hidden identifying restrictions that might inadvertently restore point identification.
Authors: We have re-examined the proof in Section 5. The identification of the association parameters requires both the violation of principal ignorability and the strong parametric constraints on the outcome models; the derivation contains no hidden restrictions that would restore point identification. In the revision, we will add a short clarifying remark after the proof to explicitly state that the result relies only on the assumptions listed in the section. revision: partial
Circularity Check
No significant circularity identified
full rationale
The paper's central results establish partial identification of principal causal effects from the mixture structure of latent strata under correctly specified parametric joint models, even when the strata distribution is treated as known. This follows directly from the identification framework and mixture representation without reducing target quantities to fitted parameters by construction or relying on load-bearing self-citations. The derivations for weaker assumptions yielding informative bounds and the conditions under which association parameters are identifiable are presented as consequences of the model structure itself, with no evidence that any key step equates to its inputs via self-definition, renaming, or imported uniqueness theorems. The work is self-contained against external benchmarks of mixture identifiability.
Axiom & Free-Parameter Ledger
free parameters (1)
- parameters of parametric outcome and strata models
axioms (1)
- standard math Standard axioms of probability and the potential-outcomes framework for principal stratification
Reference graph
Works this paper leans on
-
[1]
barticle [author] Angrist , Joshua D. J. D. , Imbens , Guido W. G. W. Rubin , Donald B. D. B. ( 1996 ). Identification of Causal Effects Using Instrumental Variables . Journal of the American Statistical Association 91 444–455 . https://doi.org/10.2307/2291629 barticle
-
[2]
barticle [author] Antonelli , Joseph J. , Wu , Minxuan M. , Mealli , Fabrizia F. , Beck , Brenden B. Mattei , Alessandra A. ( 2023 ). Principal stratification with continuous treatments and continuous post-treatment variables . arXiv preprint arXiv:2309.14486 . barticle
-
[3]
barticle [author] Baccini , Michela M. , Mattei , Alessandra A. Mealli , Fabrizia F. ( 2017 ). Bayesian inference for causal mechanisms with application to a randomized study for postoperative pain control . Biostatistics 18 605–617 . https://doi.org/10.1093/biostatistics/kxx010 barticle
-
[4]
barticle [author] Bartolucci , Francesco F. Grilli , Leonardo L. ( 2011 ). Modeling Partial Compliance Through Copulas in a Principal Stratification Framework . Journal of the American Statistical Association 106 469–479 . https://doi.org/10.1198/jasa.2011.ap09094 barticle
-
[5]
barticle [author] Bia , M. M. , Mattei , A. A. Mercatanti , A. A. ( 2022 ). Assessing Causal Effects in a longitudinal observational study with “truncated” outcomes due to unemployment and nonignorable missing data . Journal of Business & Economic Statistics 40 718--729 . barticle
work page 2022
-
[6]
bbook [author] Burzykowski , Tomasz T. , Molenberghs , Geert G. Buyse , Marc E M. E. ( 2005 ). The Evaluation of Surrogate Endpoints . Springer Nature . https://doi.org/10.1007/b138566 bbook
-
[7]
barticle [author] Cheng , J J. Small , D S D. S. ( 2006 ). Bounds on causal effects in three-arm trials with non-compliance . Journal of the Royal Statistical Society: Series B (Statistical Methodology) 68 815--836 . barticle
work page 2006
-
[8]
barticle [author] Chipman , Hugh A. H. A. , George , Edward I. E. I. McCulloch , Robert E. R. E. ( 2010 ). BART: Bayesian additive regression trees . The Annals of Applied Statistics 4 266–298 . https://doi.org/10.1214/09-aoas285 barticle
-
[9]
barticle [author] Ding , Peng P. , Geng , Zhi Z. , Yan , Wei W. Zhou , Xiao-Hua X.-H. ( 2011 ). Identifiability and estimation of causal effects by principal stratification with outcomes truncated by death . Journal of the American Statistical Association 106 1578--1591 . barticle
work page 2011
-
[10]
barticle [author] Ding , Peng P. Lu , Jiannan J. ( 2016 ). Principal stratification analysis using principal scores . Journal of the Royal Statistical Society: Series B (Statistical Methodology) 79 757–777 . https://doi.org/10.1111/rssb.12191 barticle
-
[11]
barticle [author] Efron , B B. Feldman , D D. ( 1991 ). Compliance as an Explanatory Variable in Clinical Trials . Journal of the American Statistical Association 86 9–17 . https://doi.org/10.1080/01621459.1991.10474996 barticle
-
[12]
barticle [author] Frangakis , Constantine E. C. E. Rubin , Donald B. D. B. ( 2002 ). Principal Stratification in Causal Inference . Biometrics 58 21–29 . https://doi.org/10.1111/j.0006-341x.2002.00021.x barticle
-
[13]
barticle [author] Gustafson , Paul P. ( 2010 ). Bayesian Inference for Partially Identified Models . The International Journal of Biostatistics 6 . https://doi.org/10.2202/1557-4679.1206 barticle
-
[14]
barticle [author] Hahn , P. Richard P. R. , Murray , Jared S. J. S. Carvalho , Carlos M. C. M. ( 2020 ). Bayesian Regression Tree Models for Causal Inference: Regularization, Confounding, and Heterogeneous Effects . Bayesian Analysis . https://doi.org/10.1214/19-ba1195 barticle
-
[15]
barticle [author] Hammer , Scott M. S. M. , Katzenstein , David A. D. A. , Hughes , Michael D. M. D. , Gundacker , Holly H. , Schooley , Robert T. R. T. , Haubrich , Richard H. R. H. , Henry , W. Keith W. K. , Lederman , Michael M. M. M. , Phair , John P. J. P. , Niu , Manette M. , Hirsch , Martin S. M. S. Merigan , Thomas C. T. C. ( 1996 ). A Trial Compa...
- [16]
-
[17]
barticle [author] Hill , Jennifer L. J. L. ( 2011 ). Bayesian Nonparametric Modeling for Causal Inference . Journal of Computational and Graphical Statistics 20 217–240 . https://doi.org/10.1198/jcgs.2010.08162 barticle
-
[18]
barticle [author] Hirano , Keisuke K. , Imbens , Guido W. G. W. , Rubin , Donald B. D. B. Zhou , Xiao-Hua X.-H. ( 2000 ). Assessing the effect of an influenza vaccine in an encouragement design . Biostatistics 1 69–88 . https://doi.org/10.1093/biostatistics/1.1.69 barticle
-
[19]
barticle [author] Imai , Kosuke K. ( 2008 ). Sharp bounds on the causal effects in randomized experiments with “truncation-by-death” . Statistics & probability letters 78 144--149 . barticle
work page 2008
-
[20]
barticle [author] Imbens , Guido W. G. W. Rubin , Donald B. D. B. ( 1997 ). Bayesian inference for causal effects in randomized experiments with noncompliance . The Annals of Statistics 25 305–327 . https://doi.org/10.1214/aos/1034276631 barticle
-
[21]
barticle [author] Jiang , Zhichao Z. Ding , Peng P. ( 2021 ). Identification of causal effects within principal strata using auxiliary variables . Statistical Science 36 493--508 . barticle
work page 2021
-
[22]
barticle [author] Jin , Hui H. Rubin , Donald B D. B. ( 2008 ). Principal Stratification for Causal Inference With Extended Partial Compliance . Journal of the American Statistical Association 103 101–111 . https://doi.org/10.1198/016214507000000347 barticle
-
[23]
barticle [author] Jo , Booil B. Stuart , Elizabeth A. E. A. ( 2009 ). On the use of propensity scores in principal causal effect estimation . Statistics in Medicine 28 2857–2875 . https://doi.org/10.1002/sim.3669 barticle
-
[24]
barticle [author] Kim , Chanmin C. , Daniels , Michael J M. J. , Hogan , Joseph W J. W. , Choirat , Christine C. Zigler , Corwin M C. M. ( 2019 ). Bayesian methods for multiple mediators: Relating principal stratification and causal mediation in the analysis of power plant emission controls . The annals of applied statistics 13 1927 . barticle
work page 2019
-
[25]
barticle [author] Kim , Chanmin C. Zigler , Corwin C. ( 2024 ). Bayesian Nonparametric Trees for Principal Causal Effects . arXiv preprint arXiv:2403.13256 . barticle
-
[26]
barticle [author] Lee , David S D. S. ( 2009 ). Training, wages, and sample selection: Estimating sharp bounds on treatment effects . The Review of Economic Studies 76 1071--1102 . barticle
work page 2009
-
[27]
barticle [author] Linero , Antonio R A. R. Antonelli , Joseph L J. L. ( 2023 ). The how and why of Bayesian nonparametric causal inference . Wiley Interdisciplinary Reviews: Computational Statistics 15 e1583 . barticle
work page 2023
-
[28]
barticle [author] Lu , Sizhu S. , Jiang , Zhichao Z. Ding , Peng P. ( 2023 ). Principal Stratification with Continuous Post-Treatment Variables: Nonparametric Identification and Semiparametric Estimation . arXiv preprint arXiv:2309.12425 . barticle
-
[29]
bincollection [author] Mattei , A. A. , Forastiere , L. L. Mealli , F. F. ( 2023 ). Assessing Principal Causal Effects Using Principal Score Methods . In Handbook of Matching and Weighting Adjustments for Causal Inference 17 , 313--348 . Chapman and Hall/CRC . bincollection
work page 2023
-
[30]
barticle [author] Mealli , Fabrizia F. Mattei , Alessandra A. ( 2012 ). A refreshing account of principal stratification . The international journal of biostatistics 8 . barticle
work page 2012
-
[31]
barticle [author] Mealli , Fabrizia F. Pacini , Barbara B. ( 2013 ). Using secondary outcomes to sharpen inference in randomized experiments with noncompliance . Journal of the American Statistical Association 108 1120--1131 . barticle
work page 2013
-
[32]
barticle [author] Mealli , Fabrizia F. , Pacini , Barbara B. Stanghellini , Elena E. ( 2016 ). Identification of principal causal effects using additional outcomes in concentration graphs . Journal of Educational and Behavioral Statistics 41 463--480 . barticle
work page 2016
-
[33]
barticle [author] Nguyen , Trang Quynh T. Q. , Stuart , Elizabeth A E. A. , Scharfstein , Daniel O D. O. Ogburn , Elizabeth L E. L. ( 2024 ). Sensitivity analysis for principal ignorability violation in estimating complier and noncomplier average causal effects . Statistics in Medicine . https://doi.org/10.1002/sim.10153 barticle
-
[34]
barticle [author] Rubin , Donald B D. B. ( 1974 ). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology 66 688 . barticle
work page 1974
-
[35]
barticle [author] Rubin , Donald B. D. B. ( 1980 ). Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment . Journal of the American Statistical Association 75 591 . https://doi.org/10.2307/2287653 barticle
-
[36]
barticle [author] Rubin , Donald B. D. B. ( 2006 ). The designversus the analysis of observational studies for causal effects: parallels with the design of randomized trials . Statistics in Medicine 26 20–36 . https://doi.org/10.1002/sim.2739 barticle
-
[37]
barticle [author] Schwartz , Scott L. S. L. , Li , Fan F. Mealli , Fabrizia F. ( 2011 ). A Bayesian Semiparametric Approach to Intermediate Variables in Causal Inference . Journal of the American Statistical Association 106 1331–1344 . https://doi.org/10.1198/jasa.2011.ap10425 barticle
-
[38]
barticle [author] VanderWeele , Tyler J T. J. ( 2008 ). Simple relations between principal stratification and direct and indirect effects . Statistics & Probability Letters 78 2957--2962 . barticle
work page 2008
-
[39]
barticle [author] Wang , Craig C. , Zhang , Yufen Y. , Mealli , Fabrizia F. Bornkamp , Björn B. ( 2022 ). Sensitivity analyses for the principal ignorability assumption using multiple imputation . Pharmaceutical Statistics 22 64–78 . https://doi.org/10.1002/pst.2260 barticle
-
[40]
barticle [author] Yang , Fan F. Small , Dylan S D. S. ( 2016 ). Using post-outcome measurement information in censoring-by-death problems . Journal of the Royal Statistical Society Series B: Statistical Methodology 78 299--318 . barticle
work page 2016
-
[41]
barticle [author] Zhang , Junni L J. L. Rubin , Donald B D. B. ( 2003 ). Estimation of causal effects via principal stratification when some outcomes are truncated by “death” . Journal of Educational and Behavioral Statistics 28 353--368 . barticle
work page 2003
-
[42]
barticle [author] Zhang , Yichi Y. Yang , Shu S. ( 2025 ). Semiparametric localized principal stratification analysis with continuous strata . Journal of the Royal Statistical Society Series B (Statistical Methodology) . https://doi.org/10.1093/jrsssb/qkaf034 barticle
-
[43]
barticle [author] Zigler , Corwin M C. M. Belin , Thomas R T. R. ( 2012 ). A Bayesian approach to improved estimation of causal effect predictiveness for a principal surrogate endpoint . Biometrics 68 922--932 . barticle
work page 2012
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.