Probabilities of Causation for Continuous Outcomes: Bounds and Identification
Pith reviewed 2026-05-09 16:38 UTC · model grok-4.3
The pith
For continuous outcomes, the probability that a treatment was necessary can be bounded sharply from above and below using only ignorability and monotonicity, and narrowed further with copula models of potential-outcome dependence.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper proposes the general probability of necessity (GPN) for continuous outcomes as the probability that an observed value of Y would not have been realized had the treatment not been received. Under the standard assumptions of ignorability and monotonicity, sharp lower and upper bounds on GPN are derived directly from the observed data and the joint distribution of the potential outcomes. A copula-based identification strategy is further introduced that exploits any available information on the dependence structure between Y(1) and Y(0) to obtain strictly narrower bounds.
What carries the argument
The general probability of necessity (GPN) defined on the difference between observed and counterfactual continuous outcomes, together with the sharp bounds obtained from the marginal distributions under ignorability and monotonicity, and the copula representation used to incorporate dependence between potential outcomes.
If this is right
- GPN supplies an attribution measure that applies directly to blood pressure, income, test scores, and other continuous variables.
- The bounds are the narrowest possible under the stated assumptions, so any further tightening requires either stronger assumptions or additional data.
- The copula approach lets analysts use external information or estimates of rank dependence to shrink the interval without assuming a specific functional form for the outcome distributions.
- Simulation checks confirm that the bounds contain the true GPN whenever the ignorability and monotonicity conditions are satisfied.
- Real-data examples illustrate that the method yields informative intervals in medical and economic attribution problems where binary PN is inapplicable.
Where Pith is reading between the lines
- Researchers could combine the copula tightening step with existing methods for estimating rank correlations from auxiliary data or theory to make attribution statements more precise in observational studies.
- The same bounding strategy might be adapted to define and partially identify a continuous analogue of the probability of sufficiency.
- Because the method is nonparametric except for the copula choice, it naturally lends itself to sensitivity analyses that vary the strength of assumed dependence.
- In practice the width of the reported interval will often be driven by how much dependence between potential outcomes can be credibly assumed or estimated.
Load-bearing premise
The claimed sharp bounds hold only when treatment is ignorable given the observed covariates and when the treatment cannot reverse the ordering of potential outcomes for any individual.
What would settle it
A randomized experiment with a continuous outcome in which the empirical frequency of cases where the outcome would have differed without treatment falls outside the interval given by the derived lower and upper bounds.
Figures
read the original abstract
The probability of necessity (PN), which quantifies the probability that an observed event would not have occurred in the absence of the treatment, is a central estimand in attribution analysis. While PN has been extensively studied for binary outcomes and has recently been developed for ordinal outcomes, a formal framework for continuous outcomes remains underdeveloped. To address this gap, we propose the general probability of necessity (GPN) for continuous outcomes, a setting that is substantially more challenging than the binary and ordinal cases. Rather than imposing strong identifiability assumptions, we adopt a partial identification perspective and derive sharp lower and upper bounds under standard assumptions of ignorability and monotonicity. We further introduce a copula-based framework that exploits dependence information between potential outcomes to tighten these bounds. Simulation studies and real-world applications demonstrate the effectiveness of our method.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes the General Probability of Necessity (GPN) for continuous outcomes as an extension of the probability of necessity. Under standard ignorability and monotonicity assumptions, it derives sharp lower and upper bounds on the GPN. It further develops a copula-based framework that incorporates dependence information between potential outcomes to tighten the bounds. The claims are supported by simulation studies and real-data applications.
Significance. If the bounds are verifiably sharp and the copula tightening is shown to be valid under explicit, non-circular assumptions, the work would meaningfully extend partial-identification methods from binary/ordinal to continuous outcomes, an area that remains underdeveloped. The partial-identification framing is appropriate, and the simulation/application results provide useful evidence of practical performance. The explicit use of copulas to exploit dependence is a potentially valuable technical contribution if the identification of the dependence parameter is clarified.
major comments (2)
- [§3.2] §3.2, around Eq. (8)–(10): the claim that the derived bounds are sharp under ignorability and monotonicity requires an explicit attainability argument. The current construction appears to rely on the Fréchet–Hoeffding bounds for the joint distribution of potential outcomes, but it is unclear whether monotonicity alone guarantees that these bounds are attained for arbitrary continuous marginals without additional regularity conditions on the conditional distributions.
- [§4.2] §4.2, copula tightening step: the framework requires a copula family and dependence parameter to tighten the GPN bounds. Because the joint distribution of (Y(1), Y(0)) is not observed, it is not evident whether the dependence parameter is identified from the data, chosen by the analyst, or calibrated in a way that preserves the partial-identification interpretation. If the parameter is fitted to the same observed data used for the marginals, the tightened bounds may no longer be sharp in the original sense and could introduce circularity.
minor comments (3)
- [Introduction] The introduction cites binary and ordinal PN literature but omits a brief comparison of the technical challenges unique to the continuous case (e.g., lack of natural ordering for events).
- [Simulation studies] Figure 2 (simulation results) would benefit from error bars or shaded regions indicating variability across replications to allow visual assessment of the bound tightness.
- [§2] Notation for the GPN estimand is introduced in §2 but the link to the classical PN formula is not restated explicitly; a one-line reminder would aid readers.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive comments on our manuscript. We address the major comments point by point below, indicating where revisions will be made to strengthen the paper.
read point-by-point responses
-
Referee: [§3.2] §3.2, around Eq. (8)–(10): the claim that the derived bounds are sharp under ignorability and monotonicity requires an explicit attainability argument. The current construction appears to rely on the Fréchet–Hoeffding bounds for the joint distribution of potential outcomes, but it is unclear whether monotonicity alone guarantees that these bounds are attained for arbitrary continuous marginals without additional regularity conditions on the conditional distributions.
Authors: We acknowledge that the attainability of the bounds should be demonstrated more explicitly. In the revised version, we will include a detailed proof in an appendix showing that the lower and upper bounds are attained by specific joint distributions of the potential outcomes that satisfy the observed marginals, the ignorability assumption, and the monotonicity condition. Specifically, we construct the extremal couplings by adjusting the Fréchet-Hoeffding bounds to respect the direction of the treatment effect implied by monotonicity (i.e., Y(1) ≥ Y(0) almost surely). This construction works for arbitrary continuous marginal distributions without requiring further regularity conditions beyond the continuity of the CDFs, as the quantile transformations allow us to achieve the bounds. We believe this addresses the concern and confirms the sharpness. revision: yes
-
Referee: [§4.2] §4.2, copula tightening step: the framework requires a copula family and dependence parameter to tighten the GPN bounds. Because the joint distribution of (Y(1), Y(0)) is not observed, it is not evident whether the dependence parameter is identified from the data, chosen by the analyst, or calibrated in a way that preserves the partial-identification interpretation. If the parameter is fitted to the same observed data used for the marginals, the tightened bounds may no longer be sharp in the original sense and could introduce circularity.
Authors: The copula approach is presented as a way to incorporate additional dependence information to obtain tighter bounds within the partial identification framework. The dependence parameter is not claimed to be identified from the observed data; rather, it is a sensitivity parameter that the analyst can specify based on substantive knowledge or external data sources. We will revise the manuscript to explicitly state that the tightened bounds are sharp conditional on the chosen copula family and parameter value. We agree that fitting the parameter directly to the observed data could compromise the partial identification guarantees and introduce circularity, and we will add a discussion warning against this practice and suggesting instead the use of plausible ranges for the dependence parameter. This maintains the integrity of the partial-identification interpretation. revision: partial
Circularity Check
No significant circularity
full rationale
The derivation proceeds from standard ignorability and monotonicity assumptions to sharp bounds on GPN, then applies a copula model that incorporates separate dependence information between potential outcomes to tighten those bounds. No step reduces by construction to a fitted parameter, self-definition, or load-bearing self-citation; the copula step is an explicit modeling choice that adds information rather than renaming or tautologically reproducing the input bounds. The approach is framed as partial identification, consistent with the continuous-outcome setting, and remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Ignorability: no unmeasured confounding between treatment and potential outcomes
- domain assumption Monotonicity: treatment never decreases the outcome
invented entities (2)
-
General Probability of Necessity (GPN)
no independent evidence
-
Copula-based tightening framework
no independent evidence
Reference graph
Works this paper leans on
-
[1]
and Lee, David S
Almond, Douglas and Chay, Kenneth Y. and Lee, David S. , title =. The Quarterly Journal of Economics , volume =
-
[2]
Cross-World Assumption and Refining Prediction Intervals for Individual Treatment Effects , Year =
Juraj Bodik and Yaxuan Huang and Bin Yu , Journal =. Cross-World Assumption and Refining Prediction Intervals for Individual Treatment Effects , Year =
-
[3]
Maternal and Child Health Journal , volume=
Maternal smoking during pregnancy and birthweight: a propensity score matching approach , author=. Maternal and Child Health Journal , volume=. 2008 , publisher=
2008
-
[4]
Philip Dawid and Monica Musio , date-added =
A. Philip Dawid and Monica Musio , date-added =. Effects of Causes and Causes of Effects , volume =. Annual Review of Statistics and Its Application , pages =
-
[5]
Sociological Methods & Research , volume=
Fitting science into legal contexts: assessing effects of causes or causes of effects? , author=. Sociological Methods & Research , volume=. 2014 , publisher=
2014
-
[6]
Law, Probability and Risk , volume =
Dawid, A Philip and Musio, Monica and Murtas, Rossella , title =. Law, Probability and Risk , volume =
-
[7]
2024 , edition =
Peng Ding , title =. 2024 , edition =
2024
-
[8]
Compliance as an explanatory variable in clinical trials , volume =
Efron, Bradley and Feldman, David , journal =. Compliance as an explanatory variable in clinical trials , volume =
-
[9]
Sharp Bounds on the Distribution of the Treatment Effect and Their Statistical Inference , volume =
Yanqin Fan and Sang Soo Park , journal =. Sharp Bounds on the Distribution of the Treatment Effect and Their Statistical Inference , volume =
-
[10]
Fréchet , journal =
M. Fréchet , journal =. Sur les tableaux dont les marges et des bornes sont données , volume =
-
[11]
Conditional Outcome Equivalence: A Quantile Alternative to
Givens, Josh and Reeve, Henry W J and Liu, Song and Reluga, Katarzyna , booktitle=. Conditional Outcome Equivalence: A Quantile Alternative to. 2024 , address =
2024
-
[12]
Holland , date-added =
Paul W. Holland , date-added =. Statistics and Causal Inference , volume =. Journal of the American Statistical Association , pages =
-
[13]
Joe, H. , year=. Dependence Modeling with Copulas , edition=
-
[14]
Genetics in Medicine , volume=
An epidemiologic assessment of genomic profiling for measuring susceptibility to common diseases and targeting interventions , author=. Genetics in Medicine , volume=. 2004 , publisher=
2004
-
[15]
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2024) , volume=
Probabilities of causation with nonbinary treatment and effect , author=. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2024) , volume=. 2024 , address=
2024
-
[16]
Learning probabilities of causation from finite population data , author=. arXiv:2210.08453 , year=
-
[17]
Causes of Effects: Learning individual responses from population data , Year =
Scott Mueller and Ang Li and Judea Pearl , Journal =. Causes of Effects: Learning individual responses from population data , Year =
-
[18]
, edition =
Nelsen, Roger B. , edition =. An Introduction to Copulas , year =
-
[19]
Synthese , volume=
Probabilities of causation: three counterfactual interpretations and their identification , author=. Synthese , volume=
-
[20]
Causality , year =
Pearl, Judea , publisher =. Causality , year =
-
[21]
Sociological Methods & Research , volume=
Causes of effects and effects of causes , author=. Sociological Methods & Research , volume=
-
[22]
Jewell , publisher =
Judea Pearl and Madelyn Glymour and Nicholas P. Jewell , publisher =. Causal Inference in Statistics: A Primer , year =
-
[23]
Rosenbaum , Publisher =
Paul R. Rosenbaum , Publisher =. Design of Observational Studies , Year =
-
[24]
Biometrika , volume=
The central role of the propensity score in observational studies for causal effects , author=. Biometrika , volume=. 1983 , publisher=
1983
-
[25]
Rubin, D. B. , date-modified =. Estimating causal effects of treatments in randomized and nonrandomized studies , volume =. Journal of educational psychology , pages =
-
[26]
Differential etiology: Inferring specific causation in the law from group data in science , author=. Ariz. L. Rev. , volume=. 2021 , publisher=
2021
-
[27]
Biometrics , volume=
Treatment benefit and treatment harm rate to characterize heterogeneity in treatment effect , author=. Biometrics , volume=. 2013 , publisher=
2013
-
[28]
Fonctions de r
Sklar, Abe , journal =. Fonctions de r
-
[29]
Jerzy Splawa-Neyman and D. M. Dabrowska and T. P. Speed , journal =. On the application of probability theory to agricultural experiments. Essay on principles. Section 9 , volume =. 1990 , number =
1990
-
[30]
Invariant Deep Uplift Modeling for Incentive Assignment in Online Marketing via Probability of Necessity and Sufficiency , publisher =
Zexu Sun and Qiyu Han and Hao Yang and Anpeng Wu and Minqin Zhu and Dugang Liu and Chen Ma and Yunpeng Weng and Xing Tang and Xiuqiang He , booktitle =. Invariant Deep Uplift Modeling for Incentive Assignment in Online Marketing via Probability of Necessity and Sufficiency , publisher =. 2025 , volume =
2025
-
[31]
Annals of Mathematics and Artificial Intelligence , volume=
Probabilities of causation: Bounds and identification , author=. Annals of Mathematics and Artificial Intelligence , volume=. 2000 , publisher=
2000
-
[32]
Statistics in Medicine , volume=
Semiparametric efficient inference for the probability of necessary and sufficient causation , author=. Statistics in Medicine , volume=. 2025 , publisher=
2025
-
[33]
Southern Economic Journal , volume =
Walker, Mary Beth and Tekin, Erdal and Wallace, Sally , title =. Southern Economic Journal , volume =
-
[34]
1992 , institution=
Low Birth Weight: A tabulation of available information , author=. 1992 , institution=
1992
-
[35]
Quantifying Individual Risk for Binary Outcome: Bounds and Inference , Year =
Peng Wu and Peng Ding and Zhi Geng and Yue Li , Journal =. Quantifying Individual Risk for Binary Outcome: Bounds and Inference , Year =
-
[36]
2023 , booktitle =
Yang, Mengyue and Fang, Zhen and Zhang, Yonggang and Du, Yali and Liu, Furui and Ton, Jean-Francois and Wang, Jianhong and Wang, Jun , title =. 2023 , booktitle =
2023
-
[37]
Biometrika , volume =
Zhang, Chao and Geng, Zhi and Li, Wei and Ding, Peng , title =. Biometrika , volume =
-
[38]
2025 , journal =
Causal attribution with confidence , author=. 2025 , journal =
2025
-
[39]
Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023) , year=
Conditional Counterfactual Causal Effect for Individual Attribution , author=. Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023) , year=
2023
-
[40]
and Wolf, Michael , title =
Romano, Joseph P. and Wolf, Michael , title =. Metrika , volume =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.