The Promises of Multiple Experiments: Identifying Joint Distribution of Potential Outcomes
Pith reviewed 2026-05-22 18:58 UTC · model grok-4.3
The pith
Multiple experiments identify the joint distribution of potential outcomes under transportability of state transitions
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Under the assumption of transportability of state transition probabilities for potential outcomes across datasets and a regular full-column rank condition, the joint distribution of potential outcomes is identified; a least-squares-based estimator is consistent and asymptotically normal. The key identification assumptions are testable in an overidentified setting and are analogous to those in the context of instrumental variables, with the dataset indicator serving as an instrument. The framework further extends to identify and estimate principal causal effects.
What carries the argument
Transportability of state transition probabilities for potential outcomes across datasets, which connects separate experiments to recover the full joint distribution.
If this is right
- The joint distribution of potential outcomes can be recovered from multiple experiments.
- A simple least-squares estimator for the joint distribution is consistent and asymptotically normal.
- Principal causal effects become identifiable and estimable within the same framework.
- The identifying assumptions can be tested directly in overidentified settings.
Where Pith is reading between the lines
- The approach may extend to settings where experiments share similar transition structures even if populations differ slightly.
- It offers a way to pool information across trials for estimands that require joint rather than separate outcome information.
Load-bearing premise
State transition probabilities for potential outcomes are the same across the different experimental datasets.
What would settle it
If combining the datasets under the transportability assumption produces inconsistent estimates of the joint distribution when checked against held-out data or when the observed matrix fails the full-column rank condition, the identification would not hold.
read the original abstract
Typical causal effects are defined based on the marginal distribution of potential outcomes. However, many real-world applications require causal estimands involving the joint distribution of potential outcomes to enable more nuanced treatment evaluation and selection. In this article, we propose a novel framework for identifying and estimating the joint distribution of potential outcomes using multiple experimental datasets. We introduce the assumption of transportability of state transition probabilities for potential outcomes across datasets and establish the identification of the joint distribution under this assumption, along with a regular full-column rank condition. The key identification assumptions are testable in an overidentified setting and are analogous to those in the context of instrumental variables, with the dataset indicator serving as "instrument". Moreover, we propose an easy-to-use least-squares-based estimator for the joint distribution of potential outcomes in each dataset, proving its consistency and asymptotic normality. We further extend the proposed framework to identify and estimate principal causal effects. We empirically demonstrate the proposed framework by conducting extensive simulations and applying it to evaluate the surrogate endpoint in a real-world application.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to identify the joint distribution of potential outcomes using multiple experimental datasets under the transportability of state transition probabilities across datasets and a regular full-column rank condition on the design matrix (with the dataset indicator serving as an instrument). It establishes that the key assumptions are testable in an overidentified setting, proposes a least-squares-based estimator that is consistent and asymptotically normal, extends the framework to principal causal effects, and validates the approach via simulations and a real-world surrogate endpoint application.
Significance. If the identification result holds, the work would advance causal inference by enabling estimation of joint distributions of potential outcomes, which support more nuanced treatment evaluation and selection than marginal effects alone. The IV-style analogy with testable assumptions and the provision of an easy-to-use consistent estimator with asymptotic normality represent practical strengths. Credit is due for the explicit consistency and asymptotic normality results for the least-squares estimator as well as the empirical demonstrations through extensive simulations and a real-data application.
major comments (2)
- [§3] §3 (Identification result): the claim that the joint distribution is identified under transportability of state transition probabilities plus the full-column rank condition requires an explicit step-by-step derivation showing how the observed conditional distributions map to the target joint probabilities; without this, it is difficult to verify that the rank condition is sufficient and non-circular.
- [§5] §5 (Estimator and asymptotics): the consistency and asymptotic normality of the least-squares estimator are central to the practical contribution, yet the manuscript should state the precise form of the estimator (e.g., the design matrix construction) and the regularity conditions invoked for the asymptotic normality result.
minor comments (2)
- [Introduction] The notation for potential outcomes and state transitions would benefit from a small illustrative example early in the introduction to improve accessibility.
- [Simulations] Simulation tables should report standard errors or coverage probabilities alongside point estimates to allow readers to assess finite-sample performance more fully.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive comments. We address each major point below and will incorporate clarifications to improve the manuscript's clarity and rigor.
read point-by-point responses
-
Referee: §3 (Identification result): the claim that the joint distribution is identified under transportability of state transition probabilities plus the full-column rank condition requires an explicit step-by-step derivation showing how the observed conditional distributions map to the target joint probabilities; without this, it is difficult to verify that the rank condition is sufficient and non-circular.
Authors: We agree that an explicit step-by-step derivation would strengthen the presentation. In the revised manuscript, we will expand Section 3 with a detailed derivation that starts from the observed conditional distributions, applies the transportability assumption on state transition probabilities, and shows how the full-column rank condition on the design matrix (with dataset indicator as instrument) yields unique identification of the joint distribution of potential outcomes. Intermediate algebraic steps will be included to confirm that the rank condition is sufficient and non-circular. revision: yes
-
Referee: §5 (Estimator and asymptotics): the consistency and asymptotic normality of the least-squares estimator are central to the practical contribution, yet the manuscript should state the precise form of the estimator (e.g., the design matrix construction) and the regularity conditions invoked for the asymptotic normality result.
Authors: We appreciate this recommendation for greater precision. In the revised Section 5, we will explicitly define the least-squares estimator, including the construction of the design matrix that incorporates the dataset indicators. We will also state the regularity conditions (e.g., bounded moments, positive definiteness of the limiting matrix, and the maintained rank condition) under which consistency and asymptotic normality hold, with a brief reference to standard results for linear estimators. revision: yes
Circularity Check
No significant circularity
full rationale
The paper derives identification of the joint distribution of potential outcomes directly from the explicitly stated transportability assumption on state-transition probabilities across datasets together with the full-column-rank condition on the matrix of dataset indicators. The least-squares estimator is then constructed from this identified expression and its consistency and asymptotic normality are shown via standard arguments for linear estimators under the maintained assumptions; neither step reduces to a fitted parameter being relabeled as a prediction nor relies on a self-citation chain for its justification. The overidentification testability claim follows from the IV-style analogy with the dataset indicator as instrument and does not presuppose the target result. The derivation chain is therefore self-contained against the paper's own stated assumptions and external statistical theory.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Transportability of state transition probabilities for potential outcomes across datasets
Forward citations
Cited by 1 Pith paper
-
Improving Treatment Effect Estimation in Trials through Adaptive Borrowing of External Controls
An adaptive influence-function framework selects optimal external control subsets to minimize MSE of the ATE estimator in RCTs and adds outcome calibration for better data use.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.