Partial Identification of Policy-Relevant Treatment Effects with Instrumental Variables via Optimal Transport
Pith reviewed 2026-05-10 15:56 UTC · model grok-4.3
The pith
Policy-relevant treatment effects can be partially identified by recasting the problem as constrained conditional optimal transport, which reduces to one-dimensional problems with closed-form sharp bounds.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
PRTE partial identification in the generalized Roy model can be formulated as a Constrained Conditional Optimal Transport (CCOT) problem over the joint conditional law of the potential outcome and the latent resistance. The resulting multidimensional CCOT problem reduces analytically to separable one-dimensional OT problems with product costs, yielding sharp closed-form bounds and avoiding direct solution of the original high-dimensional CCOT problem.
What carries the argument
Constrained Conditional Optimal Transport (CCOT) over the joint conditional law of potential outcomes and latent resistance, which reduces to separable one-dimensional OT problems with product costs.
If this is right
- The bounds are sharp and available in closed form without solving the full high-dimensional transport problem.
- Double machine learning estimators based on Neyman-orthogonal scores achieve the parametric root-n rate and asymptotic normality for discrete instruments even with high-dimensional covariates.
- For continuous instruments the framework supplies explicit nonparametric convergence rates.
- The resulting bounds are substantially tighter than those from moment-relaxation methods in both simulations and the bed-net subsidy application.
- The formulation extends to covariates, continuous instruments, and general treatment settings.
Where Pith is reading between the lines
- The separability into one-dimensional problems may allow similar optimal-transport reductions in other partial-identification settings that involve latent variables and conditional distributions.
- The closed-form nature of the bounds could be used to optimize the choice of instruments or policies that maximize the width of the identified set for a target policy effect.
- Integrating flexible machine-learning models for the conditional distributions inside the transport step could further improve finite-sample performance beyond the current DML procedure.
Load-bearing premise
The data are generated by a generalized Roy model in which the instrument produces only limited variation in treatment propensity.
What would settle it
Simulate data from the generalized Roy model with a binary instrument that shifts propensity only between 0.3 and 0.7, compute the true PRTE, and verify whether it always lies inside the closed-form bounds obtained from the observed conditional distributions; systematic violations outside those bounds would falsify sharpness.
Figures
read the original abstract
Policy-Relevant Treatment Effects (PRTEs) are generally not point-identified under standard Instrumental Variable (IV) assumptions when the instrument generates limited support in treatment propensity. We show that PRTE partial identification in the generalized Roy model can instead be formulated as a Constrained Conditional Optimal Transport (CCOT) problem over the joint conditional law of the potential outcome and the latent resistance. The resulting multidimensional CCOT problem reduces analytically to separable one-dimensional OT problems with product costs, yielding sharp closed-form bounds and avoiding direct solution of the original high-dimensional CCOT problem. We also develop estimation and inference procedures for these bounds: for discrete instruments, we use a Double Machine Learning (DML) approach based on Neyman-orthogonal scores that accommodates high-dimensional covariates while achieving the parametric $\sqrt{n}$ rate and asymptotic normality; for continuous instruments, we explicitly characterize the corresponding nonparametric convergence rates. The framework accommodates covariates, discrete and continuous instruments, and extensions to general treatment settings. In simulations and a bed-net subsidy application, the resulting bounds are substantially tighter than the moment-relaxation method.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript claims that partial identification of Policy-Relevant Treatment Effects (PRTEs) under the generalized Roy model with instrumental variables (when the instrument has limited support in treatment propensity) can be reformulated as a Constrained Conditional Optimal Transport (CCOT) problem over the joint conditional law of potential outcomes and latent resistance. This multidimensional CCOT reduces analytically to separable one-dimensional OT problems with product costs, delivering sharp closed-form bounds without directly solving the high-dimensional problem. The authors develop Double Machine Learning estimators for discrete instruments that achieve sqrt(n) rates and asymptotic normality, characterize nonparametric rates for continuous instruments, accommodate covariates and general treatments, and show tighter bounds than moment-relaxation methods in simulations and a bed-net subsidy application.
Significance. If the central claims hold, the work provides a meaningful advance in partial identification for policy-relevant parameters by leveraging optimal transport to obtain tractable, sharp bounds that are otherwise difficult to compute. The analytical separability result is a clear strength, as is the integration with Neyman-orthogonal DML scores for feasible estimation under high-dimensional covariates. The framework's flexibility across instrument types and its empirical demonstration of improved bound tightness add practical value to the causal inference and econometrics literature.
minor comments (3)
- The abstract and introduction would benefit from a brief explicit statement of the closed-form expression for the bounds (or at least their functional dependence on the marginals) to allow readers to immediately assess sharpness without consulting the full derivation.
- Notation for the latent resistance variable and the product-cost structure in the CCOT formulation should be introduced with a short notational table or explicit definition early in the identification section to improve readability for readers less familiar with optimal transport.
- In the simulation section, the comparison to the moment-relaxation method would be strengthened by reporting the width of the bounds or a normalized tightness metric in addition to qualitative statements.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of the manuscript and the recommendation for minor revision. The summary accurately captures the core contribution of reformulating PRTE partial identification as a CCOT problem that reduces to separable one-dimensional OT problems, along with the associated estimation procedures.
Circularity Check
No significant circularity; derivation is self-contained via external OT theory
full rationale
The paper formulates PRTE partial identification as a CCOT problem over the joint law of potential outcomes and latent resistance, then applies standard properties of optimal transport (specifically, separability under product costs) to reduce the multidimensional problem to independent one-dimensional OT problems. This reduction is a direct mathematical consequence of the product-cost structure and the limited-support IV assumption, not a redefinition or fit of the target quantity itself. No parameters are estimated from the target bounds and then relabeled as predictions; no load-bearing uniqueness theorem or ansatz is imported via self-citation; and the subsequent DML estimation follows conventional semiparametric theory. The central claim therefore rests on independent mathematical structure and external OT results rather than reducing to its own inputs by construction.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Generalized Roy model for potential outcomes and latent resistance
- domain assumption Instrumental variable assumptions with limited treatment propensity support
Reference graph
Works this paper leans on
-
[1]
Also available as arXiv:2202.10665
URL https://proceedings.mlr.press/v177/guo22a/guo22a.pdf. Also available as arXiv:2202.10665. Peter Hall, Rodney C. L. Wolff, and Qiwei Yao. Methods for estimating a conditional distribution function.Journal of the American Statistical Association, 94(445):154–163, 1999. Sukjin Han and Shenshen Yang. A computational approach to identification of treatment...
-
[2]
Similarly, P(Y∈ · |W = 0, Z = z) = δ0(·)forces π0(· |u ) = δ0 for almost all u∈ (1/4, 1)
Because the conditional distribution of the observed outcome is a Dirac measure at zero, P(Y∈ · |W = 1, Z = z) = δ0(·)for each z, the observational constraint forcesπ1(· |u ) = δ0 for almost allu∈ (0, 3/4). Similarly, P(Y∈ · |W = 0, Z = z) = δ0(·)forces π0(· |u ) = δ0 for almost all u∈ (1/4, 1). Since ω is supported on(1/4, 1/2) ⊂ (0, 3/4) ∩ (1/4, 1), bot...
work page 2018
-
[3]
Z 1 0 1 u∈ U id(x, w) Eobs[Y|w, Z=z u, X]ω(w, X, u) du # dw + Z W EX
The resulting bounds[−M/2, M/ 2]are arbitrarily wider than the sharp value of0. By strictly enforcing the full distributional constraint, the CCOT approach eliminates these mathematically feasible but structurally impossible counterfactuals. Figure 5 illustrates this gap numerically. 64 C Proofs in Section 4 Proof of Theorem 4.2.Let {eπw,x}w∈W,x∈X ∈ Γgen(...
work page 1996
-
[4]
−γ full,k(X)1(j = 0), where theγfull,k(X)term collects the "full" contribution acting on the entire interval andγj,k(X)collects the additional sub-interval mass. Multiplying by QY,k|X and integrating over each sub-interval[κj,k(X), κj+1,k(X)]in the normalized scale u= (v−p k(X))/(p k+1(X)−p k(X))yields (pk+1(X)−p k(X)) Z 1 0 QY,k|X (u)Q ω,k|X (1−u) du=γ f...
-
[5]
Level Sets:By Theorem 5.2, the true score levels are separated by at least cgap. We first compute the empirical marginal averages over the auxiliary sample:¯p(z) = 1 |I1| P i∈I1bp(z, Xi)and ¯q(z) = 1 |I1| P i∈I1bq(z, Xi). Let V = {¯p(z)}z∈Z ∪ {¯q(z)}z∈Z. We group the elements inV to disjoint subsetsV1,· · ·, V E such that the radius of each subset is less...
-
[6]
Instrument Assignment:Estimate πk(x) = Pobs(Z∈S k |X = x)by regressing 1(Z∈ bSk)on X. Similarly, estimate the weighting probabilitiesγj,k(x)and γK(x)by regressing their respective set indicators onX
-
[7]
Conditional Quantiles:By definition, νj,k(x) = QY,k|x(κj,k(x))is the κj,k(x)-quantile of the conditional distributionµ1,k|x, where µ1,k|x is identified in (3.4). Applying the quantile characterization Pµ1,k|x(Y⩽ν j,k(x)) = κj,k(x) = qj,k(x)−pk(x) pk+1(x)−pk(x) and expanding Pµ1,k|x in terms of the observed conditional distributions yields the following mo...
-
[8]
PropensityScoresandLevelSets:Estimate p(z, x), computebq(z, x) = ϕ(z, x,bp(z, x)), and construct the level setsbSk, bTj,k exactly as in the continuous outcome procedure (Steps 1–3 in Section D.3)
-
[9]
Similarly, estimateγfull,k(x), γj,k(x), and γK(x)by regressing their respective set indicators onX
Conditional Joint Probability:Estimate P1,k(x) = E[Y W|Z∈S k, X = x]by regressing Y W on X conditional on Z∈ bSk, using any flexible machine learning method. Similarly, estimateγfull,k(x), γj,k(x), and γK(x)by regressing their respective set indicators onX
-
[10]
Z p(X) p(X) ∂g1(u, X) ∂u ω(X, u) du # =E Z,X
Rearrangement:In the population, P1,k(X)is monotone non-decreasing in k and the gaps are bounded byP1,k+1(X) −P 1,k(X) ⩽p k+1(X) −p k(X), since Jfull,k(X) = (pk+1(X)−p k(X))θ k(X)withθ k(X)∈[0,1]. However, because each bP1,k is estimated on a disjoint subsample (Z∈ bSk) by a separate regression, the finite-sample estimates need not preserve either constra...
work page 2009
-
[11]
the integration boundarypk+1 −J full,k, corrected byα(j,k) p,k+1Rp,k+1
-
[12]
the lower level-set boundaryq0,k =p k whenj= 1, corrected byα (1,k) p,k Rp,k
-
[13]
the alternative policy values qj,k and qj−1,k through the mapping ϕ(z, X, p(z, X)), corrected by the policy terms involvingTj,k andT j−1,k. 92 For channel (a), a perturbationpk+1 →p k+1 + t δpk+1(X)shifts the boundary by δpk+1, giving ∂th− j,k = −D− j,k δpk+1. The correction α(j,k) p,k+1Rp,k+1 = −D− j,k · 1(Z∈Sk+1) πk+1 (W−p k+1) contributes∂ tE[α(j,k) p,...
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.