Optimal Experiments for Partial Causal Effect Identification
Pith reviewed 2026-05-11 01:41 UTC · model grok-4.3
The pith
Selecting a cost-limited set of experiments can maximize the guaranteed reduction in bound width on a partially identifiable causal query.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We formalize the selection of a cost-constrained subset of experiments that maximally tightens bounds on a target causal query as the max-potency problem, where epistemic potency measures the worst-case reduction in bound width guaranteed by an experiment. The problem is shown to be NP-hard via a reduction from 0-1 knapsack. A general procedure evaluates epistemic potency in discrete settings by building on the polynomial-programming framework of Duarte et al. (2023). Two graphical pruning criteria that depend only on the causal graph and the query, a novel path-interception rule and an identifiability check based on the ID algorithm, together prune 50-88 percent of candidate experiments on
What carries the argument
Epistemic potency, the worst-case reduction in the width of bounds on the target causal query that is guaranteed by performing a given experiment, which is computed via polynomial programming and pruned using path-interception and ID-algorithm checks on the known causal graph.
If this is right
- The two graphical pruning criteria together certify zero potency for 50-88 percent of candidate experiments on average in Erdős-Rényi graphs and bnlearn networks without solving any polynomial program.
- Experiments that fail the ID-algorithm identifiability check are combinatorially inert, yielding a super-exponential reduction in the number of subsets that must be evaluated.
- Path interception through district structure certifies zero potency for many experiments in linear time using only the causal graph and query.
- The method produces a concrete recommendation of experiments on observational NHANES data for the effect of physical activity on diabetes.
Where Pith is reading between the lines
- If the assumed causal graph contains errors, the computed potencies and the selected experiments may no longer be optimal.
- The same potency definition could be used to rank experiments when the graph itself is uncertain by averaging over a posterior over graphs.
- Approximation algorithms for the knapsack reduction could be combined with the pruning rules to handle larger numbers of candidate experiments.
Load-bearing premise
The causal graph is known and correctly specified, and the setting is discrete so the polynomial-programming procedure applies directly.
What would settle it
An exhaustive enumeration on a small causal graph and query where the subset returned by the pruned max-potency procedure differs from the true optimum found by checking every feasible subset.
Figures
read the original abstract
Causal queries are often only partially identifiable from observational data, and experiments that could tighten the resulting bounds are typically costly. We study the problem of selecting, prior to observing experimental outcomes, a cost-constrained subset of experiments that maximally tightens bounds on a target query. We formalize this as the max-potency problem, where epistemic potency measures the worst-case reduction in bound width guaranteed by an experiment, and show that this problem is NP-hard via a reduction from 0-1 knapsack. Building on the polynomial-programming framework of Duarte et al. (2023), we give a general procedure for evaluating epistemic potency in discrete settings. To control the super-exponential search space, we introduce two graphical pruning criteria that depend only on the causal graph and the query: a novel path-interception rule that exploits district structure to certify zero potency in linear time, and an identifiability check based on the ID algorithm. On Erdos-Renyi random graphs and 11 bnlearn benchmark networks, the two criteria together prune 50-88% of candidate experiments on average without solving a single polynomial program. For the general subset search, we show that ID-pruned experiments are combinatorially inert, yielding a super-exponential reduction in the number of subsets evaluated. We close with an end-to-end demonstration on observational NHANES data, selecting optimal experiments for estimating the effect of physical activity on diabetes.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript studies selection of a cost-constrained subset of experiments that maximally tightens bounds on a partially identifiable causal query. It formalizes the max-potency problem (epistemic potency as worst-case bound-width reduction), proves NP-hardness via reduction from 0-1 knapsack, adapts the Duarte et al. (2023) polynomial-programming framework to evaluate potency in discrete settings, introduces two graph-only pruning rules (path-interception exploiting district structure and an ID-algorithm identifiability check) that prune 50-88% of candidates on Erdos-Renyi and bnlearn graphs, shows that ID-pruned experiments are combinatorially inert (yielding super-exponential subset reduction), and demonstrates the pipeline on NHANES data for the effect of physical activity on diabetes.
Significance. If the results hold, the work offers a principled, computationally tractable approach to optimal experimental design for tightening partially identified causal effects. The NP-hardness result, the two sound graphical pruning criteria that require no polynomial-program solves, the combinatorial-inertness observation, and the end-to-end real-data demonstration are concrete strengths that advance both theory and practice in causal inference and experimental design.
major comments (2)
- [NP-hardness reduction] The NP-hardness reduction (abstract and the section presenting the max-potency problem): the explicit mapping from knapsack instances to experiments and the correspondence between potency values and knapsack objective must be stated formally so that the reduction can be verified as polynomial-time and correct; without this the complexity claim remains uncheckable.
- [Pruning criteria] Section introducing the path-interception pruning rule: the linear-time certification of zero potency via district structure is load-bearing for the reported 50-88% pruning rates; a self-contained soundness argument (or reference to a prior lemma) is required, as any counter-example would invalidate the pruning percentages and the subsequent combinatorial-inertness claim.
minor comments (3)
- [Combinatorial inertness paragraph] The term 'combinatorial inertness' is used without an explicit definition; add a short formal definition or property when first introduced.
- [Real-data demonstration] The NHANES demonstration would benefit from an explicit statement of the target causal query, the chosen cost budget, and the exact experiments selected by the algorithm.
- [Empirical evaluation] Tables reporting pruning percentages on the 11 bnlearn networks should include per-network values or at least min/max ranges rather than only averages.
Simulated Author's Rebuttal
We thank the referee for the positive assessment, the recommendation of minor revision, and the constructive comments that help clarify the presentation. We address each major comment below and will revise the manuscript to incorporate the requested formal details.
read point-by-point responses
-
Referee: [NP-hardness reduction] The NP-hardness reduction (abstract and the section presenting the max-potency problem): the explicit mapping from knapsack instances to experiments and the correspondence between potency values and knapsack objective must be stated formally so that the reduction can be verified as polynomial-time and correct; without this the complexity claim remains uncheckable.
Authors: We agree that an explicit, formal statement of the reduction is required for independent verification. In the revised manuscript we will add a dedicated subsection that defines the polynomial-time mapping: given any 0-1 knapsack instance with n items (weights w_i, values v_i, capacity W), we construct a causal graph G, a target query Q, and a set of candidate experiments such that each item i corresponds to one experiment e_i with cost exactly w_i, and the epistemic potency of any subset S of experiments equals the knapsack objective value of the corresponding items. We will prove that the mapping is computable in time polynomial in the knapsack input size and that an optimal max-potency solution corresponds exactly to an optimal knapsack solution. This will make the NP-hardness claim directly checkable. revision: yes
-
Referee: [Pruning criteria] Section introducing the path-interception pruning rule: the linear-time certification of zero potency via district structure is load-bearing for the reported 50-88% pruning rates; a self-contained soundness argument (or reference to a prior lemma) is required, as any counter-example would invalidate the pruning percentages and the subsequent combinatorial-inertness claim.
Authors: We thank the referee for emphasizing the centrality of this argument. The manuscript currently sketches the path-interception rule using district structure and the fact that experiments failing to intercept all relevant paths from intervention to query cannot tighten the bounds. To address the request for a self-contained argument, the revision will insert a new lemma immediately preceding the pruning section. The lemma states that, for a query Q whose identifying expression factors over districts, any experiment that does not intercept every path from its intervention node to the query nodes within the relevant districts has epistemic potency exactly zero; the proof relies only on the definition of epistemic potency, the district factorization, and the properties of the ID algorithm, without invoking polynomial-program solves. This will support the reported pruning rates and the combinatorial-inertness observation. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper's core contributions—the formalization of the max-potency problem, the NP-hardness reduction from 0-1 knapsack, the two novel graphical pruning criteria (path-interception exploiting district structure and ID-algorithm identifiability check), and the combinatorial inertness argument—are self-contained and do not reduce to definitions or quantities internal to the paper. The polynomial-programming evaluation procedure is explicitly adapted from the external Duarte et al. (2023) framework, which is cited as an independent black-box tool rather than a self-referential premise. No self-definitional loops, fitted inputs renamed as predictions, load-bearing self-citations, or smuggled ansatzes appear in the derivation chain. The reported pruning rates and NHANES demonstration follow directly once the external framework and the soundness of the new pruning rules are granted.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption The causal graph is known and correctly specified.
- domain assumption The setting is discrete.
Reference graph
Works this paper leans on
-
[1]
doi: 10.1111/j.0006-341X.2002.00021.x. S. Greenland and J. M. Robins. Identifiability, exchangeability, and epidemiological confounding. International Journal of Epidemiology, 15(3):413–419, 09 1986. ISSN 0300-5771. doi: 10.1093/ ije/15.3.413. URLhttps://doi.org/10.1093/ije/15.3.413. Y . Kivva, E. Mokhtarian, J. Etesami, and N. Kiyavash. Revisiting the ge...
-
[2]
Graph.Let G be the disjoint union of n TP-cells C1, . . . ,Cn. The observed variables are V={X 1, Y1, . . . , Xn, Yn}
-
[3]
Observational distribution.Choose per-cell distributions Pi(Xi, Yi) such that poti(ai) = vi, where ai =P(y i,xi). Constructive achievability is established in Lemma C.3 below; if necessary, rescale the values vi so that they lie in the achievable range. Set P(V) =Qn i=1 Pi(Xi, Yi). 3.Query.Let θ= nX i=1 PNSi = nX i=1 P(y i,xi , y′ i,x′ i )
-
[4]
Experiment set.Let A={a 1, . . . ,an} with ai =P(y i,xi). For any S⊆ {1, . . . , n} , we writea S :={a i :i∈S}
-
[5]
, n} , and use the original budgetB
Cost function and budget.Set c(aS) =P i∈S wi for S⊆ {1, . . . , n} , and use the original budgetB. The construction is polynomial in n: each TP-cell is a constant-size object, and Lemma C.3 produces the requiredP i in time polynomial in the bit-length ofv i. 13 C.3 Additivity of potency across cells The key technical claim is that the potency of any subse...
work page 1996
-
[6]
PickY (1) uniformly fromV; pickX (1) uniformly fromanc(Y (1))\ {Y (1)}
-
[7]
24 H.3 Candidate Experiment Sampling Each a∈A is sampled independently
For the second world, pick Y (2) uniformly from V subject to X(1) ∈anc(Y (2)), then set X(2) =X (1). 24 H.3 Candidate Experiment Sampling Each a∈A is sampled independently. With probability 0.3, a has two counterfactual worlds; otherwise one
-
[8]
Sample|W (1)|,|Z (1)| ∼Unif{1,2,3}
-
[9]
PickW (1) ⊆Vuniformly; pickZ (1) ⊆anc(W (1))\W (1) uniformly
-
[10]
Remaining variables fill fromVandanc(W (2))\W (2), respectively
For two-world experiments, sample W (2),Z (2) analogously, but seed each with a shared element: pick w∈W (1) uniformly and require w∈W (2); pick z∈Z (1)∩(anc(W (2))\W (2)) uniformly and requirez∈Z (2). Remaining variables fill fromVandanc(W (2))\W (2), respectively. H.4 Erd ˝os–Rényi Graphs Each ER graph has a fixed topological order V1 <· · ·< V N . Dire...
-
[11]
Add a forced confounder betweenX (1) andY (1)
-
[12]
Draw a target ratior∼Unif(0.1,0.9), clamped below by1/|V|for achievability
-
[13]
physical activity is protective
Add confounders between uniformly random pairs of observed variables until the total reaches ⌊r· |V|⌋. With100simulations per graph, this yields11×100 = 1,100simulations. H.6 Compute Used All experiments were executed on a single Microsoft Surface Laptop Studio with an 11th Gen Intel Core i7-11370H CPU (3.30 GHz base) and 32 GB RAM. No GPU was used and no...
work page 2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.