PAIR-CI: Calibrated Conditional Independence Testing for Causal Discovery with Incomplete Data
Pith reviewed 2026-05-08 16:54 UTC · model grok-4.3
The pith
PAIR-CI restores proper calibration to conditional independence tests for causal discovery when data have missing values by integrating imputation into a paired permutation procedure.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
PAIR-CI is a nonparametric conditional independence test that integrates multiple imputation directly into the inferential procedure via a paired permutation design, forcing imputation error to cancel in the loss difference between cross-validated models that include versus exclude the candidate variable, together with a provably consistent variance estimator that jointly accounts for uncertainty from cross-validation and multiple imputation.
What carries the argument
The paired permutation design, in which the same imputed conditioning sets are used for both the model that includes the candidate variable and the one that excludes it, so that imputation uncertainty subtracts out when comparing their cross-validated losses.
If this is right
- Existing imputation-based CI tests show false positive rates of 28-45% under MNAR, while PAIR-CI stays below 5%.
- Integrating PAIR-CI into the PC algorithm reduces structural Hamming distance by 8% on 10-variable nonlinear graphs, 15% on 30-variable graphs, and up to 44% on the 56-variable HAILFINDER network.
- Performance remains stable across data-generating processes and missingness mechanisms, including MNAR.
- The approach provides the first formal unification of cross-validation and multiple imputation uncertainty in a single consistent variance estimator.
Where Pith is reading between the lines
- Similar paired designs could be applied to other testing problems where imputation introduces bias, such as regression with missing covariates.
- If the cancellation of imputation error holds more generally, it could improve calibration in other causal discovery algorithms that rely on CI tests.
- The variance estimator might extend to settings with different imputation models beyond the ones simulated.
Load-bearing premise
The paired permutation design makes imputation error cancel exactly in the loss difference between the two cross-validated models.
What would settle it
A dataset or simulation under MNAR missingness where the false positive rate of PAIR-CI exceeds the nominal level or where the variance estimator fails to be consistent.
Figures
read the original abstract
The standard constraint-based paradigm for causal discovery with incomplete data -- impute first, test second -- is frequently miscalibrated: any consistent conditional independence (CI) test rejects a true null with probability approaching 1 when imputation error induces spurious conditional dependence. We introduce PAIR-CI, a nonparametric CI test that restores calibration by integrating multiple imputation directly into the inferential procedure via a paired permutation design. PAIR-CI compares cross-validated models that include and exclude the candidate variable while receiving the same imputed conditioning set, forcing imputation error to cancel in their loss difference rather than contaminate the test statistic. A provably consistent variance estimator jointly accounts for uncertainty arising from cross-validation and multiple imputation -- to our knowledge, the first formal unification of these two inferential frameworks. In simulations, existing imputation-based CI tests exhibit false positive rates of 28--45% when data are missing not at random (MNAR), whereas PAIR-CI averages below the nominal 5% level across data-generating processes and missingness mechanisms. These gains are largest in nonlinear settings and grow with causal graph size: when integrated into the PC algorithm, PAIR-CI reduces structural Hamming distance by 8% on 10-variable nonlinear graphs, 15% on 30-variable equivalents, and up to 44% on the 56-variable HAILFINDER network, with stable performance in all settings.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces PAIR-CI, a nonparametric conditional independence test designed for causal discovery with incomplete data. It addresses miscalibration in the standard 'impute first, test second' approach by using a paired permutation design that integrates multiple imputation, ensuring imputation errors cancel in the loss difference between cross-validated models that include and exclude the candidate variable. The method includes a provably consistent variance estimator accounting for both cross-validation and imputation uncertainty, and simulations demonstrate improved false positive control under various missingness mechanisms, including MNAR, as well as better performance in the PC algorithm for graph recovery.
Significance. Should the theoretical guarantees on calibration and consistency hold, this work offers a meaningful contribution to causal inference under missing data by providing a calibrated nonparametric test that avoids the pitfalls of separate imputation and testing steps. The reported simulation improvements, particularly the reduction in false positives from 28-45% to under 5% and SHD reductions up to 44% on larger graphs, indicate potential for enhancing reliability in practical applications of constraint-based causal discovery. The formal unification of cross-validation and multiple imputation inference frameworks is a notable strength.
major comments (2)
- [Abstract and §3] Abstract and §3: The claim that the paired permutation design forces imputation error to cancel exactly in the loss difference (thereby keeping the test statistic asymptotically pivotal under the null even for MNAR) is load-bearing for the calibration result. However, this appears to treat imputed values as fixed; under misspecified imputation models or MNAR depending on unobserved variables, the residual dependence may not cancel symmetrically, potentially invalidating the pivotality. Explicit assumptions or a detailed proof addressing this case are needed.
- [§4] §4: The consistency proof for the variance estimator, which jointly accounts for uncertainty from cross-validation and multiple imputation, relies on the same paired structure and cancellation property. If the imputation error is not mean-zero conditional on the observed data under general MNAR, the estimator's consistency may not hold, undermining the formal unification claim.
minor comments (1)
- The abstract mentions specific performance gains but lacks details on the number of simulations or exact parameter settings; adding these would improve reproducibility.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments. We appreciate the recognition of PAIR-CI's potential contribution. We address each major comment below with clarifications on the paired design and indicate revisions to strengthen the theoretical presentation.
read point-by-point responses
-
Referee: [Abstract and §3] Abstract and §3: The claim that the paired permutation design forces imputation error to cancel exactly in the loss difference (thereby keeping the test statistic asymptotically pivotal under the null even for MNAR) is load-bearing for the calibration result. However, this appears to treat imputed values as fixed; under misspecified imputation models or MNAR depending on unobserved variables, the residual dependence may not cancel symmetrically, potentially invalidating the pivotality. Explicit assumptions or a detailed proof addressing this case are needed.
Authors: The paired permutation design applies the identical imputed values for the conditioning set to both the model including the candidate variable and the model excluding it. Consequently, any imputation error (whether from misspecification or MNAR depending on unobserved variables) enters the two loss functions in exactly the same way and subtracts out in their difference. Under the null, the expected loss difference is therefore driven solely by the candidate variable, preserving asymptotic pivotality. This cancellation does not require the imputation error to be mean-zero or the model to be correctly specified; it requires only that the same imputations are used for the paired losses. We will add an explicit assumption on identical application of the imputation procedure and include a detailed proof in the revised §3 and a new appendix establishing the result for general MNAR. revision: yes
-
Referee: [§4] §4: The consistency proof for the variance estimator, which jointly accounts for uncertainty from cross-validation and multiple imputation, relies on the same paired structure and cancellation property. If the imputation error is not mean-zero conditional on the observed data under general MNAR, the estimator's consistency may not hold, undermining the formal unification claim.
Authors: The variance estimator targets the variance of the paired loss difference. Because the imputation error is identical across the pair, it cancels in the difference; the estimator therefore consistently recovers the remaining variability from cross-validation and the multiple imputations. Consistency holds conditionally on the observed data even when the imputation error is not mean-zero, provided the paired structure is maintained. We will revise §4 to state this conditioning explicitly, derive the consistency result under general MNAR, and clarify the scope of the unification between cross-validation and multiple-imputation inference. revision: yes
Circularity Check
No significant circularity; derivation self-contained
full rationale
The paper's central construction defines PAIR-CI via a paired permutation design that shares the imputed conditioning set across include/exclude cross-validated models, then asserts that this forces imputation error to cancel in the loss difference (abstract). A separate variance estimator is introduced and labeled 'provably consistent' under the same structure. No quoted step reduces a claimed prediction or consistency result to a fitted parameter by construction, nor does any load-bearing premise rest on a self-citation whose content is unverified. The unification of cross-validation and multiple imputation is presented as a new formal step whose validity is internal to the paper's proofs and simulations rather than imported or renamed from prior author work. This meets the default expectation of a non-circular methodological contribution.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Standard nonparametric consistency assumptions for conditional independence tests and multiple imputation under the missingness mechanisms considered.
Reference graph
Works this paper leans on
-
[1]
Zheng, Xun and Aragam, Bryon and Ravikumar, Pradeep K. and Xing, Eric P , journal=
-
[2]
Zheng, Xun and Dan, Chen and Aragam, Bryon and Ravikumar, Pradeep and Xing, Eric , booktitle=. 2020 , organization=
work page 2020
-
[3]
Abramson, Bruce and Brown, John and Edwards, Ward and Murphy, Allan and Winkler, Robert L , journal=. 1996 , publisher=
work page 1996
-
[4]
Beinlich, Ingo A and Suermondt, Henri Jacques and Chavez, R Martin and Cooper, Gregory F , booktitle=. 1989 , publisher=
work page 1989
-
[5]
Small-sample degrees of freedom with multiple imputation , author=. Biometrika , volume=. 1999 , publisher=
work page 1999
-
[6]
Advances in Neural Information Processing Systems , volume=
Cross-validation confidence intervals for test error , author=. Advances in Neural Information Processing Systems , volume=
-
[7]
Advances in Neural Information Processing Systems , volume=
Conditional independence testing using generative adversarial networks , author=. Advances in Neural Information Processing Systems , volume=
-
[8]
Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=
The conditional permutation test for independence while controlling for confounders , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2020 , publisher=
work page 2020
-
[9]
Random forests , author=. Machine learning , volume=. 2001 , publisher=
work page 2001
-
[10]
Journal of the Royal Statistical Society Series A: Statistics in Society , volume=
Causal discovery of gene regulation with incomplete data , author=. Journal of the Royal Statistical Society Series A: Statistics in Society , volume=. 2020 , publisher=
work page 2020
-
[11]
Extremely randomized trees , author=. Machine learning , volume=. 2006 , publisher=
work page 2006
-
[12]
International Conference on Probabilistic Graphical Models (PGM) , pages=
Structure Learning Under Missing Data , author=. International Conference on Probabilistic Graphical Models (PGM) , pages=
-
[13]
Gao, Erdun and Ng, Ignavier and Gong, Mingming and Shen, Li and Huang, Wei and Liu, Tongliang and Zhang, Kun and Bondell, Howard , journal=
-
[14]
Estimating high-dimensional directed acyclic graphs with the
Kalisch, Markus and B. Estimating high-dimensional directed acyclic graphs with the. Journal of Machine Learning Research , volume=
- [15]
-
[16]
Journal of the American Statistical Association , volume=
Graphical models for processing missing data , author=. Journal of the American Statistical Association , volume=. 2021 , publisher=
work page 2021
-
[17]
Scikit-learn: Machine learning in
Pedregosa, Fabian and Varoquaux, Ga. Scikit-learn: Machine learning in. the Journal of machine Learning research , volume=. 2011 , publisher=
work page 2011
-
[18]
Advances in Neural Information Processing Systems , volume=
Inference for the generalization error , author=. Advances in Neural Information Processing Systems , volume=
-
[19]
Annual review of psychology , volume=
Missing data analysis: Making it work in the real world , author=. Annual review of psychology , volume=. 2009 , publisher=
work page 2009
-
[20]
Inference for imputation estimators , author=. Biometrika , volume=. 2000 , publisher=
work page 2000
-
[21]
Rubin, Donald B , title =
-
[22]
Causal protein-signaling networks derived from multiparameter single-cell data , author=. Science , volume=. 2005 , publisher=
work page 2005
-
[23]
The Annals of Statistics , number =
Erwan Scornet and G. The Annals of Statistics , number =. 2015 , doi =
work page 2015
-
[24]
Advances in Neural Information Processing Systems , volume=
Model-powered conditional independence test , author=. Advances in Neural Information Processing Systems , volume=
-
[25]
The Annals of Statistics , number =
Rajen D Shah and Jonas Peters , title =. The Annals of Statistics , number =. 2020 , doi =
work page 2020
-
[26]
Spirtes, Peter and Glymour, Clark and Scheines, Richard , title =
-
[27]
Journal of Causal Inference , volume=
Approximate kernel-based conditional independence tests for fast non-parametric causal discovery , author=. Journal of Causal Inference , volume=. 2019 , publisher=
work page 2019
-
[28]
The 22nd International Conference on Artificial Intelligence and Statistics , pages=
Causal discovery in the presence of missing data , author=. The 22nd International Conference on Artificial Intelligence and Statistics , pages=. 2019 , organization=
work page 2019
-
[29]
Testing conditional independence in supervised learning algorithms , author=. Machine Learning , volume=. 2021 , publisher=
work page 2021
-
[30]
Statistics in medicine , volume=
Multiple imputation and test-wise deletion for causal discovery with incomplete cohort data , author=. Statistics in medicine , volume=. 2022 , publisher=
work page 2022
-
[31]
mice: Multivariate imputation by chained equations in
Van Buuren, Stef and Groothuis-Oudshoorn, Karin , journal=. mice: Multivariate imputation by chained equations in
-
[32]
Kernel-based conditional independence test and application in causal discovery , year =
Zhang, Kun and Peters, Jonas and Janzing, Dominik and Sch\". Kernel-based conditional independence test and application in causal discovery , year =. Proceedings of the Twenty-Seventh Conference on Uncertainty in Artificial Intelligence , pages =
- [33]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.