Fast Conditional Independence Test for Vector Variables with Large Sample Sizes

Frederick Eberhardt; Krzysztof Chalupka; Pietro Perona

arxiv: 1804.02747 · v1 · pith:OWPYYOQ6new · submitted 2018-04-08 · 📊 stat.ML · cs.AI· cs.LG· stat.OT

Fast Conditional Independence Test for Vector Variables with Large Sample Sizes

Krzysztof Chalupka , Pietro Perona , Frederick Eberhardt This is my paper

classification 📊 stat.ML cs.AIcs.LGstat.OT

keywords independencetestconditionalavailableevaluationfastnonparametricsamples

0 comments

read the original abstract

We present and evaluate the Fast (conditional) Independence Test (FIT) -- a nonparametric conditional independence test. The test is based on the idea that when $P(X \mid Y, Z) = P(X \mid Y)$, $Z$ is not useful as a feature to predict $X$, as long as $Y$ is also a regressor. On the contrary, if $P(X \mid Y, Z) \neq P(X \mid Y)$, $Z$ might improve prediction results. FIT applies to thousand-dimensional random variables with a hundred thousand samples in a fraction of the time required by alternative methods. We provide an extensive evaluation that compares FIT to six extant nonparametric independence tests. The evaluation shows that FIT has low probability of making both Type I and Type II errors compared to other tests, especially as the number of available samples grows. Our implementation of FIT is publicly available.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Fast Nonparametric Conditional Independence Testing via Two-Stage Regression
stat.ML 2026-06 unverdicted novelty 6.0

BLITZ introduces a two-stage broad-to-local residualization method for fast nonparametric conditional independence testing with improved calibration over kernel and regression competitors.
Multiscale Cochran-Mantel-Haenszel Scanning for Conditional Dependency
stat.ME 2026-04 unverdicted novelty 6.0

Multiscale CMH scanning generalizes the classic test to continuous spaces, achieving consistency for conditional independence testing by conditioning on marginal order statistics without requiring large stratum sizes.