Towards a Learning Theory of Cause-Effect Inference

Bernhard Schölkopf; David Lopez-Paz; Ilya Tolstikhin; Krikamol Muandet

arxiv: 1502.02398 · v2 · pith:BCRRIMGUnew · submitted 2015-02-09 · stat.ML · math.PR · math.ST · stat.TH


David Lopez-Paz, Krikamol Muandet, Bernhard Schölkopf, Ilya Tolstikhin

classification stat.ML math.PR math.ST stat.TH

keywords causal, inference, learning, binary, cause-effect, kernel, probability, access



We pose causal inference as the problem of learning to classify probability distributions. In particular, we assume access to a collection $\{(S_i,l_i)\}_{i=1}^n$, where each $S_i$ is a sample drawn from the probability distribution of $X_i \times Y_i$, and $l_i$ is a binary label indicating whether "$X_i \to Y_i$" or "$X_i \leftarrow Y_i$". Given these data, we build a causal inference rule in two steps. First, we featurize each $S_i$ using the kernel mean embedding associated with some characteristic kernel. Second, we train a binary classifier on such embeddings to distinguish between causal directions. We present generalization bounds showing the statistical consistency and learning rates of the proposed approach, and provide a simple implementation that achieves state-of-the-art cause-effect inference. Furthermore, we extend our ideas to infer causal relationships between more than two variables.
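The two-step rule described in the abstract (embed each sample, then classify the embeddings) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a Gaussian kernel whose mean embedding is approximated with random Fourier features, a plain logistic-regression classifier trained by stochastic gradient descent, and synthetic toy data. The names `make_features`, `embed`, `train_logistic`, `predict`, and `toy_sample` are hypothetical, and the code uses only the Python standard library to stay dependency-free.

```python
import math
import random


def make_features(dim, m, gamma, rng):
    """Draw m random Fourier features for the Gaussian kernel
    k(z, z') = exp(-gamma * ||z - z'||^2) (an assumed kernel choice)."""
    scale = math.sqrt(2.0 * gamma)  # spectral density is N(0, 2 * gamma * I)
    ws = [[rng.gauss(0.0, scale) for _ in range(dim)] for _ in range(m)]
    bs = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(m)]
    return ws, bs


def embed(sample, ws, bs):
    """Step 1: approximate kernel mean embedding of a sample S_i,
    i.e. the average random-feature vector over its points."""
    m = len(ws)
    c = math.sqrt(2.0 / m)
    mu = [0.0] * m
    for z in sample:
        for j in range(m):
            proj = sum(w * v for w, v in zip(ws[j], z)) + bs[j]
            mu[j] += c * math.cos(proj)
    return [v / len(sample) for v in mu]


def sigmoid(t):
    """Numerically stable logistic function."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def train_logistic(X, y, epochs=100, lr=0.5):
    """Step 2: fit a binary classifier on the embeddings (plain SGD)."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b)
            g = p - yi  # gradient of the log-loss w.r.t. the logit
            w = [wj - lr * g * xj for wj, xj in zip(w, xi)]
            b -= lr * g
    return w, b


def predict(w, b, x):
    """Return 1 for "X -> Y" and 0 for "X <- Y"."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else 0


def toy_sample(label, n, rng):
    """Synthetic pair (assumption): effect = cause^3 + noise;
    the columns are swapped when label == 0."""
    pts = []
    for _ in range(n):
        cause = rng.gauss(0.0, 1.0)
        effect = cause ** 3 + 0.1 * rng.gauss(0.0, 1.0)
        pts.append((cause, effect) if label == 1 else (effect, cause))
    return pts


if __name__ == "__main__":
    rng = random.Random(0)
    ws, bs = make_features(dim=2, m=50, gamma=1.0, rng=rng)
    labels = [i % 2 for i in range(40)]
    X = [embed(toy_sample(l, 50, rng), ws, bs) for l in labels]
    w, b = train_logistic(X, labels)
    test = embed(toy_sample(1, 50, rng), ws, bs)
    print("predicted label:", predict(w, b, test))
```

Every sample is mapped to a fixed-length vector regardless of its size, which is what lets an ordinary binary classifier operate on distributions. The kernel bandwidth `gamma`, the number of features `m`, and the classifier are all illustrative choices.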



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

TCD-Arena: Assessing Robustness of Time Series Causal Discovery Methods Against Assumption Violations
cs.LG · 2026-05 · unverdicted · novelty 7.0

TCD-Arena is a new customizable testing framework that runs millions of experiments to map how 33 different assumption violations affect time series causal discovery methods, and it shows that ensembles can boost overall robustness.