Causal Machine Learning: A Survey and Open Problems

Aengus Lynch; Jean Kaddour; Matt J. Kusner; Qi Liu; Ricardo Silva

arxiv: 2206.15475 · v3 · pith:IOZCAKOHnew · submitted 2022-06-30 · 💻 cs.LG · stat.ME

Causal Machine Learning: A Survey and Open Problems

Jean Kaddour , Aengus Lynch , Qi Liu , Matt J. Kusner , Ricardo Silva This is my paper

classification 💻 cs.LG stat.ME

keywords causallearningmachineproblemscausalmlmethodsopenprocess

0 comments

read the original abstract

Causal Machine Learning (CausalML) is an umbrella term for machine learning methods that formalize the data-generation process as a structural causal model (SCM). This perspective enables us to reason about the effects of changes to this process (interventions) and what would have happened in hindsight (counterfactuals). We categorize work in CausalML into five groups according to the problems they address: (1) causal supervised learning, (2) causal generative modeling, (3) causal explanations, (4) causal fairness, and (5) causal reinforcement learning. We systematically compare the methods in each category and point out open problems. Further, we review data-modality-specific applications in computer vision, natural language processing, and graph representation learning. Finally, we provide an overview of causal benchmarks and a critical discussion of the state of this nascent field, including recommendations for future work.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Advancing Edge Classification through High-Dimensional Causal Modeling of Node-Edge Interplay
cs.LG 2026-05 unverdicted novelty 7.0

CECF is a new causal framework for edge classification that balances high-dimensional edge features against node influences via GNN embeddings and cross-attention to achieve better performance than standard methods.
From Weight Perturbation to Feature Attribution for Explaining Fully Connected Neural Networks
cs.LG 2026-05 unverdicted novelty 6.0

XWP and XWP_c are novel attribution methods for FCNNs that estimate feature importance by perturbing attached weights to avoid added bias and out-of-distribution issues in occlusion approaches.
Predictive and Prescriptive AI toward Optimizing Wildfire Suppression
math.OC 2026-05 unverdicted novelty 6.0

A new optimization algorithm with double machine learning for wildfire spread estimation enables better crew assignments that reduce total area burned.
Tabular Foundation Model for Generative Modelling
cs.LG 2026-05 unverdicted novelty 5.0

TabFORGE generates high-quality synthetic tabular data by leveraging pretrained causality-aware representations in a two-stage diffusion-decoder architecture that mitigates latent distribution shifts.