arxiv: 2206.15475 · v3 · pith:IOZCAKOHnew · submitted 2022-06-30 · 💻 cs.LG · stat.ME

Causal Machine Learning: A Survey and Open Problems

keywords: causal, learning, machine, problems, causalml, methods, open, process

Causal Machine Learning (CausalML) is an umbrella term for machine learning methods that formalize the data-generation process as a structural causal model (SCM). This perspective enables us to reason about the effects of changes to this process (interventions) and what would have happened in hindsight (counterfactuals). We categorize work in CausalML into five groups according to the problems they address: (1) causal supervised learning, (2) causal generative modeling, (3) causal explanations, (4) causal fairness, and (5) causal reinforcement learning. We systematically compare the methods in each category and point out open problems. Further, we review data-modality-specific applications in computer vision, natural language processing, and graph representation learning. Finally, we provide an overview of causal benchmarks and a critical discussion of the state of this nascent field, including recommendations for future work.
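
To make the three notions in the abstract concrete, here is a minimal sketch of a structural causal model with an intervention and a counterfactual query. The two-variable linear model, the coefficient 2.0, and all function names are illustrative assumptions, not taken from the survey.

```python
import random

# Hypothetical two-variable linear SCM (illustrative, not from the survey):
#   X := U_x
#   Y := 2 * X + U_y
# with independent standard-normal noise terms U_x and U_y.

def f_x(u_x):
    """Structural assignment for X."""
    return u_x

def f_y(x, u_y):
    """Structural assignment for Y."""
    return 2.0 * x + u_y

def sample(rng, do_x=None):
    """Draw one (x, y) pair. If do_x is given, X's mechanism is replaced
    by the constant do_x, which models the intervention do(X = do_x)."""
    u_x, u_y = rng.gauss(0, 1), rng.gauss(0, 1)
    x = f_x(u_x) if do_x is None else do_x
    return x, f_y(x, u_y)

def counterfactual_y(x_obs, y_obs, x_cf):
    """Counterfactual Y for one observed unit, had X been x_cf.
    Abduction: recover the unit's noise u_y from the observation.
    Action and prediction: rerun Y's mechanism with X set to x_cf."""
    u_y = y_obs - 2.0 * x_obs
    return f_y(x_cf, u_y)

rng = random.Random(0)

# Interventional query: estimate E[Y | do(X = 1)] by simulation.
ys = [sample(rng, do_x=1.0)[1] for _ in range(10_000)]
mean_y_do = sum(ys) / len(ys)

# Counterfactual query: we observed (x, y) = (0.5, 1.3);
# what would y have been had x been 1.0?
y_cf = counterfactual_y(x_obs=0.5, y_obs=1.3, x_cf=1.0)

print(f"E[Y | do(X=1)] is roughly {mean_y_do:.2f}")
print(f"Counterfactual Y is roughly {y_cf:.2f}")
```

The intervention replaces a mechanism for the whole population, while the counterfactual holds one unit's noise fixed and changes only the intervened variable. Noise recovery is exact here only because the assumed mechanism is additive and invertible in its noise term.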

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Advancing Edge Classification through High-Dimensional Causal Modeling of Node-Edge Interplay

    cs.LG 2026-05 unverdicted novelty 7.0

    CECF is a new causal framework for edge classification that balances high-dimensional edge features against node influences via GNN embeddings and cross-attention to achieve better performance than standard methods.

  2. From Weight Perturbation to Feature Attribution for Explaining Fully Connected Neural Networks

    cs.LG 2026-05 unverdicted novelty 6.0

    XWP and XWP_c are novel attribution methods for FCNNs that estimate feature importance by perturbing attached weights to avoid added bias and out-of-distribution issues in occlusion approaches.

  3. Predictive and Prescriptive AI toward Optimizing Wildfire Suppression

    math.OC 2026-05 unverdicted novelty 6.0

    A new optimization algorithm with double machine learning for wildfire spread estimation enables better crew assignments that reduce total area burned.

  4. Tabular Foundation Model for Generative Modelling

    cs.LG 2026-05 unverdicted novelty 5.0

    TabFORGE generates high-quality synthetic tabular data by leveraging pretrained causality-aware representations in a two-stage diffusion-decoder architecture that mitigates latent distribution shifts.