Prompting strategies for enabling large language models to infer causation from correlation

· 2024 · arXiv 2412.13952

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Why LLMs Fail at Causal Discovery and How Interventional Agents Escape

cs.AI · 2026-05-26 · unverdicted · novelty 7.0

LLMs fail causal discovery due to a kernel obstruction in observational learning, but interventional agents using frozen LLMs in Bayesian loops succeed without training on causal graph benchmarks.

CausalGuard: Conformal Inference under Graph Uncertainty

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

CausalGuard aggregates LLM-proposed and data-pruned DAGs to weight doubly robust pseudo-outcomes and applies conformal calibration to deliver finite-sample marginal coverage for conditional average treatment effects under graph uncertainty.

When Do We Need LLMs? A Diagnostic for Language-Driven Bandits

cs.AI · 2026-04-07 · unverdicted · novelty 6.0

Lightweight numerical bandits on text embeddings match or exceed LLM accuracy in contextual bandits at a fraction of the cost, with an embedding-based diagnostic to choose between them.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Why LLMs Fail at Causal Discovery and How Interventional Agents Escape cs.AI · 2026-05-26 · unverdicted · none · ref 18
LLMs fail causal discovery due to a kernel obstruction in observational learning, but interventional agents using frozen LLMs in Bayesian loops succeed without training on causal graph benchmarks.
CausalGuard: Conformal Inference under Graph Uncertainty cs.LG · 2026-05-21 · unverdicted · none · ref 21
CausalGuard aggregates LLM-proposed and data-pruned DAGs to weight doubly robust pseudo-outcomes and applies conformal calibration to deliver finite-sample marginal coverage for conditional average treatment effects under graph uncertainty.
When Do We Need LLMs? A Diagnostic for Language-Driven Bandits cs.AI · 2026-04-07 · unverdicted · none · ref 42
Lightweight numerical bandits on text embeddings match or exceed LLM accuracy in contextual bandits at a fraction of the cost, with an embedding-based diagnostic to choose between them.

Prompting strategies for enabling large language models to infer causation from correlation

fields

years

verdicts

representative citing papers

citing papers explorer