Causal Inference with Generative Artificial Intelligence: Application to Texts as Treatments

Kentaro Nakamura; Kosuke Imai

arXiv: 2410.00903 · v5 · pith:RCF7B4TRnew · submitted 2024-10-01 · stat.AP · cs.CL · cs.LG

keywords causal · representation · generative · inference · proposed · texts · treatment · treatments

In this paper, we demonstrate how generative artificial intelligence (GenAI) can enhance the validity of causal inference with unstructured, high-dimensional treatments such as texts. Specifically, we propose using a deep generative model, such as a large language model (LLM), to efficiently generate treatments and then using its internal representation for subsequent causal effect estimation. We show that knowledge of this true internal representation helps disentangle the treatment features of interest, such as specific sentiments and certain topics, from other, possibly unknown, confounding features. Unlike existing methods, the proposed GenAI-Powered Inference (GPI) methodology eliminates the need to learn a causal representation from the data and hence produces more accurate and efficient estimates. We formally establish the conditions required for nonparametric identification of the average treatment effect, propose an estimation strategy that avoids violating the overlap assumption, and derive the asymptotic properties of the proposed estimator by applying double machine learning. Finally, using an instrumental variables approach, we extend the GPI methodology to settings in which the treatment feature is based on human perception. GPI is also applicable to text reuse, where an LLM is used to regenerate existing texts. We conduct simulation and empirical studies, using text data generated by an open-source LLM, Llama 3, to illustrate the advantages of our estimator over state-of-the-art causal representation learning algorithms.
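
The double machine learning step mentioned in the abstract can be illustrated with a generic cross-fitted augmented inverse probability weighting (AIPW) estimator of the average treatment effect. The sketch below is not the authors' GPI estimator. It assumes simulated data, a hypothetical 5-dimensional vector `R` standing in for an LLM's internal representation, a binary treatment feature `T`, and simple linear and logistic nuisance models. All function names are illustrative.

```python
import numpy as np

def add_intercept(X):
    return np.column_stack([np.ones(len(X)), X])

def fit_ols(X, y):
    """Least-squares coefficients for an outcome model with an intercept."""
    beta, *_ = np.linalg.lstsq(add_intercept(X), y, rcond=None)
    return beta

def fit_logit(X, t, n_iter=25, ridge=1e-6):
    """Logistic propensity model fitted by Newton's method (tiny ridge for stability)."""
    Xb = add_intercept(X)
    w = np.zeros(Xb.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-Xb @ w))
        grad = Xb.T @ (t - p) - ridge * w
        hess = (Xb * (p * (1 - p))[:, None]).T @ Xb + ridge * np.eye(len(w))
        w += np.linalg.solve(hess, grad)
    return w

def cross_fit_aipw(R, T, Y, n_folds=2, clip=0.01, seed=0):
    """Cross-fitted AIPW estimate of the ATE and its standard error.

    Nuisance models are fitted on the training folds and evaluated on the
    held-out fold, which is the sample-splitting idea behind double machine learning.
    """
    n = len(Y)
    folds = np.random.default_rng(seed).permutation(n) % n_folds
    psi = np.empty(n)
    for k in range(n_folds):
        tr, te = folds != k, folds == k
        Xte = add_intercept(R[te])
        # Outcome regressions fitted separately for treated and control units.
        mu1 = Xte @ fit_ols(R[tr & (T == 1)], Y[tr & (T == 1)])
        mu0 = Xte @ fit_ols(R[tr & (T == 0)], Y[tr & (T == 0)])
        # Propensity scores, clipped away from 0 and 1 to guard against poor overlap.
        e = 1.0 / (1.0 + np.exp(-Xte @ fit_logit(R[tr], T[tr])))
        e = np.clip(e, clip, 1 - clip)
        # Doubly robust score for each held-out unit.
        psi[te] = (mu1 - mu0
                   + T[te] * (Y[te] - mu1) / e
                   - (1 - T[te]) * (Y[te] - mu0) / (1 - e))
    return psi.mean(), psi.std(ddof=1) / np.sqrt(n)

# Simulated example (assumption): R confounds both the treatment feature and the outcome.
rng = np.random.default_rng(42)
n, d = 4000, 5
R = rng.normal(size=(n, d))
T = rng.binomial(1, 1.0 / (1.0 + np.exp(-0.5 * R[:, 0])))
Y = 2.0 * T + R @ np.array([1.0, -0.5, 0.3, 0.0, 0.2]) + rng.normal(size=n)

ate, se = cross_fit_aipw(R, T, Y)
print(f"ATE estimate: {ate:.3f} (SE {se:.3f}); simulated true ATE is 2.0")
```

In this toy setting the true effect is 2.0 by construction, so the estimate can be checked against it. With a real LLM, `R` would be replaced by the model's hidden states for each generated text, and more flexible nuisance learners would likely be needed.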

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Triage Score: A Counterfactual Risk Assessment Instrument
stat.AP 2026-06 unverdicted novelty 7.0

Triage scores extend risk scores via additive counterfactual utilities to incorporate intervention effects in high-stakes decisions.

When Planning Fails Despite Correct Execution: On Epistemic Calibration for LLM-Based Multi-Agent Systems
cs.AI 2026-05 unverdicted novelty 6.0

Introduces EPC-AW to mitigate epistemic miscalibration in LLM multi-agent planning via consistency-based selection and refinement, reporting 9.75% average success improvement.