It is asemi-syntheticbenchmark widely used in the causal-inference literature; we use it to check that CIVeX’s safety property survives outside the bespoke Causal-ToolBench SCM

provides realistic covariates from the Infant Health, Development Program (a US RCT) paired with a simulated potential-outcome surface generated under known confounding bias (Dorie et al · 2019

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

CIVeX: Causal Intervention Verification for Language Agents

cs.AI · 2026-05-09 · unverdicted · novelty 6.0

CIVeX maps agent tool calls to structural causal queries, checks identifiability, and issues auditable verdicts to prevent false executions while preserving utility on confounded benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

CIVeX: Causal Intervention Verification for Language Agents cs.AI · 2026-05-09 · unverdicted · none · ref 15
CIVeX maps agent tool calls to structural causal queries, checks identifiability, and issues auditable verdicts to prevent false executions while preserving utility on confounded benchmarks.

It is asemi-syntheticbenchmark widely used in the causal-inference literature; we use it to check that CIVeX’s safety property survives outside the bespoke Causal-ToolBench SCM

fields

years

verdicts

representative citing papers

citing papers explorer