Causal-Adapter: Taming Text-to-Image Diffusion for Faithful Counterfactual Generation

Chaochao Lu; Chen Jin; Dino Oglic; Lei Tong; Philip Teare; Sotirios A. Tsaftaris; Tom Diethe; Zhihua Liu

arxiv: 2509.24798 · v6 · pith:BCOTYJIRnew · submitted 2025-09-29 · 💻 cs.CV · cs.AI

Causal-Adapter: Taming Text-to-Image Diffusion for Faithful Counterfactual Generation

Lei Tong , Zhihua Liu , Chaochao Lu , Dino Oglic , Tom Diethe , Philip Teare , Sotirios A. Tsaftaris , Chen Jin This is my paper

Pith reviewed 2026-05-18 12:46 UTC · model grok-4.3

classification 💻 cs.CV cs.AI

keywords causal adaptercounterfactual generationtext-to-image diffusionattribute controlstructural causal modelingimage editingdiffusion models

0 comments

The pith

Causal-Adapter adapts frozen text-to-image diffusion models for faithful counterfactual generation by enforcing causal attribute relationships.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces Causal-Adapter as a modular framework that adapts frozen text-to-image diffusion backbones to produce counterfactual images. It supports interventions on target attributes that propagate correctly to related attributes while preserving the image's core identity. A sympathetic reader would care because this enables more reliable image editing for applications like medical simulation and fairness testing. The method achieves this through structural causal modeling using prompt-aligned injection to match attributes with text embeddings and a conditioned token contrastive loss to disentangle factors and reduce unwanted correlations. It reports major gains in control precision and generation quality on both synthetic and real datasets without retraining the underlying model.

Core claim

Causal-Adapter leverages structural causal modeling with prompt-aligned injection, which aligns causal attributes with textual embeddings for precise semantic control, and a conditioned token contrastive loss that disentangles attribute factors and reduces spurious correlations. Applied to a frozen diffusion backbone, this enables causal interventions on target attributes that propagate effects to dependents while preserving core image identity, yielding state-of-the-art performance including substantial reductions in error metrics on benchmark datasets.

What carries the argument

Prompt-aligned injection and conditioned token contrastive loss applied to a frozen text-to-image diffusion backbone, which together enforce causal structure during image generation.

Load-bearing premise

Causal relationships among image attributes can be sufficiently captured and enforced using only prompt-aligned injection and a conditioned token contrastive loss on a frozen diffusion backbone, without needing an explicit causal graph or additional supervised causal annotations.

What would settle it

A clear test would be whether intervening on one attribute in generated images reliably updates its causal dependent attributes as expected while keeping the subject's identity unchanged; failure on either would falsify the central claim.

Figures

Figures reproduced from arXiv: 2509.24798 by Chaochao Lu, Chen Jin, Dino Oglic, Lei Tong, Philip Teare, Sotirios A. Tsaftaris, Tom Diethe, Zhihua Liu.

**Figure 1.** Figure 1: Non-causal editing modifies only the target attribute (e.g. age, gender); causal editing propagates changes to related attributes (e.g. beard, baldness) enforced by the causal graph. Answering counterfactual questions (e.g. inferring what an event would have happened under an alternative action) requires understanding the cause–effect relationships among variables and performing hypothetical reasoning (P… view at source ↗

**Figure 2.** Figure 2: A sketch comparison of counterfactual image generation methods based on: (a) [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗

**Figure 3.** Figure 3: Motivational study and preliminary counterfactual generation results between T2I methods and Causal-Adapter. (a) Fine-grained anatomical counterfactual editing of brain ventricular volume using inversion-based editing (NTI (Mokady et al., 2023)), multi-concept prompt-learning editing (MCPL (Jin et al., 2024)), and our approach. (b) Comparison of counterfactual editing results on human faces. (c) Averaged c… view at source ↗

**Figure 4.** Figure 4: Method overview. A counterfactual prompt and input image [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 5.** Figure 5: Pendulum counterfactuals with traversal edit [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗

**Figure 6.** Figure 6: CelebA counterfactuals from CausalAdapter compared with prior methods. Human Face Counterfactuals. Following the benchmarking of Melistas et al. (2024), we evaluate Causal-Adapter on CelebA test set for human face counterfactual generation across four categorical attributes (age, gender, beard, bald) with the causal graph shown in [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗

**Figure 7.** Figure 7: ADNI brain MRI counterfactual results from Causal-Adapter. Direct causal effects are [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗

**Figure 8.** Figure 8: Ablation study on CelebA validation set. (a) Average intervention effectiveness. (b) Realism and minimality. (c) Qualitative examples, with dotted boxes indicating results of localized editing. 4 CONCLUSION We introduced Causal-Adapter to tame Text-to-Image diffusion models for counterfactual image generation. Our motivational study revealed that current Text-toImage diffusion model based editing appro… view at source ↗

**Figure 9.** Figure 9: Null-Textual Inversion (NTI) relies heavily on prompt engineering, where minor word [PITH_FULL_IMAGE:figures/full_fig_p020_9.png] view at source ↗

**Figure 10.** Figure 10: Multi-Concept Prompt Learning (MCPL) as a representative prompt-learning baseline. [PITH_FULL_IMAGE:figures/full_fig_p021_10.png] view at source ↗

**Figure 11.** Figure 11: Fine-grained anatomical counterfactual editing of brain ventricular volume. NTI and [PITH_FULL_IMAGE:figures/full_fig_p022_11.png] view at source ↗

**Figure 12.** Figure 12: Impact of guidance scale on FID and CLD across three Causal-Adapter variants. Note that [PITH_FULL_IMAGE:figures/full_fig_p026_12.png] view at source ↗

**Figure 13.** Figure 13: Impact of DDIM steps on FID and CLD 27 [PITH_FULL_IMAGE:figures/full_fig_p027_13.png] view at source ↗

**Figure 14.** Figure 14: Counterfactuals from Causal-Adapter variants under different guidance scales. The plain [PITH_FULL_IMAGE:figures/full_fig_p029_14.png] view at source ↗

**Figure 15.** Figure 15: Full ablation visualizations with optional attention guidance (AG). Causal-Adapter with [PITH_FULL_IMAGE:figures/full_fig_p029_15.png] view at source ↗

**Figure 16.** Figure 16: Average cross-attention maps from Causal-Adapter variants. Tokens denote attributes: [PITH_FULL_IMAGE:figures/full_fig_p030_16.png] view at source ↗

**Figure 17.** Figure 17: Pendulum counterfactuals from Causal-Adapter. [PITH_FULL_IMAGE:figures/full_fig_p031_17.png] view at source ↗

**Figure 18.** Figure 18: Pendulum counterfactuals from Causal-Adapter. [PITH_FULL_IMAGE:figures/full_fig_p032_18.png] view at source ↗

**Figure 19.** Figure 19: Additional counterfactual results on the CelebA dataset (with edit samples selected in a non [PITH_FULL_IMAGE:figures/full_fig_p033_19.png] view at source ↗

**Figure 20.** Figure 20: Additional counterfactual results on the CelebA dataset (with edit samples selected in a [PITH_FULL_IMAGE:figures/full_fig_p034_20.png] view at source ↗

**Figure 21.** Figure 21: Additional counterfactual results from random interventions on each attribute in the [PITH_FULL_IMAGE:figures/full_fig_p035_21.png] view at source ↗

**Figure 22.** Figure 22: Additional counterfactual results from random interventions on each attribute in the ADNI [PITH_FULL_IMAGE:figures/full_fig_p036_22.png] view at source ↗

**Figure 23.** Figure 23: Average cross-attention maps from Causal-Adapter on CelebA dataset. Token denote [PITH_FULL_IMAGE:figures/full_fig_p037_23.png] view at source ↗

**Figure 24.** Figure 24: Average cross-attention maps from Causal-Adapter on ADNI dataset. Token denote [PITH_FULL_IMAGE:figures/full_fig_p037_24.png] view at source ↗

**Figure 25.** Figure 25: Average cross-attention maps from Causal-Adapter on Pendulum dataset. Token denote [PITH_FULL_IMAGE:figures/full_fig_p038_25.png] view at source ↗

**Figure 26.** Figure 26: Counterfactuals generated by Causal-Adapter on CelebA under beard interventions. [PITH_FULL_IMAGE:figures/full_fig_p039_26.png] view at source ↗

read the original abstract

We present Causal-Adapter, a modular framework that adapts frozen text-to-image diffusion backbones for counterfactual image generation. Our method supports causal interventions on target attributes and consistently propagates their effects to causal dependents while preserving the core identity of the image. Unlike prior approaches that rely on prompt engineering without explicit causal structure, Causal-Adapter leverages structural causal modeling with two attribute-regularization strategies: (i) prompt-aligned injection, which aligns causal attributes with textual embeddings for precise semantic control, and (ii) a conditioned token contrastive loss that disentangles attribute factors and reduces spurious correlations. Causal-Adapter achieves state-of-the-art performance on both synthetic and real-world datasets, including up to a 91% reduction in MAE on Pendulum for accurate attribute control and up to an 87% reduction in FID on ADNI for high-fidelity MRI generation. These results demonstrate robust, generalizable counterfactual editing with faithful attribute modification and strong identity preservation. Code and models will be released at: https://leitong02.github.io/causaladapter/.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Causal-Adapter adds prompt-aligned injection and a conditioned token contrastive loss to frozen diffusion backbones for stronger attribute control in counterfactuals, but the causal interpretation rests on indirect regularization rather than an explicit graph.

read the letter

The punchline is that this paper gives a modular way to adapt frozen diffusion models for counterfactual generation by adding prompt-aligned injection and a conditioned token contrastive loss, leading to reported big improvements in attribute accuracy and image fidelity. What is new is the way they tie structural causal modeling to these two specific strategies for intervening on target attributes while preserving identity. The prompt-aligned injection helps with semantic control through text embeddings, and the contrastive loss aims to reduce spurious correlations between factors. This is presented as an advance over simple prompt engineering. The paper does well on the experimental side by testing on both a synthetic dataset like Pendulum, where they cut MAE by up to 91%, and a real medical dataset ADNI for MRI, with FID reductions up to 87%. These numbers suggest the method works for precise control and high-quality outputs. Releasing the code will make it easier to verify and extend. The main soft spot is around the causal aspect. Without an explicit causal graph or formalization of interventions, the regularizers might mainly achieve better disentanglement of attributes rather than enforcing directed causal effects. The contrastive loss can discourage correlations seen in training data, but it does not necessarily ensure that changing one attribute affects only its causal descendants as required by a structural causal model. This is especially relevant for the real-world ADNI results, where improved scores could reflect stronger attribute binding instead of true counterfactual faithfulness. Overall, this paper targets researchers in generative modeling and medical image synthesis who want practical tools for causal editing. A reader working on controllable diffusion models would get concrete ideas from the regularization approaches and the results. The work shows clear thinking on the problem and has reproducible elements planned, so it deserves peer review to sort out the details on the causal claims.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Causal-Adapter, a modular adapter for frozen text-to-image diffusion backbones that enables counterfactual image generation via causal interventions on target attributes. It uses prompt-aligned injection to align attributes with textual embeddings and a conditioned token contrastive loss to disentangle factors and reduce spurious correlations, claiming to propagate effects to causal dependents while preserving image identity. The work reports state-of-the-art results including up to 91% MAE reduction on the synthetic Pendulum dataset and 87% FID reduction on the real-world ADNI MRI dataset.

Significance. If the causal claims hold, the modular design on frozen backbones represents a practical advance for counterfactual generation in diffusion models, avoiding full retraining while incorporating causal concepts. The quantitative gains on both synthetic and real data, combined with the promise of code release, would support reproducibility and broader adoption in vision applications requiring faithful attribute control.

major comments (2)

[Introduction and Methods] Introduction and Methods: The manuscript repeatedly invokes structural causal modeling (SCM) terminology such as 'causal interventions,' 'causal dependents,' and 'faithful counterfactual generation,' yet provides no explicit causal graph, no formalization of the do-operator, and no supervised causal annotations. This is load-bearing for the central claim that the two regularization strategies implement directed causal propagation rather than generic attribute disentanglement or correlation reduction.
[Experiments (§4)] Experiments (§4): The reported 91% MAE reduction on Pendulum and 87% FID reduction on ADNI are presented as evidence of causal fidelity, but without ablations that isolate the contribution of the causal components (e.g., removing the contrastive loss or prompt alignment while retaining standard conditioning) or statistical significance tests against strong non-causal baselines, it remains unclear whether gains stem from causal enforcement or improved prompt binding.

minor comments (2)

[Abstract] The abstract states that code and models will be released at the provided URL; the manuscript should clarify the exact release timeline and include a permanent archival link.
[Methods] Notation for the conditioned token contrastive loss would benefit from an explicit equation in the main text rather than relying solely on prose description.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment below and outline the revisions we will make to strengthen the manuscript's presentation of its causal modeling aspects and experimental validation.

read point-by-point responses

Referee: [Introduction and Methods] Introduction and Methods: The manuscript repeatedly invokes structural causal modeling (SCM) terminology such as 'causal interventions,' 'causal dependents,' and 'faithful counterfactual generation,' yet provides no explicit causal graph, no formalization of the do-operator, and no supervised causal annotations. This is load-bearing for the central claim that the two regularization strategies implement directed causal propagation rather than generic attribute disentanglement or correlation reduction.

Authors: We acknowledge that an explicit causal graph and formalization using the do-operator would better ground the SCM terminology and clarify how the regularization strategies enforce directed propagation. In the revised manuscript we will add a new subsection in Methods that (i) presents a causal graph for the attribute dependencies in both the Pendulum and ADNI datasets, (ii) formalizes the target-attribute intervention via the do-operator, and (iii) explains how prompt-aligned injection and the conditioned token contrastive loss together implement the required causal propagation without needing supervised causal annotations. This addition will distinguish the approach from generic disentanglement while preserving the modular, annotation-free nature of the method. revision: yes
Referee: [Experiments (§4)] Experiments (§4): The reported 91% MAE reduction on Pendulum and 87% FID reduction on ADNI are presented as evidence of causal fidelity, but without ablations that isolate the contribution of the causal components (e.g., removing the contrastive loss or prompt alignment while retaining standard conditioning) or statistical significance tests against strong non-causal baselines, it remains unclear whether gains stem from causal enforcement or improved prompt binding.

Authors: We agree that isolating the causal components and providing statistical tests would strengthen the causal claims. In the revised version we will add ablation experiments that disable prompt-aligned injection and the conditioned token contrastive loss individually (while retaining the base conditioning and adapter architecture) and report the resulting MAE and FID degradations. We will also include statistical significance tests (paired t-tests across multiple random seeds) comparing the full Causal-Adapter against strong non-causal baselines such as standard LoRA conditioning and prompt-only editing. These results will be presented in an expanded §4 and the supplementary material. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper introduces Causal-Adapter as a modular adaptation of frozen text-to-image diffusion backbones using prompt-aligned injection and a conditioned token contrastive loss to support causal interventions on attributes. Performance is evaluated empirically via MAE reductions on Pendulum and FID on ADNI, with no equations or claims in the abstract reducing these metrics or the central counterfactual claims to quantities defined by the method's own fitted parameters or self-referential definitions. The invocation of structural causal modeling terminology is presented as an application of existing concepts rather than a self-citation load-bearing step or ansatz that collapses to the inputs. The derivation remains self-contained with independent empirical content.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that causal structure can be approximated via text embeddings and contrastive losses without explicit graphs; no free parameters or invented entities are described in the abstract.

axioms (1)

domain assumption Causal relationships among visual attributes can be effectively modeled and enforced through alignment with textual embeddings and token-level contrastive objectives in a frozen diffusion model.
This premise is invoked to justify the two attribute-regularization strategies and the claim of faithful propagation of interventions.

pith-pipeline@v0.9.0 · 5738 in / 1334 out tokens · 43256 ms · 2026-05-18T12:46:27.823039+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

L = L_DM + λ L_CTC ... InfoNCE ... prompt-aligned injection (PAI) ... Conditioned Token Contrast (CTC)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.