Causal Disentanglement for Robust Long-tail Medical Image Generation

Anan Liu; Bruno Lepri; Nicu Sebe; Weijie Wang; Weizhi Nie; Zichun Zhang

arxiv: 2504.14450 · v2 · pith:QPTEMACVnew · submitted 2025-04-20 · 💻 cs.CV

Causal Disentanglement for Robust Long-tail Medical Image Generation

Weizhi Nie , Zichun Zhang , Weijie Wang , Bruno Lepri , Anan Liu , Nicu Sebe This is my paper

classification 💻 cs.CV

keywords medicalfeaturesimagespathologicaldatagenerationmodelcausal

0 comments

read the original abstract

Counterfactual medical image generation effectively addresses data scarcity and enhances the interpretability of medical images. However, due to the complex and diverse pathological features of medical images and the imbalanced class distribution in medical data, generating high-quality and diverse medical images from limited data is significantly challenging. Additionally, to fully leverage the information in limited data, such as anatomical structure information and generate more structurally stable medical images while avoiding distortion or inconsistency. In this paper, in order to enhance the clinical relevance of generated data and improve the interpretability of the model, we propose a novel medical image generation framework, which generates independent pathological and structural features based on causal disentanglement and utilizes text-guided modeling of pathological features to regulate the generation of counterfactual images. First, we achieve feature separation through causal disentanglement and analyze the interactions between features. Here, we introduce group supervision to ensure the independence of pathological and identity features. Second, we leverage a diffusion model guided by pathological findings to model pathological features, enabling the generation of diverse counterfactual images. Meanwhile, we enhance accuracy by leveraging a large language model to extract lesion severity and location from medical reports. Additionally, we improve the performance of the latent diffusion model on long-tailed categories through initial noise optimization.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Flow Matching with Optimized Subclass Priors for Medical Image Augmentation
eess.IV 2026-05 unverdicted novelty 6.0

Optimizes subclass priors in flow matching via latent GMM partitioning and conditioned sources to improve rare disease image generation fidelity, diversity, and downstream classification on long-tailed medical datasets.
GRASP: Guided Residual Adapters with Sample-wise Partitioning
cs.CV 2025-12 unverdicted novelty 5.0

GRASP applies deterministic conditioning-space partitioning and sample-wise residual adapters to improve tail-class fidelity, diversity, and downstream utility in flow matching models, outperforming full fine-tuning a...