pith. sign in

arxiv: 2504.14450 · v2 · pith:QPTEMACVnew · submitted 2025-04-20 · 💻 cs.CV

Causal Disentanglement for Robust Long-tail Medical Image Generation

classification 💻 cs.CV
keywords medicalfeaturesimagespathologicaldatagenerationmodelcausal
0
0 comments X
read the original abstract

Counterfactual medical image generation effectively addresses data scarcity and enhances the interpretability of medical images. However, due to the complex and diverse pathological features of medical images and the imbalanced class distribution in medical data, generating high-quality and diverse medical images from limited data is significantly challenging. Additionally, to fully leverage the information in limited data, such as anatomical structure information and generate more structurally stable medical images while avoiding distortion or inconsistency. In this paper, in order to enhance the clinical relevance of generated data and improve the interpretability of the model, we propose a novel medical image generation framework, which generates independent pathological and structural features based on causal disentanglement and utilizes text-guided modeling of pathological features to regulate the generation of counterfactual images. First, we achieve feature separation through causal disentanglement and analyze the interactions between features. Here, we introduce group supervision to ensure the independence of pathological and identity features. Second, we leverage a diffusion model guided by pathological findings to model pathological features, enabling the generation of diverse counterfactual images. Meanwhile, we enhance accuracy by leveraging a large language model to extract lesion severity and location from medical reports. Additionally, we improve the performance of the latent diffusion model on long-tailed categories through initial noise optimization.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Flow Matching with Optimized Subclass Priors for Medical Image Augmentation

    eess.IV 2026-05 unverdicted novelty 6.0

    Optimizes subclass priors in flow matching via latent GMM partitioning and conditioned sources to improve rare disease image generation fidelity, diversity, and downstream classification on long-tailed medical datasets.

  2. GRASP: Guided Residual Adapters with Sample-wise Partitioning

    cs.CV 2025-12 unverdicted novelty 5.0

    GRASP applies deterministic conditioning-space partitioning and sample-wise residual adapters to improve tail-class fidelity, diversity, and downstream utility in flow matching models, outperforming full fine-tuning a...