Debiasing Text-to-Image Diffusion Models

Chuhui Xue; Haoru Tan; Ruifei He; Song Bai; Wenqing Zhang; Xiaojuan Qi; Yingchen Yu

arxiv: 2402.14577 · v1 · pith:2E7RBUJAnew · submitted 2024-02-22 · 💻 cs.CV

Debiasing Text-to-Image Diffusion Models

Ruifei He , Chuhui Xue , Haoru Tan , Wenqing Zhang , Yingchen Yu , Song Bai , Xiaojuan Qi This is my paper

classification 💻 cs.CV

keywords diffusionbiasmodelsproblemsocialconvergenceresolvingtext-to-image

0 comments

read the original abstract

Learning-based Text-to-Image (TTI) models like Stable Diffusion have revolutionized the way visual content is generated in various domains. However, recent research has shown that nonnegligible social bias exists in current state-of-the-art TTI systems, which raises important concerns. In this work, we target resolving the social bias in TTI diffusion models. We begin by formalizing the problem setting and use the text descriptions of bias groups to establish an unsafe direction for guiding the diffusion process. Next, we simplify the problem into a weight optimization problem and attempt a Reinforcement solver, Policy Gradient, which shows sub-optimal performance with slow convergence. Further, to overcome limitations, we propose an iterative distribution alignment (IDA) method. Despite its simplicity, we show that IDA shows efficiency and fast convergence in resolving the social bias in TTI diffusion models. Our code will be released.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Attention, May I Have Your Decision? Localizing Generative Choices in Diffusion Models
cs.CV 2026-03 unverdicted novelty 6.0

Implicit generative choices in diffusion models for ambiguous prompts are localized principally in self-attention layers, enabling a targeted ICM steering method that outperforms prior debiasing approaches.