TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models

Aditya Chinchure; Gaurav Bhatt; Kartik Hosanagar; Kiri Salij; Leonid Sigal; Matthew Turk; Pushkar Shukla

arxiv: 2312.01261 · v2 · pith:D6GILRIGnew · submitted 2023-12-03 · 💻 cs.CV · cs.CY

TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models

Aditya Chinchure , Pushkar Shukla , Gaurav Bhatt , Kiri Salij , Kartik Hosanagar , Leonid Sigal , Matthew Turk This is my paper

classification 💻 cs.CV cs.CY

keywords biasesmodelspromptabilityapproachcomplexconceptsgenerate

0 comments

read the original abstract

Text-to-Image (TTI) generative models have shown great progress in the past few years in terms of their ability to generate complex and high-quality imagery. At the same time, these models have been shown to suffer from harmful biases, including exaggerated societal biases (e.g., gender, ethnicity), as well as incidental correlations that limit such a model's ability to generate more diverse imagery. In this paper, we propose a general approach to study and quantify a broad spectrum of biases, for any TTI model and for any prompt, using counterfactual reasoning. Unlike other works that evaluate generated images on a predefined set of bias axes, our approach automatically identifies potential biases that might be relevant to the given prompt, and measures those biases. In addition, we complement quantitative scores with post-hoc explanations in terms of semantic concepts in the images generated. We show that our method is uniquely capable of explaining complex multi-dimensional biases through semantic concepts, as well as the intersectionality between different biases for any given prompt. We perform extensive user studies to illustrate that the results of our method and analysis are consistent with human judgements.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Context Matters: Auditing Gender Bias in T2I Generation through Risk-Tiered Use-Case Profiles
cs.CY 2026-05 unverdicted novelty 7.0

A new framework called THUMB cards organizes gender bias metrics for T2I models by risk-tiered use cases, measurement categories, and harm typologies aligned with the EU AI Act.
BiasIG: Benchmarking Multi-dimensional Social Biases in Text-to-Image Models
cs.CY 2026-04 conditional novelty 6.0

BiasIG is a multi-dimensional benchmark for social biases in T2I models that shows debiasing interventions frequently cause confounding discrimination effects.
Who Defines Fairness? Target-Based Prompting for Demographic Representation in Generative Models
cs.AI 2026-04 unverdicted novelty 5.0

Target-based prompting lets users define fairness distributions for skin tones in generative AI, shifting outputs closer to chosen targets across 36 tested prompts for occupations and contexts.