Multilingual Conceptual Coverage in Text-to-Image Models

Michael Saxon; William Yang Wang

arxiv: 2306.01735 · v1 · pith:CF27UGRBnew · submitted 2023-06-02 · 💻 cs.CL · cs.AI· cs.CV· eess.IV

Multilingual Conceptual Coverage in Text-to-Image Models

Michael Saxon , William Yang Wang This is my paper

classification 💻 cs.CL cs.AIcs.CVeess.IV

keywords languageconceptualcoveragetargetgeneratedimagesmodelmodels

0 comments

read the original abstract

We propose "Conceptual Coverage Across Languages" (CoCo-CroLa), a technique for benchmarking the degree to which any generative text-to-image system provides multilingual parity to its training language in terms of tangible nouns. For each model we can assess "conceptual coverage" of a given target language relative to a source language by comparing the population of images generated for a series of tangible nouns in the source language to the population of images generated for each noun under translation in the target language. This technique allows us to estimate how well-suited a model is to a target language as well as identify model-specific weaknesses, spurious correlations, and biases without a-priori assumptions. We demonstrate how it can be used to benchmark T2I models in terms of multilinguality, and how despite its simplicity it is a good proxy for impressive generalization.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

BAFIS: Dataset + Framework to assess occupational Bias and Human Preference in modern Text-to-image Models
cs.CV 2026-06 unverdicted novelty 6.0

BAFIS supplies a new dataset and human-feedback framework demonstrating systematic gender and ethnicity biases in occupational image generation by five text-to-image models, with partial alignment between automated me...