Conditional Vendi Score: Prompt-Aware Diversity Evaluation for Generative AI Models and LLMs

Amin Gohari; Azim Ospanov; Farzan Farnia; Mohammad Jalali

arxiv: 2411.02817 · v2 · pith:GT6S4PKKnew · submitted 2024-11-05 · 💻 cs.LG · cs.AI· cs.CV· cs.IT· math.IT

Conditional Vendi Score: Prompt-Aware Diversity Evaluation for Generative AI Models and LLMs

Mohammad Jalali , Azim Ospanov , Amin Gohari , Farzan Farnia This is my paper

classification 💻 cs.LG cs.AIcs.CVcs.ITmath.IT

keywords diversitymodelsconditionalconditional-vendiconditional-rkegenerativematricesmodel-induced

0 comments

read the original abstract

Generative models guided by text prompts are widely evaluated for fidelity and prompt alignment, yet their ability to produce outputs remains underexplored. Existing diversity metrics such as Vendi and RKE, which are based on the von Neumann and R\'enyi entropies of kernel matrices, were developed for unconditional models and cannot distinguish prompt-induced from model-induced variability. We address this gap by introducing \textit{Conditional-Vendi} and \textit{Conditional-RKE}, diversity measures derived from the conditional entropy of positive semidefinite matrices. These scores isolate model-induced diversity in prompt-guided generation, with Conditional-RKE enjoying an $O(1/\sqrt{n})$ convergence rate. For Conditional-Vendi, we introduce a truncated-spectrum approximation that yields scalable and consistent estimates. Experiments on text-to-image, image-captioning, and LLM tasks show that the conditional scores recover ground-truth diversity orderings and can also guide diffusion models toward more diverse samples. The codebase is available at https://github.com/mjalali/conditional-vendi.

This paper has not been read by Pith yet.

Conditional Vendi Score: Prompt-Aware Diversity Evaluation for Generative AI Models and LLMs

discussion (0)