ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on Diversity over Language and Culture

Feifan Li; Kenneth Ward Church; Mohamed Abdelfattah; Mohamed Elhoseiny; Shyma Alhuwaider; Xiangliang Zhang; Youssef Mohamed

arxiv: 2211.10780 · v1 · pith:3ILIOK4Xnew · submitted 2022-11-19 · 💻 cs.CL · cs.AI· cs.CY· cs.LG

ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on Diversity over Language and Culture

Youssef Mohamed , Mohamed Abdelfattah , Shyma Alhuwaider , Feifan Li , Xiangliang Zhang , Kenneth Ward Church , Mohamed Elhoseiny This is my paper

classification 💻 cs.CL cs.AIcs.CYcs.LG

keywords artelingodiversityannotationslanguagesacrossartworksbaselinecultures

0 comments

read the original abstract

This paper introduces ArtELingo, a new benchmark and dataset, designed to encourage work on diversity across languages and cultures. Following ArtEmis, a collection of 80k artworks from WikiArt with 0.45M emotion labels and English-only captions, ArtELingo adds another 0.79M annotations in Arabic and Chinese, plus 4.8K in Spanish to evaluate "cultural-transfer" performance. More than 51K artworks have 5 annotations or more in 3 languages. This diversity makes it possible to study similarities and differences across languages and cultures. Further, we investigate captioning tasks, and find diversity improves the performance of baseline models. ArtELingo is publicly available at https://www.artelingo.org/ with standard splits and baseline models. We hope our work will help ease future research on multilinguality and culturally-aware AI.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Beyond Semantics: Modeling Factual and Affective Perceptual Experiences from Vision-Language Data
cs.CV 2026-06 unverdicted novelty 5.0

PercepT discovers perceptual topic clusters from vision-language data via unsupervised training and maps images to them with attention pooling, reporting silhouette 0.97 and AUC 0.94 on ArtELingo.