CREMA -D: Crowd -sourced emotional multimodal actors dataset,

· 2014 · arXiv 2014.233624

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Unmasking LAION-5B: Age, Gender, Race, and Emotion Biases in Large-Scale Image Datasets

cs.CV · 2026-06-22 · unverdicted · novelty 6.0

Empirical audit of LAION-2B-en and LAION-2B-multi finds overrepresentation of young adults, White people, and males plus stereotypical emotion associations across two attribute classifiers.

Before Fusion, Ask What to Keep: Contextual Calibration of Multimodal Signals

cs.LG · 2026-06-01 · unverdicted · novelty 6.0

A pre-fusion calibration module modulates multimodal features using cross-modality support and conflict cues to improve performance on five benchmarks including sentiment analysis and audio-visual tasks.

Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty

cs.SD · 2026-05-30 · unverdicted · novelty 5.0

Upper-face affective features improve model calibration in noisy audiovisual sentence recognition but add only small accuracy gains compared to mouth features.

Evaluation of Conversational Agents: Understanding Culture, Context and Environment in Emotion Detection

cs.CV · 2026-05-28 · unverdicted · novelty 2.0

An emotion prediction model using 3-layer CNN plus AFME algorithm on speech and image data detects seven basic emotions and sarcasm at 85-96% accuracy, addressing cultural challenges in Black African conversational AI.

citing papers explorer

Showing 4 of 4 citing papers after filters.

Unmasking LAION-5B: Age, Gender, Race, and Emotion Biases in Large-Scale Image Datasets cs.CV · 2026-06-22 · unverdicted · none · ref 58
Empirical audit of LAION-2B-en and LAION-2B-multi finds overrepresentation of young adults, White people, and males plus stereotypical emotion associations across two attribute classifiers.
Before Fusion, Ask What to Keep: Contextual Calibration of Multimodal Signals cs.LG · 2026-06-01 · unverdicted · none · ref 1
A pre-fusion calibration module modulates multimodal features using cross-modality support and conflict cues to improve performance on five benchmarks including sentiment analysis and audio-visual tasks.
Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty cs.SD · 2026-05-30 · unverdicted · none · ref 5
Upper-face affective features improve model calibration in noisy audiovisual sentence recognition but add only small accuracy gains compared to mouth features.
Evaluation of Conversational Agents: Understanding Culture, Context and Environment in Emotion Detection cs.CV · 2026-05-28 · unverdicted · none · ref 26
An emotion prediction model using 3-layer CNN plus AFME algorithm on speech and image data detects seven basic emotions and sarcasm at 85-96% accuracy, addressing cultural challenges in Black African conversational AI.

CREMA -D: Crowd -sourced emotional multimodal actors dataset,

fields

years

verdicts

representative citing papers

citing papers explorer