A controlled study finds that a CLIP-based body-scene fusion model for emotion recognition on EMOTIC is not improved by context debiasing or rare-class training; its best mAP is 34.52%.
In: IEEE/RSJ International Conference on Intelligent Robots and Systems (2024)
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CV (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
A Controlled Study of CLIP-Based Body-Scene Fusion for Emotion Recognition in Context