VLMs mistake head orientation for gaze direction in controlled real-world photos, creating a large performance gap with humans that stems from data biases.
R., Brooks, R., and Meltzoff, A
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CV (1)
Years: 2025 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers:
Vision-Language Models Mistake Head Orientation for Gaze Direction: Nonverbal Conversation Cues