MuTual: A Dataset for Multi-Turn Dialogue Reasoning
1 Pith paper cites this work. Polarity classification is still being indexed.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Unveiling the Limits of Large Language Models in Inferring Pragmatic Meaning from Non-Verbal Responses
LLMs struggle to infer pragmatic meaning from non-verbal responses alone, showing accuracy drops of up to 60 percentage points versus verbal responses, though in-context learning improves results.