A fine-grained comparison of pragmatic language understanding in humans and language models

Hu, Jennifer, Floyd, Sammy, Jouravlev, Olessia, Fedorenko, Evelina, Gibson, Edward · 2023 · DOI 10.18653/v1/2023.acl-long.230

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

open at publisher browse 4 citing papers

representative citing papers

LLMs Infer Cultural Context but Fail to Apply It When Responding

cs.CL · 2026-06-16 · unverdicted · novelty 7.0

LLMs infer cultural context from cues but fail to apply it for adapted responses unless prompted sequentially, shown via the CAPRI dataset on units, time, and quantity expressions.

Do LLMs Use Cultural Knowledge Without Being Told? A Multilingual Evaluation of Implicit Pragmatic Adaptation

cs.CL · 2026-04-20 · conditional · novelty 7.0

LLMs recover only ~20% of explicit pragmatic shifts under implicit cultural cues across five languages, responding mainly to linguistic structure rather than cultural associations as shown by Hindi-Urdu controls.

Unveiling the Limits of Large Language Models in Inferring Pragmatic Meaning from Non-Verbal Responses

cs.CL · 2026-06-01 · unverdicted · novelty 6.0

LLMs struggle to infer pragmatic meaning from non-verbal responses alone, showing accuracy drops of up to 60 percentage points versus verbal responses, though in-context learning improves results.

How Hypocritical Is Your LLM judge? Listener-Speaker Asymmetries in the Pragmatic Competence of Large Language Models

cs.CL · 2026-04-17 · unverdicted · novelty 6.0

LLMs perform substantially better as pragmatic listeners judging language than as speakers generating it, revealing weak alignment between the two roles.

citing papers explorer

Showing 3 of 3 citing papers after filters.

LLMs Infer Cultural Context but Fail to Apply It When Responding cs.CL · 2026-06-16 · unverdicted · none · ref 20
LLMs infer cultural context from cues but fail to apply it for adapted responses unless prompted sequentially, shown via the CAPRI dataset on units, time, and quantity expressions.
Unveiling the Limits of Large Language Models in Inferring Pragmatic Meaning from Non-Verbal Responses cs.CL · 2026-06-01 · unverdicted · none · ref 4
LLMs struggle to infer pragmatic meaning from non-verbal responses alone, showing accuracy drops of up to 60 percentage points versus verbal responses, though in-context learning improves results.
How Hypocritical Is Your LLM judge? Listener-Speaker Asymmetries in the Pragmatic Competence of Large Language Models cs.CL · 2026-04-17 · unverdicted · none · ref 16
LLMs perform substantially better as pragmatic listeners judging language than as speakers generating it, revealing weak alignment between the two roles.

A fine-grained comparison of pragmatic language understanding in humans and language models

fields

years

verdicts

representative citing papers

citing papers explorer