M ulti P rag E val: Multilingual Pragmatic Evaluation of Large Language Models

Park, Dojun, Lee, Jiwoo, Park, Seohyun, Jeong, Hyeyun, Koo, Youngeun, Hwang, Soonha · 2024 · DOI 10.18653/v1/2024.genbench-1.7

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Unveiling the Limits of Large Language Models in Inferring Pragmatic Meaning from Non-Verbal Responses

cs.CL · 2026-06-01 · unverdicted · novelty 6.0

LLMs struggle to infer pragmatic meaning from non-verbal responses alone, showing accuracy drops of up to 60 percentage points versus verbal responses, though in-context learning improves results.

How Hypocritical Is Your LLM judge? Listener-Speaker Asymmetries in the Pragmatic Competence of Large Language Models

cs.CL · 2026-04-17 · unverdicted · novelty 6.0

LLMs perform substantially better as pragmatic listeners judging language than as speakers generating it, revealing weak alignment between the two roles.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Unveiling the Limits of Large Language Models in Inferring Pragmatic Meaning from Non-Verbal Responses cs.CL · 2026-06-01 · unverdicted · none · ref 7
LLMs struggle to infer pragmatic meaning from non-verbal responses alone, showing accuracy drops of up to 60 percentage points versus verbal responses, though in-context learning improves results.
How Hypocritical Is Your LLM judge? Listener-Speaker Asymmetries in the Pragmatic Competence of Large Language Models cs.CL · 2026-04-17 · unverdicted · none · ref 31
LLMs perform substantially better as pragmatic listeners judging language than as speakers generating it, revealing weak alignment between the two roles.

M ulti P rag E val: Multilingual Pragmatic Evaluation of Large Language Models

fields

years

verdicts

representative citing papers

citing papers explorer