LLMs perform substantially better as pragmatic listeners judging language than as speakers generating it, revealing weak alignment between the two roles.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CL 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Continued pre-training on web data and LLM-ensemble synthetic labels improve multilingual hate speech detection, with gains up to 11% for small models in low-resource settings.
citing papers explorer
-
How Hypocritical Is Your LLM judge? Listener-Speaker Asymmetries in the Pragmatic Competence of Large Language Models
LLMs perform substantially better as pragmatic listeners judging language than as speakers generating it, revealing weak alignment between the two roles.
-
Toward Generalized Cross-Lingual Hateful Language Detection with Web-Scale Data and Ensemble LLM Annotations
Continued pre-training on web data and LLM-ensemble synthetic labels improve multilingual hate speech detection, with gains up to 11% for small models in low-resource settings.