DIA-HARM reveals that human-written dialectal English degrades disinformation detector F1 by 1.4-3.6% while AI-generated dialectal content stays stable, with multilingual models generalizing better than monolingual ones.
InProceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 11328–11348, Toronto, Canada
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
DIA-HARM: Dialectal Disparities in Harmful Content Detection Across 50 English Dialects
DIA-HARM reveals that human-written dialectal English degrades disinformation detector F1 by 1.4-3.6% while AI-generated dialectal content stays stable, with multilingual models generalizing better than monolingual ones.