Incidental multilingualism from uneven web training makes LLMs unequal, brittle, and opaque across languages.
and Jauhiainen, Tommi
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CL 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Transfer learning from a source ASR model plus data augmentation and self-attention enables Chinese dialect discrimination that outperforms prior methods on two benchmarks.
citing papers explorer
-
Lost in the Tower of Babel: The Adverse Effects of Incidental Multilingualism in LLMs
Incidental multilingualism from uneven web training makes LLMs unequal, brittle, and opaque across languages.
-
Low-resource Language Discrimination Towards Chinese Dialects with Transfer learning and Data Augmentation
Transfer learning from a source ASR model plus data augmentation and self-attention enables Chinese dialect discrimination that outperforms prior methods on two benchmarks.