Evaluating Japanese Dialect Robustness Across Speech and Text-based Large Language Models

Atsushi Kojima; Hao Shi; Lianbo Liu; Tomoya Mizumoto; Yui Sudo; Yusuke Fujita

arxiv: 2606.25436 · v1 · pith:PSNU4RKGnew · submitted 2026-06-24 · eess.AS · cs.CL · cs.SD

keywords: dialectal, robustness, language, speech, llms, models, slms, dialects

Dialogue systems based on large language models (LLMs) have advanced significantly in recent years. However, dialectal variation remains a major challenge, particularly for systems that process spoken input. LLM-based speech language models (SLMs), which integrate LLMs with speech processing components, show promise for spoken language tasks, yet their ability to comprehend dialects has not been sufficiently studied. Moreover, it remains unclear how the dialectal understanding of the base LLM affects SLM performance. This study investigates the dialectal robustness of both LLMs and SLMs using Japanese dialects as a test case. We define robustness as the ratio of performance on dialectal versus standard inputs, enabling fair comparisons. Our experiments show that SLM robustness correlates with that of their text-based counterparts. Furthermore, training with dialectal data and fine-tuning the speech encoder each improve robustness in SLMs.
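The robustness definition in the abstract can be illustrated with a minimal sketch. It assumes a higher-is-better metric such as accuracy. The function name `robustness`, its parameter names, and the scores are hypothetical and are not taken from the paper.

```python
def robustness(dialect_score: float, standard_score: float) -> float:
    """Ratio of performance on dialectal input to performance on standard input.

    Hypothetical helper: assumes a higher-is-better metric (e.g. accuracy in %).
    A value near 1.0 means little degradation on dialectal input.
    """
    if standard_score <= 0:
        raise ValueError("standard_score must be positive")
    return dialect_score / standard_score


# Made-up illustrative scores (percent), NOT results from the paper.
llm_ratio = robustness(dialect_score=60.0, standard_score=80.0)  # text-based model
slm_ratio = robustness(dialect_score=30.0, standard_score=60.0)  # speech-based model

print(f"LLM robustness: {llm_ratio:.2f}")
print(f"SLM robustness: {slm_ratio:.2f}")
```

Dividing by the standard-input score normalizes away differences in absolute performance, which is what allows models of different overall strength to be compared on the same scale.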
