Data and Speaker Roles The backbone TTS model is pretrained on LJSpeech [22] and the English subset of ESD [23], both Standard American En- glish only

Experimental Setup 3

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Few-Shot Accent Synthesis for ASR with LLM-Guided Phoneme Editing

cs.SD · 2026-04-30 · unverdicted · novelty 5.0

Few-shot TTS adaptation combined with LLM-guided phoneme editing produces synthetic accented speech that improves ASR word error rates on real accented audio even in cross-speaker and ultra-low-data settings.

citing papers explorer

Showing 1 of 1 citing paper.

Few-Shot Accent Synthesis for ASR with LLM-Guided Phoneme Editing cs.SD · 2026-04-30 · unverdicted · none · ref 4
Few-shot TTS adaptation combined with LLM-guided phoneme editing produces synthetic accented speech that improves ASR word error rates on real accented audio even in cross-speaker and ultra-low-data settings.

Data and Speaker Roles The backbone TTS model is pretrained on LJSpeech [22] and the English subset of ESD [23], both Standard American En- glish only

fields

years

verdicts

representative citing papers

citing papers explorer