Skerry-Ryan and Daisy Stanton and Yonghui Wu and Ron J

Yuxuan Wang · 2017 · DOI 10.21437/interspeech.2017-1452

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

OpenBibleTTS: Large-Scale Speech Resources and TTS Models for Low-Resource Languages

cs.CL · 2026-06-08 · accept · novelty 7.0

OpenBibleTTS supplies speech data and alignments for 37 underrepresented languages and shows that no single TTS system leads on all metrics, with Gemini-TTS highest in listener ratings but monolingual EveryVoice models strongest on intelligibility for several African languages.

Efficient ASR Training with Conversations that Never Happened

cs.CL · 2026-06-02 · unverdicted · novelty 6.0

Mixing 636 hours of LLM-generated synthetic conversations with 67 hours of real data outperforms a model trained on 2700 hours of real Hungarian speech on the BEA-Dialogue benchmark.

citing papers explorer

Showing 2 of 2 citing papers.

OpenBibleTTS: Large-Scale Speech Resources and TTS Models for Low-Resource Languages cs.CL · 2026-06-08 · accept · none · ref 17
OpenBibleTTS supplies speech data and alignments for 37 underrepresented languages and shows that no single TTS system leads on all metrics, with Gemini-TTS highest in listener ratings but monolingual EveryVoice models strongest on intelligibility for several African languages.
Efficient ASR Training with Conversations that Never Happened cs.CL · 2026-06-02 · unverdicted · none · ref 11
Mixing 636 hours of LLM-generated synthetic conversations with 67 hours of real data outperforms a model trained on 2700 hours of real Hungarian speech on the BEA-Dialogue benchmark.

Skerry-Ryan and Daisy Stanton and Yonghui Wu and Ron J

fields

years

verdicts

representative citing papers

citing papers explorer