Raon-OpenTTS provides an open 510K-hour curated speech dataset and DiT-based TTS models up to 1B parameters that achieve competitive WER and speaker similarity on benchmarks versus closed models trained on millions of hours.
L2- ARCTIC: A Non-native English Speech Corpus
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
citation-role summary
dataset 1
citation-polarity summary
years
2026 2verdicts
UNVERDICTED 2roles
dataset 1polarities
use dataset 1representative citing papers
The survey reviews the evolution of accent conversion from early DSP approaches to neural models, situating them in linguistic foundations and highlighting constraints, datasets, evaluations, and future directions.
citing papers explorer
-
Raon-OpenTTS: Open Models and Data for Robust Text-to-Speech
Raon-OpenTTS provides an open 510K-hour curated speech dataset and DiT-based TTS models up to 1B parameters that achieve competitive WER and speaker similarity on benchmarks versus closed models trained on millions of hours.
-
Accent Conversion: A Problem-Driven Survey of Sociolinguistic and Technical Constraints
The survey reviews the evolution of accent conversion from early DSP approaches to neural models, situating them in linguistic foundations and highlighting constraints, datasets, evaluations, and future directions.