YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

2021 · arXiv 2112.02418

2 Pith papers cite this work. Polarity classification is still in progress.

representative citing papers

Tibetan-TTS: Low-Resource Tibetan Speech Synthesis with Large Model Adaptation

cs.SD · 2026-05-04 · unverdicted · novelty 7.0

Large-model adaptation with Tibetan text handling produces natural speech from limited data, outperforming commercial systems.

MLAAD: The Multi-Language Audio Anti-Spoofing Dataset

cs.SD · 2024-01-17 · unverdicted · novelty 6.0

MLAAD provides a large-scale multi-language synthetic audio dataset for training and evaluating audio anti-spoofing models, showing better training performance than InTheWild and FakeOrReal and alternating superiority with ASVspoof 2019 across eight test sets.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Tibetan-TTS: Low-Resource Tibetan Speech Synthesis with Large Model Adaptation

cs.SD · 2026-05-04 · unverdicted · none · ref 18

Large-model adaptation with Tibetan text handling produces natural speech from limited data, outperforming commercial systems.

MLAAD: The Multi-Language Audio Anti-Spoofing Dataset

cs.SD · 2024-01-17 · unverdicted · none · ref 30

MLAAD provides a large-scale multi-language synthetic audio dataset for training and evaluating audio anti-spoofing models, showing better training performance than InTheWild and FakeOrReal and alternating superiority with ASVspoof 2019 across eight test sets.
