JETS: Jointly training FastSpeech2 and HiFi-GAN for end to end text to speech.arXiv preprint arXiv:2203.16852, 2022

Dan Lim, Sunghee Jung, Eesung Kim · 2022 · arXiv 2203.16852

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling

eess.AS · 2026-06-02 · unverdicted · novelty 8.0

WavTTS is the first raw-waveform diffusion TTS model using DiT flow matching and multi-scale mel supervision that approaches SOTA latent zero-shot performance while beating prior end-to-end models.

citing papers explorer

Showing 1 of 1 citing paper after filters.

WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling eess.AS · 2026-06-02 · unverdicted · none · ref 50
WavTTS is the first raw-waveform diffusion TTS model using DiT flow matching and multi-scale mel supervision that approaches SOTA latent zero-shot performance while beating prior end-to-end models.

JETS: Jointly training FastSpeech2 and HiFi-GAN for end to end text to speech.arXiv preprint arXiv:2203.16852, 2022

fields

years

verdicts

representative citing papers

citing papers explorer