International Conference on Machine Learning , pages=

Grad-tts: A diffusion probabilistic model for text-to-speech , author= · 2021

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

On the Limits of Latent Reuse in Diffusion Models

stat.ML · 2026-05-13 · unverdicted · novelty 5.0

Reusing source latent spaces in diffusion models under distribution shift produces target score error set by principal-angle misalignment and diffusion-time-amplified ambient noise.

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

eess.AS · 2024-10-09 · unverdicted · novelty 5.0

F5-TTS generates natural speech from text via flow matching on DiT with simple text padding, ConvNeXt refinement, and sway sampling, trained on 100K hours multilingual data.

citing papers explorer

Showing 2 of 2 citing papers.

On the Limits of Latent Reuse in Diffusion Models stat.ML · 2026-05-13 · unverdicted · none · ref 45
Reusing source latent spaces in diffusion models under distribution shift produces target score error set by principal-angle misalignment and diffusion-time-amplified ambient noise.
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching eess.AS · 2024-10-09 · unverdicted · none · ref 73
F5-TTS generates natural speech from text via flow matching on DiT with simple text padding, ConvNeXt refinement, and sway sampling, trained on 100K hours multilingual data.

International Conference on Machine Learning , pages=

fields

years

verdicts

representative citing papers

citing papers explorer