DART: disentanglement of accent and speaker representation in multispeaker text-to-speech,

· 2024 · arXiv 2410.13342

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Joycent: Diffusion-based Accent TTS without Accented Phone Prediction

cs.SD · 2026-06-15 · unverdicted · novelty 6.0

Joycent uses diffusion modeling and conditional layer normalization to synthesize accented speech from standard phones and references, claiming better accentedness and speaker preservation than two-stage baselines.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Joycent: Diffusion-based Accent TTS without Accented Phone Prediction cs.SD · 2026-06-15 · unverdicted · none · ref 14
Joycent uses diffusion modeling and conditional layer normalization to synthesize accented speech from standard phones and references, claiming better accentedness and speaker preservation than two-stage baselines.

DART: disentanglement of accent and speaker representation in multispeaker text-to-speech,

fields

years

verdicts

representative citing papers

citing papers explorer