prosody, by conditioning synthesis on la- tent representations [8–12] in addition to text

Introduction Recentend-to-endneuralTTSmodels[1–3]havebeenextended to enable control of speaker identity [4–7] as well as unlabelled speech attributes, e

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning

cs.CL · 2019-07-09 · unverdicted · novelty 7.0

A Tacotron model with phonemic inputs and adversarial disentanglement enables cross-lingual voice cloning without parallel data, producing intelligible speech in native and foreign accents.

citing papers explorer

Showing 1 of 1 citing paper.

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning cs.CL · 2019-07-09 · unverdicted · none · ref 1
A Tacotron model with phonemic inputs and adversarial disentanglement enables cross-lingual voice cloning without parallel data, producing intelligible speech in native and foreign accents.

prosody, by conditioning synthesis on la- tent representations [8–12] in addition to text

fields

years

verdicts

representative citing papers

citing papers explorer