Informed blending of databases for emotional speech synthesis,
1 Pith paper cites this work. Polarity classification is still being indexed.
Fields: eess.AS (1)
Years: 2019 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
- End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training
GST-Tacotron with cross-entropy loss on style tokens outperforms standard Tacotron for emotional speech synthesis with only 5% emotion-labeled data and approaches full-label performance.