Exploring Transfer Learning for Low Resource Emotional TTS

2019 · cs.SD · arXiv 1901.04276

1 Pith paper cites this work. Polarity classification is still indexing.


abstract

During the last few years, spoken language technologies have improved considerably thanks to Deep Learning. However, Deep Learning-based algorithms require amounts of data that are often difficult and costly to gather. In particular, modeling the variability in speech across different speakers, styles, or emotions with little data remains challenging. In this paper, we investigate how to leverage fine-tuning of a pre-trained Deep Learning-based TTS model to synthesize speech from a small dataset of another speaker. We then investigate the possibility of adapting this model to emotional TTS by fine-tuning the neutral TTS model on a small emotional dataset.

representative citing papers

A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach

eess.AS · 2019-07-05 · unverdicted · novelty 3.0

A methodology is proposed for emotional text-to-speech using emotional data collection, transfer-learning-based annotation of expressiveness features, and fine-tuning of a neutral TTS model.

citing papers explorer

Showing 1 of 1 citing paper.

A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach

eess.AS · 2019-07-05 · unverdicted · none · ref 32 · internal anchor
