Deep Denoising Auto-encoder for Statistical Speech Synthesis

Junichi Yamagishi; Shinji Takaki; Zhenzhou Wu

arxiv: 1506.05268 · v1 · pith:2URS6EUKnew · submitted 2015-06-17 · 💻 cs.SD · cs.LG

Deep Denoising Auto-encoder for Statistical Speech Synthesis

Zhenzhou Wu , Shinji Takaki , Junichi Yamagishi This is my paper

classification 💻 cs.SD cs.LG

keywords featuresspeechauto-encoderdeepdenoisingexperimentsextractsynthesis

0 comments

read the original abstract

This paper proposes a deep denoising auto-encoder technique to extract better acoustic features for speech synthesis. The technique allows us to automatically extract low-dimensional features from high dimensional spectral features in a non-linear, data-driven, unsupervised way. We compared the new stochastic feature extractor with conventional mel-cepstral analysis in analysis-by-synthesis and text-to-speech experiments. Our results confirm that the proposed method increases the quality of synthetic speech in both experiments.

This paper has not been read by Pith yet.

Deep Denoising Auto-encoder for Statistical Speech Synthesis

discussion (0)