To get around this problem, we ﬁrst pretrain our network as an autoencoder with a large sin- gle speaker TTS corpus [46], with the source and target voices being the same

Autoencoder pretraining, transfer learning V oice conversion with DNNs for parallel data is a difﬁcult undertaking owing to the lack of availability of large multispeaker voic

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Hierarchical Sequence to Sequence Voice Conversion with Limited Data

eess.AS · 2019-07-15 · unverdicted · novelty 4.0

Hierarchical seq2seq model for parallel voice conversion pretrained as autoencoder on single-speaker data then adapted to limited multispeaker data, using mel spectrograms converted via wavenet vocoder.

citing papers explorer

Showing 1 of 1 citing paper.

Hierarchical Sequence to Sequence Voice Conversion with Limited Data eess.AS · 2019-07-15 · unverdicted · none · ref 5
Hierarchical seq2seq model for parallel voice conversion pretrained as autoencoder on single-speaker data then adapted to limited multispeaker data, using mel spectrograms converted via wavenet vocoder.

To get around this problem, we ﬁrst pretrain our network as an autoencoder with a large sin- gle speaker TTS corpus [46], with the source and target voices being the same

fields

years

verdicts

representative citing papers

citing papers explorer