Likewise, it has been demonstrated that ASR can be handled excellently by seq2seq architectures

Introduction

Recently, sequence-to-sequence models have been adapted with great success to produce realistic-sounding speech in TTS systems [1, 2, 3, 4, 5].

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Hierarchical Sequence to Sequence Voice Conversion with Limited Data

eess.AS · 2019-07-15 · unverdicted · novelty 4.0

Hierarchical seq2seq model for parallel voice conversion, pretrained as an autoencoder on single-speaker data and then adapted to limited multispeaker data, using mel spectrograms converted via a WaveNet vocoder.

