pith. sign in

arxiv: 1904.10760 · v1 · pith:F5A2LCV4new · submitted 2019-04-23 · 💻 cs.CL · cs.SD· eess.AS

End-to-End Spoken Language Translation

classification 💻 cs.CL cs.SDeess.AS
keywords languagespokensentencesmethodmodelnetworkunseenachieves
0
0 comments X
read the original abstract

In this paper, we address the task of spoken language understanding. We present a method for translating spoken sentences from one language into spoken sentences in another language. Given spectrogram-spectrogram pairs, our model can be trained completely from scratch to translate unseen sentences. Our method consists of a pyramidal-bidirectional recurrent network combined with a convolutional network to output sentence-level spectrograms in the target language. Empirically, our model achieves competitive performance with state-of-the-art methods on multiple languages and can generalize to unseen speakers.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.