pith. machine review for the scientific record. sign in

arxiv: 1810.13088 · v2 · submitted 2018-10-31 · 💻 cs.CL · cs.LG· cs.SD· eess.AS

Recognition: unknown

Attention-based sequence-to-sequence model for speech recognition: development of state-of-the-art system on LibriSpeech and its application to non-native English

Authors on Pith no claims yet
classification 💻 cs.CL cs.LGcs.SDeess.AS
keywords speechenglishstate-of-the-artsystemattention-baseddevelopmentlibrispeechnon-native
0
0 comments X
read the original abstract

Recent research has shown that attention-based sequence-to-sequence models such as Listen, Attend, and Spell (LAS) yield comparable results to state-of-the-art ASR systems on various tasks. In this paper, we describe the development of such a system and demonstrate its performance on two tasks: first we achieve a new state-of-the-art word error rate of 3.43% on the test clean subset of LibriSpeech English data; second on non-native English speech, including both read speech and spontaneous speech, we obtain very competitive results compared to a conventional system built with the most updated Kaldi recipe.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.