Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition

Ananda Theertha Suresh; Daniel Holtmann-Rice; Felix Yu; Hank Liao; Michael Nirschl; Shankar Kumar

arxiv: 1711.05448 · v1 · pith:ZKI3M4LTnew · submitted 2017-11-15 · 📊 stat.ML · cs.CL· cs.LG

Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition

Shankar Kumar , Michael Nirschl , Daniel Holtmann-Rice , Hank Liao , Ananda Theertha Suresh , Felix Yu This is my paper

classification 📊 stat.ML cs.CLcs.LG

keywords speechmodelsrecognitionlatticen-gramrescoringalgorithmsintegrate

0 comments

read the original abstract

Recurrent neural network (RNN) language models (LMs) and Long Short Term Memory (LSTM) LMs, a variant of RNN LMs, have been shown to outperform traditional N-gram LMs on speech recognition tasks. However, these models are computationally more expensive than N-gram LMs for decoding, and thus, challenging to integrate into speech recognizers. Recent research has proposed the use of lattice-rescoring algorithms using RNNLMs and LSTMLMs as an efficient strategy to integrate these models into a speech recognition system. In this paper, we evaluate existing lattice rescoring algorithms along with new variants on a YouTube speech recognition task. Lattice rescoring using LSTMLMs reduces the word error rate (WER) for this task by 8\% relative to the WER obtained using an N-gram LM.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Non-Intrusive Automatic Speech Recognition Refinement: A Survey
eess.AS 2025-08 accept novelty 4.0

A survey that classifies non-intrusive ASR refinement methods into five categories, reviews domain adaptation and evaluation datasets, proposes standardized metrics, and identifies future research directions.