
arXiv: 1611.05104 · v2 · submitted 2016-11-16 · cs.CL · cs.AI


A Way out of the Odyssey: Analyzing and Combining Recent Insights for LSTMs

keywords: lstms, deep, improvements, many, model, models, modifications, recent

LSTMs have become a basic building block for many deep NLP models. In recent years, many improvements and variations have been proposed for deep sequence models in general, and LSTMs in particular. We propose and analyze a series of augmentations and modifications to LSTM networks that improve performance on text classification datasets. We observe compounding improvements over traditional LSTMs using Monte Carlo test-time model averaging, average pooling, and residual connections, along with four other suggested modifications. Our analysis provides a simple, reliable, and high-quality baseline model.
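
The abstract names Monte Carlo test-time model averaging without describing it. As commonly understood, the idea is to keep dropout active at inference, run several stochastic forward passes, and average the predicted class probabilities. The sketch below illustrates that general idea only; it is not the paper's implementation. The toy linear classifier, the `features` and `weights` values, and every function name are hypothetical stand-ins for a trained LSTM and its pooled sentence representation.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dropout(features, drop_prob, rng):
    """Inverted dropout: zero each unit with probability drop_prob, rescale the rest."""
    keep = 1.0 - drop_prob  # assumes drop_prob < 1
    return [f / keep if rng.random() < keep else 0.0 for f in features]


def classify(features, weights):
    """Toy linear classifier (hypothetical): one weight row per class, then softmax."""
    scores = [sum(w * f for w, f in zip(row, features)) for row in weights]
    return softmax(scores)


def mc_average_predict(features, weights, drop_prob=0.5, n_samples=100, seed=0):
    """Average class probabilities over n_samples forward passes with dropout on."""
    rng = random.Random(seed)
    mean_probs = [0.0] * len(weights)
    for _ in range(n_samples):
        probs = classify(dropout(features, drop_prob, rng), weights)
        mean_probs = [m + p / n_samples for m, p in zip(mean_probs, probs)]
    return mean_probs


# Hypothetical pooled sentence representation and two-class weights.
features = [0.8, -0.3, 0.5, 0.1]
weights = [[1.0, 0.2, -0.5, 0.3], [-0.4, 0.9, 0.7, -0.2]]
probs = mc_average_predict(features, weights)
print(probs)
```

Averaging probabilities rather than hard labels keeps the result a valid distribution and lets uncertain samples contribute less. Fixing the seed makes the averaged prediction reproducible across runs.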
