Recurrent Memory Networks for Language Modeling

Arianna Bisazza; Christof Monz; Ke Tran

arxiv: 1601.01272 · v2 · pith:SEVMLXZZnew · submitted 2016-01-06 · 💻 cs.CL

Recurrent Memory Networks for Language Modeling

Ke Tran , Arianna Bisazza , Christof Monz This is my paper

classification 💻 cs.CL

keywords languagememorymodelingrecurrentsentencechallengecompletionlarge

0 comments

read the original abstract

Recurrent Neural Networks (RNN) have obtained excellent result in many natural language processing (NLP) tasks. However, understanding and interpreting the source of this success remains a challenge. In this paper, we propose Recurrent Memory Network (RMN), a novel RNN architecture, that not only amplifies the power of RNN but also facilitates our understanding of its internal functioning and allows us to discover underlying patterns in data. We demonstrate the power of RMN on language modeling and sentence completion tasks. On language modeling, RMN outperforms Long Short-Term Memory (LSTM) network on three large German, Italian, and English dataset. Additionally we perform in-depth analysis of various linguistic dimensions that RMN captures. On Sentence Completion Challenge, for which it is essential to capture sentence coherence, our RMN obtains 69.2% accuracy, surpassing the previous state-of-the-art by a large margin.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

R-Transformer: Recurrent Neural Network Enhanced Transformer
cs.LG 2019-07 unverdicted novelty 6.0

R-Transformer integrates RNNs with multi-head attention to model local and global sequence dependencies without position embeddings and reports large-margin gains over prior methods on diverse tasks.