Recurrent Dropout without Memory Loss

Aliaksei Severyn; Erhardt Barth; Stanislau Semeniuta

arxiv: 1603.05118 · v2 · pith:NLHNXJWXnew · submitted 2016-03-16 · 💻 cs.CL

Recurrent Dropout without Memory Loss

Stanislau Semeniuta , Aliaksei Severyn , Erhardt Barth This is my paper

classification 💻 cs.CL

keywords dropoutfeed-forwardmemoryrecurrentapproachconnectionslossnetwork

0 comments

read the original abstract

This paper presents a novel approach to recurrent neural network (RNN) regularization. Differently from the widely adopted dropout method, which is applied to \textit{forward} connections of feed-forward architectures or RNNs, we propose to drop neurons directly in \textit{recurrent} connections in a way that does not cause loss of long-term memory. Our approach is as easy to implement and apply as the regular feed-forward dropout and we demonstrate its effectiveness for Long Short-Term Memory network, the most popular type of RNN cells. Our experiments on NLP benchmarks show consistent improvements even when combined with conventional feed-forward dropout.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks
cs.CL 2019-07 unverdicted novelty 6.0

DropAttention regularizes attention weights in fully-connected self-attention networks to reduce overfitting and improve performance.