
arxiv: 1508.03790 · v4 · pith:223KUFLVnew · submitted 2015-08-16 · 💻 cs.NE · cs.CL

Depth-Gated LSTM

classification 💻 cs.NE cs.CL
keywords: memory, gate, layer, cell, dependence, depth, function, linear

In this short note, we present an extension of long short-term memory (LSTM) neural networks that uses a depth gate to connect the memory cells of adjacent layers. Doing so introduces a linear dependence between lower- and upper-layer recurrent units. Importantly, the linear dependence is gated through a gating function, which we call the depth gate. This gate is a function of the lower layer's memory cell, the input to this layer, and this layer's past memory cell. We conducted experiments and verified that this new LSTM architecture improved machine translation and language modeling performance.
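
The gating described in the abstract can be illustrated with a minimal sketch. The abstract names only the three quantities the depth gate depends on, so everything else here is an assumption made for illustration: the sigmoid nonlinearity, the weight names (`W_x`, `w_c`, `w_l`, `b`), the per-unit weights on the two memory cells, and the requirement that both layers have the same number of cells. The helper names `depth_gate` and `gated_cell_update` are hypothetical.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def depth_gate(x, c_prev, c_lower, W_x, w_c, w_l, b):
    """Depth gate for one layer at one time step (assumed parameterization).

    x        input vector to this layer
    c_prev   this layer's memory cell from the previous time step
    c_lower  lower layer's memory cell at the current time step
    W_x      weight matrix on the input (list of rows)
    w_c, w_l per-unit weights on c_prev and c_lower (an assumption)
    b        per-unit bias
    """
    gate = []
    for k in range(len(c_prev)):
        z = b[k] + sum(W_x[k][j] * x[j] for j in range(len(x)))
        z += w_c[k] * c_prev[k] + w_l[k] * c_lower[k]
        gate.append(sigmoid(z))
    return gate


def gated_cell_update(c_prev, c_lower, forget, input_gate, candidate, depth):
    """Usual LSTM cell update plus a gated linear term from the lower layer."""
    return [
        depth[k] * c_lower[k] + forget[k] * c_prev[k] + input_gate[k] * candidate[k]
        for k in range(len(c_prev))
    ]


# With all gate weights at zero, every depth-gate unit is sigmoid(0) = 0.5.
zero_W = [[0.0, 0.0], [0.0, 0.0]]
zeros = [0.0, 0.0]
d = depth_gate([1.0, 2.0], zeros, [1.0, -1.0], zero_W, zeros, zeros, zeros)
c_new = gated_cell_update(zeros, [1.0, -1.0], [1.0, 1.0], zeros, zeros, d)
```

When the depth gate is zero, the update reduces to the usual LSTM cell update; when it is nonzero, the lower layer's memory cell enters the upper layer's cell linearly, which is the dependence the abstract refers to.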
