pith. sign in

arxiv: 1607.03085 · v3 · pith:VQVNNDQXnew · submitted 2016-07-11 · 💻 cs.LG · cs.NE

Recurrent Memory Array Structures

classification 💻 cs.LG cs.NE
keywords memoryreportachievingapproacharchitecturearrayarray-lstmaugmenting
0
0 comments X
read the original abstract

The following report introduces ideas augmenting standard Long Short Term Memory (LSTM) architecture with multiple memory cells per hidden unit in order to improve its generalization capabilities. It considers both deterministic and stochastic variants of memory operation. It is shown that the nondeterministic Array-LSTM approach improves state-of-the-art performance on character level text prediction achieving 1.402 BPC on enwik8 dataset. Furthermore, this report estabilishes baseline neural-based results of 1.12 BPC and 1.19 BPC for enwik9 and enwik10 datasets respectively.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.