pith. sign in

arxiv: 1605.03835 · v1 · pith:YZIOR2NHnew · submitted 2016-05-12 · 💻 cs.CL · cs.LG· stat.ML

Noisy Parallel Approximate Decoding for Conditional Recurrent Language Model

classification 💻 cs.CL cs.LGstat.ML
keywords decodingalgorithmsconditionallanguagenetworkneuralnovelrecurrent
0
0 comments X
read the original abstract

Recent advances in conditional recurrent language modelling have mainly focused on network architectures (e.g., attention mechanism), learning algorithms (e.g., scheduled sampling and sequence-level training) and novel applications (e.g., image/video description generation, speech recognition, etc.) On the other hand, we notice that decoding algorithms/strategies have not been investigated as much, and it has become standard to use greedy or beam search. In this paper, we propose a novel decoding strategy motivated by an earlier observation that nonlinear hidden layers of a deep neural network stretch the data manifold. The proposed strategy is embarrassingly parallelizable without any communication overhead, while improving an existing decoding algorithm. We extensively evaluate it with attention-based neural machine translation on the task of En->Cz translation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.