On Compositionality in Neural Machine Translation

Florian Metze; Vaibhav Kumar; Vikas Raunak

arxiv: 1911.01497 · v3 · pith:ZRXX6SDQnew · submitted 2019-11-04 · 💻 cs.CL · cs.LG


classification 💻 cs.CL cs.LG

keywords: model, ability, compositionality, machine, neural, productivity, properties, sequence


We investigate two specific manifestations of compositionality in Neural Machine Translation (NMT): (1) Productivity, the ability of the model to extend its predictions beyond the lengths observed in training data, and (2) Systematicity, the ability of the model to systematically recombine known parts and rules. We evaluate a standard Sequence to Sequence model on tests designed to assess these two properties in NMT. We quantitatively demonstrate that inadequate temporal processing, in the form of poor encoder representations, is a bottleneck for both Productivity and Systematicity. We propose a simple pre-training mechanism which improves model performance on the two properties and leads to a significant improvement in BLEU scores.
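
As an illustration of what a Productivity test could look like, the sketch below splits a parallel corpus by source length, so that a model trained on the short pairs is evaluated only on sentences longer than any it has seen. This is a minimal sketch under assumptions, not the authors' protocol: the function name `length_split`, the whitespace tokenization, the threshold, and the toy sentence pairs are all hypothetical.

```python
from typing import List, Tuple

Pair = Tuple[str, str]  # (source sentence, target sentence)


def length_split(pairs: List[Pair], max_train_len: int) -> Tuple[List[Pair], List[Pair]]:
    """Split parallel pairs by source length, counted in whitespace tokens.

    Pairs with at most `max_train_len` source tokens go to the training set;
    strictly longer pairs go to the held-out set. (Hypothetical helper.)
    """
    train: List[Pair] = []
    test: List[Pair] = []
    for src, tgt in pairs:
        if len(src.split()) <= max_train_len:
            train.append((src, tgt))
        else:
            test.append((src, tgt))
    return train, test


# Toy data, invented for illustration only.
pairs = [
    ("ein Hund", "a dog"),
    ("der Hund schläft", "the dog sleeps"),
    ("der kleine Hund schläft im Garten", "the small dog sleeps in the garden"),
]
train, test = length_split(pairs, max_train_len=3)
```

Comparing BLEU on `test` against BLEU on a length-matched held-out set would then give a rough indication of how much performance degrades beyond the observed training lengths.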
