Generalizing Back-Translation in Neural Machine Translation

Hermann Ney; Julian Schamper; Miguel Gra\c{c}a; Shahram Khadivi; Yunsu Kim

arxiv: 1906.07286 · v1 · pith:LTX7OYF2new · submitted 2019-06-17 · 💻 cs.CL · cs.LG

Generalizing Back-Translation in Neural Machine Translation

Miguel Gra\c{c}a , Yunsu Kim , Julian Schamper , Shahram Khadivi , Hermann Ney This is my paper

classification 💻 cs.CL cs.LG

keywords back-translationdatamodeltranslationformulationmachineneuralsampling

0 comments

read the original abstract

Back-translation - data augmentation by translating target monolingual data - is a crucial component in modern neural machine translation (NMT). In this work, we reformulate back-translation in the scope of cross-entropy optimization of an NMT model, clarifying its underlying mathematical assumptions and approximations beyond its heuristic usage. Our formulation covers broader synthetic data generation schemes, including sampling from a target-to-source NMT model. With this formulation, we point out fundamental problems of the sampling-based approaches and propose to remedy them by (i) disabling label smoothing for the target-to-source model and (ii) sampling from a restricted search space. Our statements are investigated on the WMT 2018 German - English news translation task.

This paper has not been read by Pith yet.

Generalizing Back-Translation in Neural Machine Translation

discussion (0)