Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation

Hua Wu; Maosong Sun; Shiqi Shen; Wei He; Yang Liu; Yong Cheng; Zhongjun He

arxiv: 1512.04650 · v2 · pith:RRLNK5Y4new · submitted 2015-12-15 · 💻 cs.CL

Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation

Yong Cheng , Shiqi Shen , Zhongjun He , Wei He , Hua Wu , Maosong Sun , Yang Liu This is my paper

classification 💻 cs.CL

keywords trainingtranslationagreement-basedattention-basedjointmachinemodelsneural

0 comments

read the original abstract

The attentional mechanism has proven to be effective in improving end-to-end neural machine translation. However, due to the intricate structural divergence between natural languages, unidirectional attention-based models might only capture partial aspects of attentional regularities. We propose agreement-based joint training for bidirectional attention-based end-to-end neural machine translation. Instead of training source-to-target and target-to-source translation models independently,our approach encourages the two complementary models to agree on word alignment matrices on the same training data. Experiments on Chinese-English and English-French translation tasks show that agreement-based joint training significantly improves both alignment and translation quality over independent training.

This paper has not been read by Pith yet.

Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation

discussion (0)