Minimum Risk Training for Neural Machine Translation

Hua Wu; Maosong Sun; Shiqi Shen; Wei He; Yang Liu; Yong Cheng; Zhongjun He

arxiv: 1512.02433 · v3 · pith:LBHFBFNAnew · submitted 2015-12-08 · 💻 cs.CL

Minimum Risk Training for Neural Machine Translation

Shiqi Shen , Yong Cheng , Zhongjun He , Wei He , Hua Wu , Maosong Sun , Yang Liu This is my paper

classification 💻 cs.CL

keywords neuralmachineminimumrisktrainingtranslationapproachestimation

0 comments

read the original abstract

We propose minimum risk training for end-to-end neural machine translation. Unlike conventional maximum likelihood estimation, minimum risk training is capable of optimizing model parameters directly with respect to arbitrary evaluation metrics, which are not necessarily differentiable. Experiments show that our approach achieves significant improvements over maximum likelihood estimation on a state-of-the-art neural machine translation system across various languages pairs. Transparent to architectures, our approach can be applied to more neural networks and potentially benefit more NLP tasks.

This paper has not been read by Pith yet.

Minimum Risk Training for Neural Machine Translation

discussion (0)