
arxiv: 1805.01553 · v3 · pith:TGJJDDSJnew · submitted 2018-05-03 · 💻 cs.CL · stat.ML

A Reinforcement Learning Approach to Interactive-Predictive Neural Machine Translation

keywords: human, effort, feedback, translations, approach, every, interactive-predictive, learning

We present an approach to interactive-predictive neural machine translation that attempts to reduce human effort from three directions. First, instead of requiring humans to select, correct, or delete segments, we employ the idea of learning from human reinforcements in the form of judgments on the quality of partial translations. Second, human effort is further reduced by using the entropy of word predictions as an uncertainty criterion to trigger feedback requests. Lastly, online updates of the model parameters after every interaction allow the model to adapt quickly. We show in simulation experiments that reward signals on partial translations significantly improve character F-score and BLEU compared to feedback on full translations only, while human effort can be reduced to an average of $5$ feedback requests per input.
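
The entropy-based trigger mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the per-step thresholding rule, and the threshold value are all assumptions made for illustration, and the paper may aggregate entropy differently (for example, over a whole partial translation).

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of one word-prediction distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def feedback_positions(step_distributions, threshold):
    """Return the decoding steps at which feedback would be requested.

    Assumption (hypothetical rule): feedback is requested whenever the
    entropy of the model's word distribution at a step exceeds `threshold`.
    """
    return [
        t
        for t, probs in enumerate(step_distributions)
        if entropy(probs) > threshold
    ]


# Toy distributions over a two-word vocabulary for three decoding steps.
steps = [[1.0, 0.0], [0.5, 0.5], [0.9, 0.1]]
requested = feedback_positions(steps, threshold=0.5)
```

In this toy example only the second step, where the model is maximally uncertain between two words, exceeds the threshold. A higher threshold would lead to fewer feedback requests, which is the kind of trade-off against human effort that the abstract describes.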

