The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task

Amr Sharaf; Hal Daum\'e III; Khanh Nguyen; Kiant\'e Brantley; Shi Feng

arxiv: 1708.01318 · v2 · pith:C6FBB4JPnew · submitted 2017-08-03 · 💻 cs.CL · cs.AI· cs.HC

The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task

Amr Sharaf , Shi Feng , Khanh Nguyen , Kiant\'e Brantley , Hal Daum\'e III This is my paper

classification 💻 cs.CL cs.AIcs.HC

keywords banditlearningtranslationfeedbackmachinesystemtaskadaptation

0 comments

read the original abstract

We describe the University of Maryland machine translation systems submitted to the WMT17 German-English Bandit Learning Task. The task is to adapt a translation system to a new domain, using only bandit feedback: the system receives a German sentence to translate, produces an English sentence, and only gets a scalar score as feedback. Targeting these two challenges (adaptation and bandit learning), we built a standard neural machine translation system and extended it in two ways: (1) robust reinforcement learning techniques to learn effectively from the bandit feedback, and (2) domain adaptation using data selection from a large corpus of parallel data.

This paper has not been read by Pith yet.

The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task

discussion (0)