pith. sign in

arxiv: 1708.01318 · v2 · pith:C6FBB4JPnew · submitted 2017-08-03 · 💻 cs.CL · cs.AI· cs.HC

The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task

classification 💻 cs.CL cs.AIcs.HC
keywords banditlearningtranslationfeedbackmachinesystemtaskadaptation
0
0 comments X
read the original abstract

We describe the University of Maryland machine translation systems submitted to the WMT17 German-English Bandit Learning Task. The task is to adapt a translation system to a new domain, using only bandit feedback: the system receives a German sentence to translate, produces an English sentence, and only gets a scalar score as feedback. Targeting these two challenges (adaptation and bandit learning), we built a standard neural machine translation system and extended it in two ways: (1) robust reinforcement learning techniques to learn effectively from the bandit feedback, and (2) domain adaptation using data selection from a large corpus of parallel data.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.