Simultaneous Translation with Flexible Policy via Restricted Imitation Learning

Baigong Zheng; Liang Huang; Mingbo Ma; Renjie Zheng

arxiv: 1906.01135 · v2 · pith:VLXYXJAInew · submitted 2019-06-04 · 💻 cs.CL

Simultaneous Translation with Flexible Policy via Restricted Imitation Learning

Baigong Zheng , Renjie Zheng , Mingbo Ma , Liang Huang This is my paper

classification 💻 cs.CL

keywords policiessimultaneoustranslationflexiblelearningmodelrestrictedwork

0 comments

read the original abstract

Simultaneous translation is widely useful but remains one of the most difficult tasks in NLP. Previous work either uses fixed-latency policies, or train a complicated two-staged model using reinforcement learning. We propose a much simpler single model that adds a `delay' token to the target vocabulary, and design a restricted dynamic oracle to greatly simplify training. Experiments on Chinese<->English simultaneous translation show that our work leads to flexible policies that achieve better BLEU scores and lower latencies compared to both fixed and RL-learned policies.

This paper has not been read by Pith yet.

Simultaneous Translation with Flexible Policy via Restricted Imitation Learning

discussion (0)