Incorporating Structural Alignment Biases into an Attentional Neural Translation Model

Chris Dyer; Cong Duy Vu Hoang; Ekaterina Vymolova; Gholamreza Haffari; Kaisheng Yao; Trevor Cohn

arxiv: 1601.01085 · v1 · pith:QQMOWPJ6new · submitted 2016-01-06 · 💻 cs.CL

keywords: translation, model, models, attentional, biases, neural, alignment, several

Neural encoder-decoder models of machine translation have achieved impressive results, rivalling traditional translation models. However, their modelling formulation is overly simplistic and omits several key inductive biases built into traditional models. In this paper we extend the attentional neural translation model to include structural biases from word-based alignment models, including positional bias, Markov conditioning, fertility and agreement over translation directions. We show improvements over a baseline attentional model and a standard phrase-based model across several language pairs, evaluating on difficult languages in a low-resource setting.
