arxiv: 1503.00107 · v1 · pith:5SP4XGFInew · submitted 2015-02-28 · 💻 cs.CL · cs.NE

Non-linear Learning for Statistical Machine Translation

classification 💻 cs.CL cs.NE
keywords: features, linear, translation, model, non-linear, learning, machine, combination

Modern statistical machine translation (SMT) systems usually use a linear combination of features to model the quality of each translation hypothesis. The linear combination assumes that all features are linearly related and constrains each feature to interact with the remaining features in a linear manner, which may limit the expressive power of the model and lead to an under-fit model on the current data. In this paper, we propose a non-linear model of translation-hypothesis quality based on neural networks, which allows more complex interactions between features. A learning framework is presented for training the non-linear models. We also discuss possible heuristics for designing the network structure that may improve non-linear learning performance. Experimental results show that, with the basic features of a hierarchical phrase-based machine translation system, our method produces translations that are better than those of a linear model.
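
As a rough illustration of the contrast drawn in the abstract, the sketch below scores a translation hypothesis once with a weighted sum of its features and once with a small feed-forward network. It is a minimal sketch under stated assumptions, not the paper's model: the single tanh hidden layer, the function names, and all feature values and weights are hypothetical, since the abstract does not specify the network structure or the training procedure.

```python
import math


def linear_score(features, weights):
    """Linear model: the score is a weighted sum of the feature values."""
    return sum(w * f for w, f in zip(weights, features))


def nonlinear_score(features, hidden_weights, hidden_bias, output_weights):
    """Assumed one-hidden-layer network: tanh hidden units, linear output.

    Each hidden unit mixes all features before the non-linearity, so the
    contribution of one feature can depend on the values of the others.
    """
    hidden = [
        math.tanh(sum(w * f for w, f in zip(row, features)) + b)
        for row, b in zip(hidden_weights, hidden_bias)
    ]
    return sum(w * h for w, h in zip(output_weights, hidden))


def best_hypothesis(hypotheses, score_fn):
    """Pick the hypothesis whose feature vector gets the highest score."""
    return max(hypotheses, key=score_fn)


# Hypothetical feature vectors (for example log-probabilities and a length
# count) for two competing hypotheses; the numbers are made up.
hypotheses = [
    [-1.2, -0.5, 3.0],
    [-0.8, -0.9, 4.0],
]

# Hypothetical parameters; in practice these would be learned.
linear_weights = [1.0, 0.5, -0.1]
hidden_weights = [[0.6, -0.4, 0.1], [-0.3, 0.8, 0.05]]
hidden_bias = [0.0, 0.1]
output_weights = [1.0, -0.5]

best_linear = best_hypothesis(
    hypotheses, lambda h: linear_score(h, linear_weights)
)
best_nonlinear = best_hypothesis(
    hypotheses,
    lambda h: nonlinear_score(h, hidden_weights, hidden_bias, output_weights),
)
```

Both scorers plug into the same selection step, so the only thing that changes between the two settings is the function that maps a feature vector to a score.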
