pith. sign in

arxiv: 1601.01073 · v1 · pith:DKPVMI2Hnew · submitted 2016-01-06 · 💻 cs.CL · stat.ML

Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism

classification 💻 cs.CL stat.ML
keywords languagetranslationmodelmulti-waymultilingualneuralpairsproposed
0
0 comments X
read the original abstract

We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. In particular, we observe that the proposed model significantly improves the translation quality of low-resource language pairs.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges

    cs.CL 2019-07 unverdicted novelty 5.0

    A single multilingual NMT model for 103 languages trained on 25B examples demonstrates transfer learning benefits for low-resource languages.

  2. Improving Zero-shot Translation with Language-Independent Constraints

    cs.CL 2019-06 unverdicted novelty 4.0

    Language-independent constraints and regularization in multilingual Transformer NMT yield a 2.23 BLEU average gain on zero-shot pairs from the IWSLT 2017 dataset.