Marian: Cost-effective High-Quality Neural Machine Translation in C++

Anthony Aue; Hieu Hoang; Kenneth Heafield; Marcin Junczys-Dowmunt; Roman Grundkiewicz

arxiv: 1805.12096 · v1 · pith:LSGFD6VSnew · submitted 2018-05-30 · 💻 cs.CL

Marian: Cost-effective High-Quality Neural Machine Translation in C++

Marcin Junczys-Dowmunt , Kenneth Heafield , Hieu Hoang , Roman Grundkiewicz , Anthony Aue This is my paper

classification 💻 cs.CL

keywords high-qualitymarianmethodssharedtasktransformerattentionauto-tuning

0 comments

read the original abstract

This paper describes the submissions of the "Marian" team to the WNMT 2018 shared task. We investigate combinations of teacher-student training, low-precision matrix products, auto-tuning and other methods to optimize the Transformer model on GPU and CPU. By further integrating these methods with the new averaging attention networks, a recently introduced faster Transformer variant, we create a number of high-quality, high-performance models on the GPU and CPU, dominating the Pareto frontier for this shared task.

This paper has not been read by Pith yet.

Marian: Cost-effective High-Quality Neural Machine Translation in C++

discussion (0)