pith. sign in

arxiv: 1805.12096 · v1 · pith:LSGFD6VSnew · submitted 2018-05-30 · 💻 cs.CL

Marian: Cost-effective High-Quality Neural Machine Translation in C++

classification 💻 cs.CL
keywords high-qualitymarianmethodssharedtasktransformerattentionauto-tuning
0
0 comments X
read the original abstract

This paper describes the submissions of the "Marian" team to the WNMT 2018 shared task. We investigate combinations of teacher-student training, low-precision matrix products, auto-tuning and other methods to optimize the Transformer model on GPU and CPU. By further integrating these methods with the new averaging attention networks, a recently introduced faster Transformer variant, we create a number of high-quality, high-performance models on the GPU and CPU, dominating the Pareto frontier for this shared task.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.