Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq

Boris Ginsburg; Carl Case; Huyen Nguyen; Igor Gitman; Jason Li; Oleksii Kuchaiev; Paulius Micikevicius; Vitaly Lavrukhin

arxiv: 1805.10387 · v2 · pith:VPPIHJVRnew · submitted 2018-05-25 · 💻 cs.CL

Oleksii Kuchaiev, Boris Ginsburg, Igor Gitman, Vitaly Lavrukhin, Jason Li, Huyen Nguyen, Carl Case, Paulius Micikevicius

classification 💻 cs.CL

keywords: openseq2seq, speech, training, models, recognition, machine, mixed-precision, tasks

We present OpenSeq2Seq, a TensorFlow-based toolkit for training sequence-to-sequence models that features distributed and mixed-precision training. Benchmarks on machine translation and speech recognition tasks show that models built using OpenSeq2Seq achieve state-of-the-art performance with 1.5-3x less training time. OpenSeq2Seq currently provides building blocks for models that solve a wide range of tasks, including neural machine translation, automatic speech recognition, and speech synthesis.
