pith. sign in

arxiv: 1705.02414 · v1 · pith:7OVLQCD2new · submitted 2017-05-05 · 💻 cs.LG · stat.ML

A comprehensive study of batch construction strategies for recurrent neural networks in MXNet

classification 💻 cs.LG stat.ML
keywords trainingbatchconstructionrecurrentapproachbucketingcomparemxnet
0
0 comments X p. Extension
pith:7OVLQCD2 Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{7OVLQCD2}

Prints a linked pith:7OVLQCD2 badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

In this work we compare different batch construction methods for mini-batch training of recurrent neural networks. While popular implementations like TensorFlow and MXNet suggest a bucketing approach to improve the parallelization capabilities of the recurrent training process, we propose a simple ordering strategy that arranges the training sequences in a stochastic alternatingly sorted way. We compare our method to sequence bucketing as well as various other batch construction strategies on the CHiME-4 noisy speech recognition corpus. The experiments show that our alternated sorting approach is able to compete both in training time and recognition performance while being conceptually simpler to implement.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.