arxiv: 1610.03035 · v6 · pith:MANQJ6SRnew · submitted 2016-10-10 · 📊 stat.ML · cs.CL· cs.LG

Latent Sequence Decompositions

William Chan , Yu Zhang , Quoc Le , Navdeep Jaitly This is my paper

classification 📊 stat.ML cs.CLcs.LG

keywords sequencealgorithmdecompositionslatentoutputachieveachievesapproximate

0 comments

read the original abstract

We present the Latent Sequence Decompositions (LSD) framework. LSD decomposes sequences with variable lengthed output units as a function of both the input sequence and the output sequence. We present a training algorithm which samples valid extensions and an approximate decoding algorithm. We experiment with the Wall Street Journal speech recognition task. Our LSD model achieves 12.9% WER compared to a character baseline of 14.8% WER. When combined with a convolutional network on the encoder, we achieve 9.6% WER.

This paper has not been read by Pith yet.

Latent Sequence Decompositions

discussion (0)