Diverse Embedding Neural Network Language Models

Abhinav Sethy; Bhuvana Ramabhadran; Kartik Audhkhasi

arXiv: 1412.7063 · v5 · pith:5XHKRYKZnew · submitted 2014-12-22 · 💻 cs.CL · cs.LG · cs.NE

Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran

classification 💻 cs.CL cs.LG cs.NE

keywords diverse network language neural dennlm embedding models sub-spaces

We propose Diverse Embedding Neural Network (DENN), a novel architecture for language models (LMs). A DENNLM projects the input word history vector onto multiple diverse low-dimensional sub-spaces instead of a single higher-dimensional sub-space as in conventional feed-forward neural network LMs. We encourage these sub-spaces to be diverse during network training through an augmented loss function. Our language modeling experiments on the Penn Treebank data set show the performance benefit of using a DENNLM.
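The idea in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the abstract does not specify how the sub-space outputs are combined or what form the augmented loss takes, so the choices here (averaging per-sub-space logits, and penalizing the overlap between projection matrices with a squared Frobenius norm weighted by a hypothetical `lam`) are assumptions made only for illustration. All function and variable names are likewise hypothetical.

```python
import math
import random

def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def random_matrix(rows, cols, rng, scale=0.1):
    """Small random matrix used as an untrained weight stand-in."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

def subspace_overlap(Wa, Wb):
    """Squared Frobenius norm of Wa @ Wb^T; zero when the row spaces are orthogonal."""
    return sum(sum(a * b for a, b in zip(ra, rb)) ** 2 for ra in Wa for rb in Wb)

def denn_forward(x, projections, output_layers):
    """Project x onto each low-dimensional sub-space, score the vocabulary
    from each one, and average the logits (the averaging is an assumption)."""
    vocab = len(output_layers[0])
    logits = [0.0] * vocab
    for W, U in zip(projections, output_layers):
        h = [math.tanh(v) for v in matvec(W, x)]  # low-dimensional embedding
        for k, v in enumerate(matvec(U, h)):
            logits[k] += v / len(projections)
    return softmax(logits)

def augmented_loss(x, target, projections, output_layers, lam=0.1):
    """Cross-entropy plus an assumed diversity penalty on pairwise overlap."""
    probs = denn_forward(x, projections, output_layers)
    ce = -math.log(probs[target])
    n = len(projections)
    penalty = sum(subspace_overlap(projections[i], projections[j])
                  for i in range(n) for j in range(i + 1, n))
    return ce + lam * penalty

# Toy usage: a 12-dimensional history vector, four 3-dimensional sub-spaces,
# and a 5-word vocabulary. All sizes are arbitrary.
rng = random.Random(0)
d_in, d_sub, vocab, n_sub = 12, 3, 5, 4
projections = [random_matrix(d_sub, d_in, rng) for _ in range(n_sub)]
output_layers = [random_matrix(vocab, d_sub, rng) for _ in range(n_sub)]
x = [rng.uniform(-1, 1) for _ in range(d_in)]
probs = denn_forward(x, projections, output_layers)
loss = augmented_loss(x, 2, projections, output_layers)
```

A conventional feed-forward LM would correspond to a single projection with a larger `d_sub`; the sketch replaces it with several smaller projections and adds a term to the loss that grows when those projections span overlapping directions.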
