Do Convolutional Networks need to be Deep for Text Classification ?

Alexandre Denis; Christophe Cerisara; Hoa T. Le

arxiv: 1707.04108 · v1 · pith:V4R334NBnew · submitted 2017-07-13 · 💻 cs.CL

Do Convolutional Networks need to be Deep for Text Classification ?

Hoa T. Le , Christophe Cerisara , Alexandre Denis This is my paper

classification 💻 cs.CL

keywords textclassificationdeepmodelswordconvolutionalinputsnetworks

0 comments

read the original abstract

We study in this work the importance of depth in convolutional models for text classification, either when character or word inputs are considered. We show on 5 standard text classification and sentiment analysis tasks that deep models indeed give better performances than shallow networks when the text input is represented as a sequence of characters. However, a simple shallow-and-wide network outperforms deep models such as DenseNet with word inputs. Our shallow word model further establishes new state-of-the-art performances on two datasets: Yelp Binary (95.9\%) and Yelp Full (64.9\%).

This paper has not been read by Pith yet.

Do Convolutional Networks need to be Deep for Text Classification ?

discussion (0)