pith. sign in

arxiv: 1707.04108 · v1 · pith:V4R334NBnew · submitted 2017-07-13 · 💻 cs.CL

Do Convolutional Networks need to be Deep for Text Classification ?

classification 💻 cs.CL
keywords textclassificationdeepmodelswordconvolutionalinputsnetworks
0
0 comments X
read the original abstract

We study in this work the importance of depth in convolutional models for text classification, either when character or word inputs are considered. We show on 5 standard text classification and sentiment analysis tasks that deep models indeed give better performances than shallow networks when the text input is represented as a sequence of characters. However, a simple shallow-and-wide network outperforms deep models such as DenseNet with word inputs. Our shallow word model further establishes new state-of-the-art performances on two datasets: Yelp Binary (95.9\%) and Yelp Full (64.9\%).

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.