pith. sign in

arxiv: 1412.1058 · v2 · pith:FUHYYMRBnew · submitted 2014-12-01 · 💻 cs.CL · cs.LG· stat.ML

Effective Use of Word Order for Text Categorization with Convolutional Neural Networks

classification 💻 cs.CL cs.LGstat.ML
keywords textdataneuralstructurewordcategorizationconvolutionconvolutional
0
0 comments X
read the original abstract

Convolutional neural network (CNN) is a neural network that can make use of the internal structure of data such as the 2D structure of image data. This paper studies CNN on text categorization to exploit the 1D structure (namely, word order) of text data for accurate prediction. Instead of using low-dimensional word vectors as input as is often done, we directly apply CNN to high-dimensional text data, which leads to directly learning embedding of small text regions for use in classification. In addition to a straightforward adaptation of CNN from image to text, a simple but new variation which employs bag-of-word conversion in the convolution layer is proposed. An extension to combine multiple convolution layers is also explored for higher accuracy. The experiments demonstrate the effectiveness of our approach in comparison with state-of-the-art methods.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Automatically Learning Construction Injury Precursors from Text

    cs.CL 2019-07 unverdicted novelty 4.0

    Standard NLP classifiers can surface valid injury precursors from raw construction safety reports.