An empirical study on large scale text classification with skip-gram embeddings

Georgios Balikas; Massih-Reza Amini

arxiv: 1606.06623 · v1 · pith:SM4COQ5Lnew · submitted 2016-06-21 · 💻 cs.CL · cs.IR


keywords: classification, embeddings, large, been, combination, empirical, investigate, representations



We investigate the integration of word embeddings as classification features in the setting of large-scale text classification. Such representations have been used in a plethora of tasks; however, their application in classification scenarios with thousands of classes has not been extensively researched, partly due to hardware limitations. In this work, we examine efficient composition functions that obtain document-level embeddings from word-level embeddings, and we subsequently investigate their combination with traditional one-hot encoding representations. By presenting empirical evidence on large, multi-class, multi-label classification problems, we demonstrate the efficiency and the performance benefits of this combination.
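The idea of composing word-level embeddings into a document-level vector and combining it with a one-hot (bag-of-words) representation can be illustrated with a minimal sketch. The abstract does not name the composition functions, so the element-wise mean, max, and min pooling used here are assumptions, as are the helper names (`compose`, `bag_of_words`, `combined_features`), the toy vocabulary, and the two-dimensional toy embeddings. This is not the authors' implementation.

```python
import numpy as np

def compose(doc_tokens, embeddings, dim, how="mean"):
    """Pool the word vectors of one document into a single vector.

    The pooling choices (mean, max, min) are assumed for illustration.
    """
    vecs = [embeddings[t] for t in doc_tokens if t in embeddings]
    if not vecs:
        # No known word in the document: fall back to a zero vector.
        return np.zeros(dim)
    stacked = np.vstack(vecs)
    if how == "mean":
        return stacked.mean(axis=0)
    if how == "max":
        return stacked.max(axis=0)
    if how == "min":
        return stacked.min(axis=0)
    raise ValueError(f"unknown composition function: {how}")

def bag_of_words(doc_tokens, vocab):
    """Count-based one-hot representation over a fixed vocabulary."""
    counts = np.zeros(len(vocab))
    for t in doc_tokens:
        idx = vocab.get(t)
        if idx is not None:
            counts[idx] += 1.0
    return counts

def combined_features(doc_tokens, vocab, embeddings, dim, how="mean"):
    """Concatenate the sparse-style counts with the dense document vector."""
    return np.concatenate([
        bag_of_words(doc_tokens, vocab),
        compose(doc_tokens, embeddings, dim, how),
    ])

# Hypothetical toy data, chosen only to make the example runnable.
vocab = {"text": 0, "classification": 1, "embeddings": 2}
embeddings = {
    "text": np.array([1.0, 0.0]),
    "classification": np.array([0.0, 1.0]),
    "embeddings": np.array([1.0, 1.0]),
}
doc = ["text", "classification", "text"]

features = combined_features(doc, vocab, embeddings, dim=2)
max_pooled = compose(doc, embeddings, dim=2, how="max")
```

Here `features` has five entries: three vocabulary counts followed by the two-dimensional pooled embedding. In a large-scale setting the count part would normally be kept as a sparse matrix rather than a dense array; the dense form is used only to keep the sketch short.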
