Character-based Neural Embeddings for Tweet Clustering
classification
💻 cs.IR
cs.CL
keywords
clusteringcharacter-basedneuraltweetallowsapproachavailablecode
read the original abstract
In this paper we show how the performance of tweet clustering can be improved by leveraging character-based neural networks. The proposed approach overcomes the limitations related to the vocabulary explosion in the word-based models and allows for the seamless processing of the multilingual content. Our evaluation results and code are available on-line at https://github.com/vendi12/tweet2vec_clustering
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.