pith. sign in

arxiv: 1810.07653 · v1 · pith:7M5SIN4Dnew · submitted 2018-10-15 · 💻 cs.CL

Super Characters: A Conversion from Sentiment Classification to Image Classification

classification 💻 cs.CL
keywords classificationcharacterssentimentsupermethodimageimageslarge
0
0 comments X
read the original abstract

We propose a method named Super Characters for sentiment classification. This method converts the sentiment classification problem into image classification problem by projecting texts into images and then applying CNN models for classification. Text features are extracted automatically from the generated Super Characters images, hence there is no need of any explicit step of embedding the words or characters into numerical vector representations. Experimental results on large social media corpus show that the Super Characters method consistently outperforms other methods for sentiment classification and topic classification tasks on ten large social media datasets of millions of contents in four different languages, including Chinese, Japanese, Korean and English.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. MIXAR: Scaling Autoregressive Pixel-based Language Models to Multiple Languages and Scripts

    cs.CL 2026-04 unverdicted novelty 7.0

    MIXAR is the first autoregressive pixel-based language model for eight languages and scripts, with empirical gains on multilingual tasks, robustness to unseen languages, and further improvements when scaled to 0.5B pa...

  2. Fuzzy Convolution Neural Networks for Tabular Data Classification

    cs.LG 2024-06 unverdicted novelty 5.0

    FCNN maps tabular features to fuzzy memberships, arranges them as images, and uses CNNs to classify, reporting competitive or superior results versus DT, SVM, FNN, Bayes, and RF on six generated noisy datasets.