Multimodal Classification for Analysing Social Media

Chi Thang Duong; Karl Aberer; Remi Lebret

arxiv: 1708.02099 · v1 · pith:OIFUKL3Tnew · submitted 2017-08-07 · 💻 cs.CL · cs.IR· cs.SI

Multimodal Classification for Analysing Social Media

Chi Thang Duong , Remi Lebret , Karl Aberer This is my paper

classification 💻 cs.CL cs.IRcs.SI

keywords modalitiesclassificationmediasocialapproachesdifferentinformationmodels

0 comments

read the original abstract

Classification of social media data is an important approach in understanding user behavior on the Web. Although information on social media can be of different modalities such as texts, images, audio or videos, traditional approaches in classification usually leverage only one prominent modality. Techniques that are able to leverage multiple modalities are often complex and susceptible to the absence of some modalities. In this paper, we present simple models that combine information from different modalities to classify social media content and are able to handle the above problems with existing techniques. Our models combine information from different modalities using a pooling layer and an auxiliary learning task is used to learn a common feature space. We demonstrate the performance of our models and their robustness to the missing of some modalities in the emotion classification domain. Our approaches, although being simple, can not only achieve significantly higher accuracies than traditional fusion approaches but also have comparable results when only one modality is available.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Emotion Recognition Using Fusion of Audio and Video Features
cs.LG 2019-06 unverdicted novelty 4.0

Feature-level or decision-level fusion of CNN video features and audio descriptors via SVR achieves CCC 0.749 (arousal) and 0.565 (valence) on RECOLA after preprocessing and post-processing.