MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets

Dimitar Dimitrov; Md Shad Akhtar; Preslav Nakov; Shivam Sharma; Shraman Pramanick; Tanmoy Chakraborty

arXiv: 2109.05184 · v2 · pith:NEFSFXZ3new · submitted 2021-09-11 · cs.MM · cs.CL


Shraman Pramanick, Shivam Sharma, Dimitar Dimitrov, Md Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty

classification: cs.MM · cs.CL

keywords: memes · harmful · detecting · momenta · multimodal · framework · global · local


Internet memes have become a powerful means to transmit political, psychological, and socio-cultural ideas. Although memes are typically humorous, recent days have witnessed an escalation of harmful memes used for trolling, cyberbullying, and abuse. Detecting such memes is challenging as they can be highly satirical and cryptic. Moreover, while previous work has focused on specific aspects of memes such as hate speech and propaganda, there has been little work on harm in general. Here, we aim to bridge this gap. We focus on two tasks: (i) detecting harmful memes, and (ii) identifying the social entities they target. We further extend the recently released HarMeme dataset, which covered COVID-19, with additional memes and a new topic: US politics. To solve these tasks, we propose MOMENTA (MultimOdal framework for detecting harmful MemEs aNd Their tArgets), a novel multimodal deep neural network that uses global and local perspectives to detect harmful memes. MOMENTA systematically analyzes the local and the global perspective of the input meme (in both modalities) and relates it to the background context. MOMENTA is interpretable and generalizable, and our experiments show that it outperforms several strong rival approaches.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Distinguishing Right from Wrong in Debates: Attribution Analysis of Chinese Harmful Memes
cs.CL · 2026-05 · unverdicted · novelty 6.0

Introduces Ex-ToxiCN-MM dataset and RIKE framework (with AKE and RIR modules) that outperforms baselines on attributing harm in ambiguous Chinese memes using C-HarmKB.