pith. sign in

arxiv: 1805.07685 · v1 · pith:IYGZ553Vnew · submitted 2018-05-20 · 💻 cs.CL · cs.LG

Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer

classification 💻 cs.CL cs.LG
keywords offensivestyletexttransferapproachdatalanguagemedia
0
0 comments X
read the original abstract

We introduce a new approach to tackle the problem of offensive language in online social media. Our approach uses unsupervised text style transfer to translate offensive sentences into non-offensive ones. We propose a new method for training encoder-decoders using non-parallel data that combines a collaborative classifier, attention and the cycle consistency loss. Experimental results on data from Twitter and Reddit show that our method outperforms a state-of-the-art text style transfer system in two out of three quantitative metrics and produces reliable non-offensive transferred sentences.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.