Linguistic Characteristics of Censorable Language on SinaWeibo

Anna Feldman; Chris Leberknight; Jing Peng; Kei Yin Ng

arxiv: 1807.03654 · v1 · pith:UYFGAI2Knew · submitted 2018-07-10 · 💻 cs.CL

Linguistic Characteristics of Censorable Language on SinaWeibo

Kei Yin Ng , Anna Feldman , Jing Peng , Chris Leberknight This is my paper

classification 💻 cs.CL

keywords linguisticcensoredcensorshipcorpustopicsbuildcensorablecharacteristics

0 comments

read the original abstract

This paper investigates censorship from a linguistic perspective. We collect a corpus of censored and uncensored posts on a number of topics, build a classifier that predicts censorship decisions independent of discussion topics. Our investigation reveals that the strongest linguistic indicator of censored content of our corpus is its readability.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Assessing Post Deletion in Sina Weibo: Multi-modal Classification of Hot Topics
cs.SI 2019-06 unverdicted novelty 5.0

Multi-modal analysis of 994 Weibo posts and 18,966 images finds sentiment as the sole consistent predictor of censorship, with anti-government topics deleted more often and average deletion time of three hours.