Linguistic Characteristics of Censorable Language on SinaWeibo

Abstract

This paper investigates censorship from a linguistic perspective. We collect a corpus of censored and uncensored posts on a number of topics and build a classifier that predicts censorship decisions independent of discussion topics. Our investigation reveals that the strongest linguistic indicator of censored content in our corpus is its readability.
