HateMonitors: Language Agnostic Abuse Detection in Social Media

  • 2019-09-27 12:15:58
  • Punyajoy Saha, Binny Mathew, Pawan Goyal, Animesh Mukherjee
  • 3

Abstract

Reducing hateful and offensive content in online social media pose a dualproblem for the moderators. On the one hand, rigid censorship on social mediacannot be imposed. On the other, the free flow of such content cannot beallowed. Hence, we require efficient abusive language detection system todetect such harmful content in social media. In this paper, we present ourmachine learning model, HateMonitor, developed for Hate Speech and OffensiveContent Identification in Indo-European Languages (HASOC), a shared task atFIRE 2019. We have used a Gradient Boosting model, along with BERT and LASERembeddings, to make the system language agnostic. Our model came at Firstposition for the German sub-task A. We have also made our model public athttps://github.com/punyajoy/HateMonitors-HASOC .

 

Quick Read (beta)

loading the full paper ...