Hostility Detection in Hindi leveraging Pre-Trained Language Models

  • 2021-01-14 08:04:32
  • Ojasv Kamal, Adarsh Kumar, Tejas Vaidhya

Abstract

Hostile content on social platforms is ever increasing. This has led to the need for proper detection of hostile posts so that appropriate action can be taken to tackle them. Though a lot of work has been done recently in the English language to solve the problem of hostile content online, similar work in Indian languages is quite hard to find. This paper presents a transfer learning based approach to classify social media (e.g., Twitter, Facebook) posts in Hindi Devanagari script as Hostile or Non-Hostile. Hostile posts are further analyzed to determine whether they fall under the Hateful, Fake, Defamation, and Offensive categories. We use attention-based pre-trained models fine-tuned on Hindi data, with the Hostile versus Non-Hostile task as an auxiliary task whose features are fused into the classification of the further sub-tasks. Through this approach, we establish a robust and consistent model without any ensembling or complex pre-processing. We present the results of our approach in the CONSTRAINT-2021 Shared Task on hostile post detection, where our model performed extremely well, finishing as 3rd runner-up in terms of Weighted Fine-Grained F1 Score.
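
The abstract reports results as a Weighted Fine-Grained F1 Score but does not define it. As a rough illustration only, the sketch below assumes it means the per-class F1 over the four hostile categories, averaged with weights proportional to each class's number of gold examples. The lowercase label strings, the function names, and the set-per-post representation are hypothetical choices made for this example and are not taken from the paper or the shared task.

```python
from typing import List, Set

# Hypothetical label names for the four fine-grained hostile categories.
FINE_GRAINED = ["hate", "fake", "defamation", "offensive"]


def f1_for_label(gold: List[Set[str]], pred: List[Set[str]], label: str) -> float:
    """Binary F1 for one label over a list of multi-label posts."""
    tp = sum(1 for g, p in zip(gold, pred) if label in g and label in p)
    fp = sum(1 for g, p in zip(gold, pred) if label not in g and label in p)
    fn = sum(1 for g, p in zip(gold, pred) if label in g and label not in p)
    if tp == 0:
        # No true positives: define F1 as 0 to avoid division by zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def weighted_fine_grained_f1(gold: List[Set[str]], pred: List[Set[str]]) -> float:
    """Support-weighted mean of per-class F1 (assumed definition)."""
    supports = {lab: sum(1 for g in gold if lab in g) for lab in FINE_GRAINED}
    total = sum(supports.values())
    if total == 0:
        return 0.0
    return sum(supports[lab] * f1_for_label(gold, pred, lab)
               for lab in FINE_GRAINED) / total


# Toy data: an empty set stands for a Non-Hostile post.
gold = [{"fake"}, {"hate", "offensive"}, set(), {"defamation"}]
pred = [{"fake"}, {"hate"}, {"offensive"}, {"defamation"}]
score = weighted_fine_grained_f1(gold, pred)
print(f"{score:.2f}")
```

In this toy data each category has one gold example; three are predicted correctly and "offensive" is missed once and wrongly predicted once, so the assumed score is the mean of 1, 1, 1, and 0.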

 
