Directions in Abusive Language Training Data: Garbage In, Garbage Out

  • 2020-04-03 16:51:33
  • Bertie Vidgen, Leon Derczynski
  • 4

Abstract

Data-driven analysis and detection of abusive online content covers manydifferent tasks, phenomena, contexts, and methodologies. This papersystematically reviews abusive language dataset creation and content inconjunction with an open website for cataloguing abusive language data. Thiscollection of knowledge leads to a synthesis providing evidence-basedrecommendations for practitioners working with this complex and highly diversedata.

 

Quick Read (beta)

loading the full paper ...