Effects of Annotations' Density on Named Entity Recognition Models' Performance in the Context of African Languages

  • 2022-08-09 08:15:20
  • Manuel A. Fokam
  • 0

Abstract

African languages have recently been the subject of several studies inNatural Language Processing (NLP) and, this has caused a significant increasein their representation in the field. However, most studies tend to focus moreon the models than the quality of the datasets when assessing the models'performance in tasks such as Named Entity Recognition (NER). While this workswell in most cases, it does not account for the limitations of doing NLP withlow-resource languages, that is, the quality and the quantity of the dataset atour disposal. This paper provides an analysis of the performance of variousmodels based on the quality of the dataset. We evaluate different pre-trainedmodels with respect to the entity density per sentence of some African NERdatasets. We hope with this study to improve the way NLP studies are done inthe context of low-resourced languages.

 

Quick Read (beta)

loading the full paper ...