Automatic Language Identification System for Hindi and Magahi

  • 2018-04-13 19:38:52
  • Priya Rani, Atul Kr. Ojha, Girish Nath Jha
  • 2

Abstract

Language identification has become a prerequisite for all kinds of automatedtext processing systems. In this paper, we present a rule-based languageidentifier tool for two closely related Indo-Aryan languages: Hindi and Magahi.This system has currently achieved an accuracy of approx 86.34%. We hope toimprove this in the future. Automatic identification of languages will besignificant in the accuracy of output of Web Crawlers.

 

Quick Read (beta)

loading the full paper ...