Classifier Ensembles for Dialect and Language Variety Identification

  • 2018-08-14 17:22:25
  • Liviu P. Dinu, Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi
  • 1

Abstract

In this paper we present ensemble-based systems for dialect and languagevariety identification using the datasets made available by the organizers ofthe VarDial Evaluation Campaign 2018. We present a system developed todiscriminate between Flemish and Dutch in subtitles and a system trained todiscriminate between four Arabic dialects: Egyptian, Levantine, Gulf, NorthAfrican, and Modern Standard Arabic in speech broadcasts. Finally, we comparethe performance of these two systems with the other systems submitted to theDiscriminating between Dutch and Flemish in Subtitles (DFS) and the ArabicDialect Identification (ADI) shared tasks at VarDial 2018.

 

Quick Read (beta)

loading the full paper ...