Language and Dialect Identification of Cuneiform Texts

  • 2019-03-13 14:54:22
  • Tommi Jauhiainen, Heidi Jauhiainen, Tero Alstola, Krister Lindén


This article introduces a corpus of cuneiform texts from which the dataset for the Cuneiform Language Identification (CLI) 2019 shared task was derived, along with some preliminary language identification experiments conducted using that corpus. We also describe the CLI dataset and how it was derived from the corpus. In addition, we provide some baseline language identification results using the CLI dataset. To the best of our knowledge, the experiments detailed here are the first use of automatic language identification methods on cuneiform data.

