Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo

  • 2022-06-21 13:58:56
  • Roberto Zariquiey, Claudia Alvarado, Ximena Echevarria, Luisa Gomez, Rosa Gonzales, Mariana Illescas, Sabina Oporto, Frederic Blum, Arturo Oncevay, Javier Vera
  • 1

Abstract

In this paper, we launch a new Universal Dependencies treebank for anendangered language from Amazonia: Kakataibo, a Panoan language spoken in Peru.We first discuss the collaborative methodology implemented, which provedeffective to create a treebank in the context of a Computational Linguisticcourse for undergraduates. Then, we describe the general details of thetreebank and the language-specific considerations implemented for the proposedannotation. We finally conduct some experiments on part-of-speech tagging andsyntactic dependency parsing. We focus on monolingual and transfer learningsettings, where we study the impact of a Shipibo-Konibo treebank, anotherPanoan language resource.

 

Quick Read (beta)

loading the full paper ...