TICO-19: the Translation Initiative for Covid-19

  • 2020-07-06 14:13:51
  • Antonios Anastasopoulos, Alessandro Cattelan, Zi-Yi Dou, Marcello Federico, Christian Federman, Dmitriy Genzel, Francisco Guzmán, Junjie Hu, Macduff Hughes, Philipp Koehn, Rosie Lazar, Will Lewis, Graham Neubig, Mengmeng Niu, Alp Öktem, Eric Paquin, Grace Tang, Sylwia Tur
  • 0

Abstract

The COVID-19 pandemic is the worst pandemic to strike the world in over acentury. Crucial to stemming the tide of the SARS-CoV-2 virus is communicatingto vulnerable populations the means by which they can protect themselves. Tothis end, the collaborators forming the Translation Initiative for COvid-19(TICO-19) have made test and development data available to AI and MTresearchers in 35 different languages in order to foster the development oftools and resources for improving access to information about COVID-19 in theselanguages. In addition to 9 high-resourced, "pivot" languages, the team istargeting 26 lesser resourced languages, in particular languages of Africa,South Asia and South-East Asia, whose populations may be the most vulnerable tothe spread of the virus. The same data is translated into all of the languagesrepresented, meaning that testing or development can be done for any pairing oflanguages in the set. Further, the team is converting the test and developmentdata into translation memories (TMXs) that can be used by localizers from andto any of the languages.

 

Quick Read (beta)

loading the full paper ...