Semantic Relatedness for All (Languages): A Comparative Analysis of Multilingual Semantic Relatedness Using Machine Translation

  • 2018-05-16 20:43:45
  • Andre Freitas, Siamak Barzegar, Juliano Efson Sales, Siegfried Handschuh, Brian Davis
  • 1

Abstract

This paper provides a comparative analysis of the performance of fourstate-of-the-art distributional semantic models (DSMs) over 11 languages,contrasting the native language-specific models with the use of machinetranslation over English-based DSMs. The experimental results show that thereis a significant improvement (average of 16.7% for the Spearman correlation) byusing state-of-the-art machine translation approaches. The results also showthat the benefit of using the most informative corpus outweighs the possibleerrors introduced by the machine translation. For all languages, thecombination of machine translation over the Word2Vec English distributionalmodel provided the best results consistently (average Spearman correlation of0.68).

 

Quick Read (beta)

loading the full paper ...