Bilingual Sentiment Embeddings: Joint Projection of Sentiment Across Languages

  • 2018-05-23 08:56:15
  • Jeremy Barnes, Roman Klinger, Sabine Schulte im Walde
  • 1

Abstract

Sentiment analysis in low-resource languages suffers from a lack of annotatedcorpora to estimate high-performing models. Machine translation and bilingualword embeddings provide some relief through cross-lingual sentiment approaches.However, they either require large amounts of parallel data or do notsufficiently capture sentiment information. We introduce Bilingual SentimentEmbeddings (BLSE), which jointly represent sentiment information in a sourceand target language. This model only requires a small bilingual lexicon, asource-language corpus annotated for sentiment, and monolingual word embeddingsfor each language. We perform experiments on three language combinations(Spanish, Catalan, Basque) for sentence-level cross-lingual sentimentclassification and find that our model significantly outperformsstate-of-the-art methods on four out of six experimental setups, as well ascapturing complementary information to machine translation. Our analysis of theresulting embedding space provides evidence that it represents sentimentinformation in the resource-poor target language without any annotated data inthat language.

 

Quick Read (beta)

loading the full paper ...