Can Uniform Meaning Representation Help GPT-4 Translate from Indigenous Languages?

  • 2025-06-01 02:53:16
  • Shira Wein

Abstract

While ChatGPT and GPT-based models are able to effectively perform many tasks without additional fine-tuning, they struggle with tasks related to extremely low-resource languages and indigenous languages. Uniform Meaning Representation (UMR), a semantic representation designed to capture the meaning of texts in many languages, is well-positioned to be leveraged in the development of low-resource language technologies. In this work, we explore the downstream utility of UMR for low-resource languages by incorporating it into GPT-4 prompts. Specifically, we examine the ability of GPT-4 to perform translation from three indigenous languages (Navajo, Arápaho, and Kukama), with and without demonstrations, as well as with and without UMR annotations. Ultimately, we find that in the majority of our test cases, integrating UMR into the prompt results in a statistically significant increase in performance, which is a promising indication of future applications of the UMR formalism.
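
The abstract describes a two-by-two comparison: prompts with or without demonstrations, crossed with prompts with or without UMR annotations. The following is a minimal sketch of how such prompt variants might be assembled. The function names (`build_prompt`, `all_conditions`), the prompt wording, and the demonstration record layout are all assumptions made for illustration; they are not taken from the paper, and no model call is shown.

```python
from itertools import product


def build_prompt(source, language, umr=None, demos=()):
    """Assemble one translation prompt.

    Hypothetical helper: the wording and field labels are illustrative,
    not the prompts used in the paper.
    """
    parts = [f"Translate the following {language} sentence into English."]
    # Optional demonstrations (few-shot setting).
    for demo in demos:
        parts.append(f"Example {language} sentence: {demo['source']}")
        # Show a demonstration's UMR graph only in the UMR condition.
        if umr is not None and demo.get("umr"):
            parts.append(f"Example UMR graph: {demo['umr']}")
        parts.append(f"Example English translation: {demo['target']}")
    # The sentence to translate, optionally paired with its UMR graph.
    parts.append(f"{language} sentence: {source}")
    if umr is not None:
        parts.append(f"UMR graph: {umr}")
    parts.append("English translation:")
    return "\n".join(parts)


def all_conditions(source, language, umr, demos):
    """Return one prompt per (use_demos, use_umr) setting."""
    prompts = {}
    for use_demos, use_umr in product((False, True), repeat=2):
        prompts[(use_demos, use_umr)] = build_prompt(
            source,
            language,
            umr=umr if use_umr else None,
            demos=demos if use_demos else (),
        )
    return prompts
```

Calling `all_conditions` with a source sentence, its UMR graph, and a list of demonstration records yields four prompts keyed by `(use_demos, use_umr)`, which could then each be sent to the model and scored against reference translations.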

 
