IntGrad MT: Eliciting LLMs' Machine Translation Capabilities with Sentence Interpolation and Gradual MT

  • 2024-10-16 02:45:28
  • Seung-Woo Choi, Ga-Hyun Yoo, Jay-Yoon Lee

Abstract

Recent Large Language Models (LLMs) have demonstrated strong performance in translation without needing to be finetuned on additional parallel corpora. However, they still underperform for low-resource language pairs. Previous works have focused on mitigating this issue by leveraging relevant few-shot examples or external resources such as dictionaries or grammar books, making models heavily reliant on these nonparametric sources of information. In this paper, we propose a novel method named IntGrad MT that focuses on fully exploiting an LLM's inherent translation capability. IntGrad MT achieves this by constructing a chain of few-shot examples, each consisting of a source sentence and the model's own translation, that rise incrementally in difficulty. IntGrad MT employs two techniques: Sentence Interpolation, which generates a sequence of sentences that gradually change from an easy sentence to translate to a difficult one, and Gradual MT, which sequentially translates this chain using translations of earlier sentences as few-shot examples for the translation of subsequent ones. With this approach, we observe a substantial enhancement in the xCOMET scores of various LLMs for multiple languages, especially in low-resource languages such as Hindi (8.26), Swahili (7.10), Bengali (6.97) and Marathi (13.03). Our approach presents a practical way of enhancing LLMs' performance without extra training.
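
The Gradual MT step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `build_prompt`, `gradual_mt`, and `toy_llm` are hypothetical, the prompt format is an assumption, and using every earlier pair as a few-shot example is one possible reading of the abstract. The interpolated chain is assumed to be given, since the abstract does not specify how Sentence Interpolation produces it, and `toy_llm` is a stand-in that only uppercases its input so the loop runs without a real model.

```python
from typing import Callable, List, Tuple

# One few-shot example: (source sentence, the model's own translation).
Example = Tuple[str, str]


def build_prompt(source: str, examples: List[Example]) -> str:
    """Format earlier (source, translation) pairs as few-shot examples.

    The "Source:/Translation:" layout is an assumed prompt format.
    """
    blocks = [f"Source: {s}\nTranslation: {t}" for s, t in examples]
    blocks.append(f"Source: {source}\nTranslation:")
    return "\n\n".join(blocks)


def gradual_mt(chain: List[str], llm: Callable[[str], str]) -> List[Example]:
    """Translate a chain ordered from easy to difficult.

    Each sentence is translated with all earlier pairs as few-shot
    examples (an assumption), and its translation joins the examples
    used for the next sentence.
    """
    examples: List[Example] = []
    for source in chain:
        prompt = build_prompt(source, examples)
        translation = llm(prompt).strip()
        examples.append((source, translation))
    return examples


def toy_llm(prompt: str) -> str:
    """Hypothetical stand-in for an LLM call: uppercases the final source."""
    last_source = prompt.rsplit("Source: ", 1)[1].split("\nTranslation:")[0]
    return last_source.upper()


# An assumed, hand-written chain that moves from an easy sentence
# towards the difficult target sentence.
chain = [
    "The cat sleeps.",
    "The cat sleeps on the mat.",
    "The old cat sleeps on the worn mat by the door.",
]

results = gradual_mt(chain, toy_llm)
final_translation = results[-1][1]  # translation of the target sentence
```

In practice `toy_llm` would be replaced by a call to an actual model, and the last element of `results` holds the translation of the original, difficult sentence.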

 
