Chain-of-Translation Prompting (CoTR): A Novel Prompting Technique for Low Resource Languages

  • 2024-09-06 18:15:17
  • Tejas Deshpande, Nidhi Kowtal, Raviraj Joshi
  • 0

Abstract

This paper introduces Chain of Translation Prompting (CoTR), a novel strategydesigned to enhance the performance of language models in low-resourcelanguages. CoTR restructures prompts to first translate the input context froma low-resource language into a higher-resource language, such as English. Thespecified task like generation, classification, or any other NLP function isthen performed on the translated text, with the option to translate the outputback to the original language if needed. All these steps are specified in asingle prompt. We demonstrate the effectiveness of this method through a casestudy on the low-resource Indic language Marathi. The CoTR strategy is appliedto various tasks, including sentiment analysis, hate speech classification,subject classification and text generation, and its efficacy is showcased bycomparing it with regular prompting methods. Our results underscore thepotential of translation-based prompting strategies to significantly improvemultilingual LLM performance in low-resource languages, offering valuableinsights for future research and applications. We specifically see the highestaccuracy improvements with the hate speech detection task. The technique alsohas the potential to enhance the quality of synthetic data generation forunderrepresented languages using LLMs.

 

Quick Read (beta)

loading the full paper ...