Adaptive Machine Translation with Large Language Models

Abstract

Consistency is a key requirement of high-quality translation. It isespecially important to adhere to pre-approved terminology and correctedtranslations in domain-specific projects. Machine translation (MT) has achievedsignificant progress in the area of domain adaptation. However, real-timeadaptation remains challenging. Large-scale language models (LLMs) haverecently shown interesting capabilities of in-context learning, where theylearn to replicate certain input-output text generation patterns, withoutfurther fine-tuning. By feeding an LLM with a prompt that consists of a list oftranslation pairs, it can then simulate the domain and style characteristics atinference time. This work aims to investigate how we can utilize in-contextlearning to improve real-time adaptive MT. Our extensive experiments showpromising results at translation time. For example, GPT-3.5 can adapt to a setof in-domain sentence pairs and/or terminology while translating a newsentence. We observe that the translation quality with few-shot in-contextlearning can surpass that of strong encoder-decoder MT systems, especially forhigh-resource languages. Moreover, we investigate whether we can combine MTfrom strong encoder-decoder models with fuzzy matches, which can furtherimprove the translation, especially for less supported languages. We conductour experiments across five diverse languages, namely English-to-Arabic(EN-AR), English-to-Chinese (EN-ZH), English-to-French (EN-FR),English-to-Kinyarwanda (EN-RW), and English-to-Spanish (EN-ES) language pairs.

Quick Read (beta)

loading the full paper ...