TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

Abstract

Recently, deep reasoning large language models(LLMs) like DeepSeek-R1 havemade significant progress in tasks such as mathematics and coding. Inspired bythis, several studies have employed reinforcement learning(RL) to enhancemodels' deep reasoning capabilities and improve machine translation(MT)quality. However, the terminology translation, an essential task in MT, remainsunexplored in deep reasoning LLMs. In this paper, we propose \textbf{TAT-R1}, aterminology-aware translation model trained with reinforcement learning andword alignment. Specifically, we first extract the keyword translation pairsusing a word alignment model. Then we carefully design three types ofrule-based alignment rewards with the extracted alignment relationships. Withthose alignment rewards, the RL-trained translation model can learn to focus onthe accurate translation of key information, including terminology in thesource text. Experimental results show the effectiveness of TAT-R1. Our modelsignificantly improves terminology translation accuracy compared to thebaseline models while maintaining comparable performance on general translationtasks. In addition, we conduct detailed ablation studies of theDeepSeek-R1-like training paradigm for machine translation and reveal severalkey findings.

Quick Read (beta)

loading the full paper ...