Abstract
Most existing document-level neural machine translation (NMT) models leverage a fixed number of previous source sentences or all global source sentences to address the context-independent problem in standard NMT. However, the translation of each source sentence benefits from context of varying size, and inappropriate context may harm translation performance. In this work, we introduce a data-adaptive method that enables the model to adopt only the necessary and useful context. Specifically, we introduce a light predictor into two document-level translation models to select the explicit context. Experiments demonstrate that the proposed approach can significantly improve performance over previous methods, with a gain of up to 1.99 BLEU points.