Alternative Input Signals Ease Transfer in Multilingual Machine Translation

Abstract

Recent work in multilingual machine translation (MMT) has focused on thepotential of positive transfer between languages, particularly cases wherehigher-resourced languages can benefit lower-resourced ones. While training anMMT model, the supervision signals learned from one language pair can betransferred to the other via the tokens shared by multiple source languages.However, the transfer is inhibited when the token overlap among sourcelanguages is small, which manifests naturally when languages use differentwriting systems. In this paper, we tackle inhibited transfer by augmenting thetraining data with alternative signals that unify different writing systems,such as phonetic, romanized, and transliterated input. We test these signals onIndic and Turkic languages, two language families where the writing systemsdiffer but languages still share common features. Our results indicate that astraightforward multi-source self-ensemble -- training a model on a mixture ofvarious signals and ensembling the outputs of the same model fed with differentsignals during inference, outperforms strong ensemble baselines by 1.3 BLEUpoints on both language families. Further, we find that incorporatingalternative inputs via self-ensemble can be particularly effective whentraining set is small, leading to +5 BLEU when only 5% of the total trainingdata is accessible. Finally, our analysis demonstrates that includingalternative signals yields more consistency and translates named entities moreaccurately, which is crucial for increased factuality of automated systems.

Quick Read (beta)

loading the full paper ...