Bi-Decoder Augmented Network for Neural Machine Translation

Abstract

Neural Machine Translation (NMT) has become a popular technology in recentyears, and the encoder-decoder framework is the mainstream among all themethods. It's obvious that the quality of the semantic representations fromencoding is very crucial and can significantly affect the performance of themodel. However, existing unidirectional source-to-target architectures mayhardly produce a language-independent representation of the text because theyrely heavily on the specific relations of the given language pairs. Toalleviate this problem, in this paper, we propose a novel Bi-Decoder AugmentedNetwork (BiDAN) for the neural machine translation task. Besides the originaldecoder which generates the target language sequence, we add an auxiliarydecoder to generate back the source language sequence at the training time.Since each decoder transforms the representations of the input text into itscorresponding language, jointly training with two target ends can make theshared encoder has the potential to produce a language-independent semanticspace. We conduct extensive experiments on several NMT benchmark datasets andthe results demonstrate the effectiveness of our proposed approach.

Quick Read (beta)

loading the full paper ...