Neural Machine Translation by Generating Multiple Linguistic Factors

  • 2017-12-05 18:53:49
  • Mercedes García-Martínez, Loïc Barrault, Fethi Bougares

Abstract

Factored neural machine translation (FNMT) is founded on the idea of using the morphological and grammatical decomposition of words (factors) at the output side of the neural network. This architecture addresses two well-known problems in MT, namely the size of the target language vocabulary and the number of unknown tokens produced in the translation. The FNMT system is designed to manage a larger vocabulary and to reduce training time (for systems with equivalent target language vocabulary size). Moreover, it can produce grammatically correct words that are not part of the vocabulary. The FNMT model is evaluated on the IWSLT'15 English-to-French task and compared to baseline word-based and BPE-based NMT systems. Promising qualitative and quantitative results (in terms of BLEU and METEOR) are reported.
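
To make the idea of output-side factors concrete, here is a minimal sketch of how decomposing words into a lemma plus morphological factors can shrink the output vocabulary and still yield word forms outside it. Everything in it is an assumption for illustration: the toy lexicon, the factor tag format, and the names `decompose` and `recompose` are hypothetical and are not taken from the paper, which predicts the factors with a neural network rather than looking them up.

```python
# Toy illustration only: the lexicon, tag format, and function names are
# hypothetical and do not come from the paper.

# Hypothetical morphological lexicon: surface form -> (lemma, factors).
LEXICON = {
    "chante": ("chanter", "verb|present|3sg"),
    "chantons": ("chanter", "verb|present|1pl"),
    "chantent": ("chanter", "verb|present|3pl"),
    "mange": ("manger", "verb|present|3sg"),
    "mangeons": ("manger", "verb|present|1pl"),
}

# Inverse table standing in for a morphological generator.
GENERATOR = {pair: word for word, pair in LEXICON.items()}
# Assumption: the generator also knows a form absent from the training words.
GENERATOR[("manger", "verb|present|3pl")] = "mangent"


def decompose(word):
    """Split a surface form into its (lemma, factors) pair."""
    return LEXICON[word]


def recompose(lemma, factors):
    """Rebuild a surface form; fall back to the lemma if no form is known."""
    return GENERATOR.get((lemma, factors), lemma)


# A word-based output layer needs one entry per surface form.
word_vocab = set(LEXICON)
# A factored output needs one entry per lemma and one per factor combination.
lemma_vocab = {lemma for lemma, _ in LEXICON.values()}
factor_vocab = {factors for _, factors in LEXICON.values()}

# Combining a known lemma with known factors gives an unseen surface form.
unseen = recompose("manger", "verb|present|3pl")

if __name__ == "__main__":
    print(len(word_vocab), len(lemma_vocab), len(factor_vocab))
    print(unseen, unseen in word_vocab)
```

In this toy setting, five surface forms are covered by two lemmas and three factor combinations, and the pair ("manger", "verb|present|3pl") maps to a form that the word-level vocabulary does not contain.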

 
