Multilingual Graphemic Hybrid ASR with Massive Data Augmentation

  • 2020-04-02 20:39:07
  • Chunxi Liu, Qiaochu Zhang, Xiaohui Zhang, Kritika Singh, Yatharth Saraf, Geoffrey Zweig
  • 0

Abstract

Towards developing high-performing ASR for low-resource languages, approachesto address the lack of resources are to make use of data from multiplelanguages, and to augment the training data by creating acoustic variations. Inthis work we present a single grapheme-based ASR model learned on 7geographically proximal languages, using standard hybrid BLSTM-HMM acousticmodels with lattice-free MMI objective. We build the single ASR grapheme setvia taking the union over each language-specific grapheme set, and we find suchmultilingual graphemic hybrid ASR model can perform language-independentrecognition on all 7 languages, and substantially outperform each monolingualASR model. Secondly, we evaluate the efficacy of multiple data augmentationalternatives within language, as well as their complementarity withmultilingual modeling. Overall, we show that the proposed multilingualgraphemic hybrid ASR with various data augmentation can not only recognize anywithin training set languages, but also provide large ASR performanceimprovements.

 

Quick Read (beta)

loading the full paper ...