Dual Language Models for Code Switched Speech Recognition

  • 2018-08-03 13:46:46
  • Saurabh Garg, Tanmay Parekh, Preethi Jyothi
  • 0

Abstract

In this work, we present a simple and elegant approach to language modelingfor bilingual code-switched text. Since code-switching is a blend of two ormore different languages, a standard bilingual language model can be improvedupon by using structures of the monolingual language models. We propose a noveltechnique called dual language models, which involves building twocomplementary monolingual language models and combining them using aprobabilistic model for switching between the two. We evaluate the efficacy ofour approach using a conversational Mandarin-English speech corpus. We provethe robustness of our model by showing significant improvements in perplexitymeasures over the standard bilingual language model without the use of anyexternal information. Similar consistent improvements are also reflected inautomatic speech recognition error rates.

 

Quick Read (beta)

loading the full paper ...