Abstract
In a modern spoken language understanding (SLU) system, the natural language understanding (NLU) module takes interpretations of an utterance from the automatic speech recognition (ASR) module as its input. The NLU module usually uses only the first-best interpretation of a given utterance in downstream tasks such as domain and intent classification. However, the ASR module may misrecognize some utterances, so the first-best interpretation can be erroneous and noisy. Relying solely on the first-best interpretation can therefore make the performance of downstream tasks suboptimal. To address this issue, we introduce a series of simple yet efficient models that improve the understanding of the semantics of input utterances by collectively exploiting the n-best speech interpretations from the ASR module.