Abstract
Most existing speech emotion recognition models are trained and evaluated on a single corpus in a single language. These systems do not perform as well when applied in cross-corpus and cross-language scenarios. This paper presents speech emotion recognition results for four languages in both single-corpus and cross-corpus settings. Additionally, since multi-task learning (MTL) with gender, naturalness and arousal as auxiliary tasks has been shown to enhance the generalisation capabilities of emotion models, this paper introduces language ID as another auxiliary task in the MTL framework to explore the role of spoken language in emotion recognition, which has not yet been studied.