Zero-shot Cross-lingual Transfer Learning with Multiple Source and Target Languages for Information Extraction: Language Selection and Adversarial Training

Abstract

The majority of previous researches addressing multi-lingual IE are limitedto zero-shot cross-lingual single-transfer (one-to-one) setting, withhigh-resource languages predominantly as source training data. As a result,these works provide little understanding and benefit for the realistic goal ofdeveloping a multi-lingual IE system that can generalize to as many languagesas possible. Our study aims to fill this gap by providing a detailed analysison Cross-Lingual Multi-Transferability (many-to-many transfer learning), forthe recent IE corpora that cover a diverse set of languages. Specifically, wefirst determine the correlation between single-transfer performance and a widerange of linguistic-based distances. From the obtained insights, a combinedlanguage distance metric can be developed that is not only highly correlatedbut also robust across different tasks and model scales. Next, we investigatethe more general zero-shot multi-lingual transfer settings where multiplelanguages are involved in the training and evaluation processes. Languageclustering based on the newly defined distance can provide directions forachieving the optimal cost-performance trade-off in data (languages) selectionproblem. Finally, a relational-transfer setting is proposed to furtherincorporate multi-lingual unlabeled data based on adversarial training usingthe relation induced from the above linguistic distance.

Quick Read (beta)

loading the full paper ...