Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets

Abstract

Multilingual Language Models (MLLMs) exhibit robust cross-lingual transfercapabilities, or the ability to leverage information acquired in a sourcelanguage and apply it to a target language. These capabilities find practicalapplications in well-established Natural Language Processing (NLP) tasks suchas Named Entity Recognition (NER). This study aims to investigate theeffectiveness of a source language when applied to a target language,particularly in the context of perturbing the input test set. We evaluate on 13pairs of languages, each including one high-resource language (HRL) and onelow-resource language (LRL) with a geographic, genetic, or borrowingrelationship. We evaluate two well-known MLLMs--MBERT and XLM-R--on thesepairs, in native LRL and cross-lingual transfer settings, in two tasks, under aset of different perturbations. Our findings indicate that NER cross-lingualtransfer depends largely on the overlap of entity chunks. If a source andtarget language have more entities in common, the transfer ability is stronger.Models using cross-lingual transfer also appear to be somewhat more robust tocertain perturbations of the input, perhaps indicating an ability to leveragestronger representations derived from the HRL. Our research provides valuableinsights into cross-lingual transfer and its implications for NLP applications,and underscores the need to consider linguistic nuances and potentiallimitations when employing MLLMs across distinct languages.

Quick Read (beta)

loading the full paper ...