Clustering Pseudo Language Family in Multilingual Translation Models with Fisher Information Matrix

Abstract

In multilingual translation research, the comprehension and utilization oflanguage families are of paramount importance. Nevertheless, clusteringlanguages based solely on their ancestral families can yield suboptimal resultsdue to variations in the datasets employed during the model's training phase.To mitigate this challenge, we introduce an innovative method that leveragesthe fisher information matrix (FIM) to cluster language families, anchored onthe multilingual translation model's characteristics. We hypothesize thatlanguage pairs with similar effects on model parameters exhibit a considerabledegree of linguistic congruence and should thus be grouped cohesively. Thisconcept has led us to define pseudo language families. We provide an in-depthdiscussion regarding the inception and application of these pseudo languagefamilies. Empirical evaluations reveal that employing these pseudo languagefamilies enhances performance over conventional language families in adapting amultilingual translation model to unfamiliar language pairs. The proposedmethodology may also be extended to scenarios requiring language similaritymeasurements. The source code and associated scripts can be accessed athttps://github.com/ecoli-hit/PseudoFamily.

Quick Read (beta)

loading the full paper ...