No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement

  • 2024-09-08 16:03:14
  • Mateusz Klimaszewski, Piotr Andruszkiewicz, Alexandra Birch
  • 0

Abstract

Modular deep learning is the state-of-the-art solution for lifting the curseof multilinguality, preventing the impact of negative interference and enablingcross-lingual performance in Multilingual Pre-trained Language Models. However,a trade-off of this approach is the reduction in positive transfer learningfrom closely related languages. In response, we introduce a novel method calledlanguage arithmetic, which enables training-free post-processing to addressthis limitation. Extending the task arithmetic framework, we apply learning viaaddition to the language adapters, transitioning the framework from amulti-task to a multilingual setup. The effectiveness of the proposed solutionis demonstrated on three downstream tasks in a MAD-X-based set of cross-lingualschemes, acting as a post-processing procedure. Language arithmeticconsistently improves the baselines with significant gains, especially in themost challenging case of zero-shot application. Our code and models areavailable at https://github.com/mklimasz/language-arithmetic .

 

Quick Read (beta)

loading the full paper ...