Efficient Gender Debiasing of Pre-trained Indic Language Models

Abstract

The gender bias present in the data on which language models are pre-trainedgets reflected in the systems that use these models. The model's intrinsicgender bias shows an outdated and unequal view of women in our culture andencourages discrimination. Therefore, in order to establish more equitablesystems and increase fairness, it is crucial to identify and mitigate the biasexisting in these models. While there is a significant amount of work in thisarea in English, there is a dearth of research being done in other gendered andlow resources languages, particularly the Indian languages. English is anon-gendered language, where it has genderless nouns. The methodologies forbias detection in English cannot be directly deployed in other genderedlanguages, where the syntax and semantics vary. In our paper, we measure genderbias associated with occupations in Hindi language models. Our majorcontributions in this paper are the construction of a novel corpus to evaluateoccupational gender bias in Hindi, quantify this existing bias in these systemsusing a well-defined metric, and mitigate it by efficiently fine-tuning ourmodel. Our results reflect that the bias is reduced post-introduction of ourproposed mitigation techniques. Our codebase is available publicly.

Quick Read (beta)

loading the full paper ...