Code-mixed LLM: Improve Large Language Models' Capability to Handle Code-Mixing through Reinforcement Learning from AI Feedback

Abstract

Code-mixing(CM) or code-switching(CSW) refers to the juxtaposition oflinguistic units from two or more languages during the conversation orsometimes even a single utterance. Code-mixing introduces unique challenges indaily life, such as syntactic mismatches and semantic blending, that are rarelyencountered in monolingual settings. Large language models (LLMs) haverevolutionized the field of natural language processing (NLP) by offeringunprecedented capabilities in understanding human languages. However, theeffectiveness of current state-of-the-art multilingual LLMs has not yet beenfully explored in the CM scenario. To fill this gap, we first benchmark theperformance of multilingual LLMs on various code-mixing NLP tasks. Then wepropose to improve the multilingual LLMs' ability to understand code-mixingthrough reinforcement learning from human feedback (RLHF) and code-mixedmachine translation tasks. Given the high-cost and time-consuming preferencelabeling procedure, we improve this by utilizing LLMs as annotators to performthe reinforcement learning from AI feedback (RLAIF). The experiments show theeffectiveness of the proposed method.

Quick Read (beta)

loading the full paper ...