Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging - An Open Recipe

Abstract

This paper investigates data selection and model merging methodologies aimedat incorporating advanced reasoning capabilities such as those of DeepSeek R1into language-specific large language models (LLMs), with a particular focus onthe Thai LLM. Our goal is to enhance the reasoning capabilities oflanguage-specific LLMs while maintaining their target language abilities.DeepSeek R1 excels in reasoning but primarily benefits high-resource languagessuch as English and Chinese. However, low-resource languages remain underserveddue to the dominance of English-centric training data and model optimizations,which limit performance in these languages. This limitation results inunreliable code-switching and diminished effectiveness on tasks in low-resourcelanguages. Meanwhile, local and regional LLM initiatives have attempted tobridge this gap by developing language-specific LLMs that focus on improvinglocal linguistic fidelity. We demonstrate that, with only publicly availabledatasets and a computational budget of $120, it is possible to enhance thereasoning capabilities of language-specific LLMs to match the level of DeepSeekR1, without compromising their performance on target language tasks.

Quick Read (beta)

loading the full paper ...