SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Abstract

Large Language Models (LLMs) have shown remarkable abilities across varioustasks, yet their development has predominantly centered on high-resourcelanguages like English and Chinese, leaving low-resource languages underserved.To address this disparity, we present SeaLLMs 3, the latest iteration of theSeaLLMs model family, tailored for Southeast Asian languages. This region,characterized by its rich linguistic diversity, has lacked adequate languagetechnology support. SeaLLMs 3 aims to bridge this gap by covering acomprehensive range of languages spoken in this region, including English,Chinese, Indonesian, Vietnamese, Thai, Tagalog, Malay, Burmese, Khmer, Lao,Tamil, and Javanese. Leveraging efficient language enhancement techniques and aspecially constructed instruction tuning dataset, SeaLLMs 3 significantlyreduces training costs while maintaining high performance and versatility. Ourmodel excels in tasks such as world knowledge, mathematical reasoning,translation, and instruction following, achieving state-of-the-art performanceamong similarly sized models. Additionally, we prioritized safety andreliability by addressing both general and culture-specific considerations andincorporated mechanisms to reduce hallucinations. This work underscores theimportance of inclusive AI, showing that advanced LLM capabilities can benefitunderserved linguistic and cultural communities.

Quick Read (beta)

loading the full paper ...