Abstract
Breeze 2 is a suite of advanced multi-modal language models, available in 3B and 8B parameter configurations, specifically designed to enhance Traditional Chinese language representation. Building upon Llama 3, Breeze 2 continues pretraining on an extensive corpus to better capture the linguistic and cultural heritage of Traditional Chinese. It incorporates vision-aware capabilities through a visual encoder and a bridge module, and supports function calling via prompt templates and post-training on function-calling data. The effectiveness of Breeze 2 is benchmarked across various tasks, including Taiwan general knowledge, instruction following, long context, function calling, and vision understanding. Furthermore, we showcase the capabilities of the 3B model in a mobile application. We are publicly releasing all Breeze 2 models under the Llama 3 Community License.