The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities

  • 2025-01-23 18:59:02
  • Chan-Jan Hsu, Chia-Sheng Liu, Meng-Hsi Chen, Muxi Chen, Po-Chun Hsu, Yi-Chang Chen, Da-Shan Shiu
  • 0

Abstract

Breeze 2 is a suite of advanced multi-modal language models, available in 3Band 8B parameter configurations, specifically designed to enhance TraditionalChinese language representation. Building upon the Llama 3, Breeze 2 continuespretraining on an extensive corpus to enhance the linguistic and culturalheritage of Traditional Chinese. It incorporates vision-aware capabilitiesthrough a visual encoder and a bridge module, and supports function-calling viaprompt templates and post-training on function-calling data. The effectivenessof Breeze 2 is benchmarked across various tasks, including Taiwan generalknowledge, instruction-following, long context, function calling, and visionunderstanding. Furthermore, we showcase the capabilities of the its 3B model ina mobile application. We are publicly releasing all Breeze 2 models under theLlama 3 Community License.

 

Quick Read (beta)

loading the full paper ...