EmbeddingGemma: Powerful and Lightweight Text Representations

  • 2025-09-24 17:56:51
  • Henrique Schechter Vera, Sahil Dua, Biao Zhang, Daniel Salz, Ryan Mullins, Sindhu Raghuram Panyam, Sara Smoot, Iftekhar Naim, Joe Zou, Feiyang Chen, Daniel Cer, Alice Lisak, Min Choi, Lucas Gonzalez, Omar Sanseviero, Glenn Cameron, Ian Ballantyne, Kat Black, Kaifeng Chen, Weiyi Wang, Zhe Li, Gus Martins, Jinhyuk Lee, Mark Sherwood, Juyeong Ji, Renjie Wu, Jingxiao Zheng, Jyotinder Singh, Abheesht Sharma, Divya Sreepat, Aashi Jain, Adham Elarabawy, AJ Co, Andreas Doumanoglou, Babak Samari, Ben Hora, Brian Potetz, Dahun Kim, Enrique Alfonseca, Fedor Moiseev, Feng Han, Frank Palma Gomez, Gustavo Hernández Ábrego, Hesen Zhang, Hui Hui, Jay Han, Karan Gill, Ke Chen, Koert Chen, Madhuri Shanbhogue, Michael Boratko, Paul Suganthan, Sai Meher Karthik Duddu, Sandeep Mariserla, Setareh Ariafar, Shanf
  • 0

Abstract

We introduce EmbeddingGemma, a new lightweight, open text embedding modelbased on the Gemma 3 language model family. Our innovative training recipestrategically captures knowledge from larger models via encoder-decoderinitialization and geometric embedding distillation. We improve modelrobustness and expressiveness with a spread-out regularizer, and ensuregeneralizability by merging checkpoints from varied, optimized mixtures.Evaluated on the Massive Text Embedding Benchmark (MTEB) across multilingual,English, and code domains, EmbeddingGemma (300M) achieves state-of-the-artresults. Notably, it outperforms prior top models, both proprietary and open,with fewer than 500M parameters, and provides performance comparable to modelsdouble its size, offering an exceptional performance-to-cost ratio. Remarkably,this lead persists when quantizing model weights or truncating embeddingoutputs. This makes EmbeddingGemma particularly well-suited for low-latency andhigh-throughput use cases such as on-device applications. We provide ablationstudies exploring our key design choices. We release EmbeddingGemma to thecommunity to promote further research.

 

Quick Read (beta)

loading the full paper ...