GENIUS: A Generative Framework for Universal Multimodal Search

  • 2025-03-25 18:32:31
  • Sungyeon Kim, Xinliang Zhu, Xiaofan Lin, Muhammet Bastan, Douglas Gray, Suha Kwak

Abstract

Generative retrieval is an emerging approach in information retrieval that generates identifiers (IDs) of target data based on a query, providing an efficient alternative to traditional embedding-based retrieval methods. However, existing models are task-specific and fall short of embedding-based retrieval in performance. This paper proposes GENIUS, a universal generative retrieval framework supporting diverse tasks across multiple modalities and domains. At its core, GENIUS introduces modality-decoupled semantic quantization, transforming multimodal data into discrete IDs encoding both modality and semantics. Moreover, to enhance generalization, we propose a query augmentation that interpolates between a query and its target, allowing GENIUS to adapt to varied query forms. Evaluated on the M-BEIR benchmark, it surpasses prior generative methods by a clear margin. Unlike embedding-based retrieval, GENIUS consistently maintains high retrieval speed across database size, with competitive performance across multiple benchmarks. With additional re-ranking, GENIUS often achieves results close to those of embedding-based methods while preserving efficiency.
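
The query augmentation mentioned in the abstract can be illustrated with a minimal sketch. The abstract only states that the augmentation interpolates between a query and its target, so everything else here is an assumption made for illustration: that the interpolation is linear, that it operates on embedding vectors of equal dimension, and that the mixing coefficient is drawn uniformly at random. The function names `interpolate` and `augment_query` are hypothetical and are not taken from the paper.

```python
import random
from typing import List, Optional


def interpolate(query: List[float], target: List[float], alpha: float) -> List[float]:
    """Linearly mix two embeddings.

    alpha = 1.0 returns the query unchanged; alpha = 0.0 returns the target.
    Linear interpolation is an assumption; the abstract does not give the formula.
    """
    if len(query) != len(target):
        raise ValueError("query and target embeddings must have the same dimension")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return [alpha * q + (1.0 - alpha) * t for q, t in zip(query, target)]


def augment_query(
    query: List[float],
    target: List[float],
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Return a randomly interpolated query embedding.

    Sampling alpha uniformly from [0, 1) is an assumption for illustration only.
    """
    rng = rng or random.Random()
    alpha = rng.random()
    return interpolate(query, target, alpha)


if __name__ == "__main__":
    q = [0.0, 2.0]
    t = [4.0, 6.0]
    print(interpolate(q, t, 0.5))                # midpoint of the two embeddings
    print(augment_query(q, t, random.Random(0)))  # a randomly mixed variant
```

Under these assumptions, each augmented query lies on the segment between the original query and its target, which is one plausible way a model could be exposed to varied query forms that still map to the same target ID.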

 
