Abstract
Retrieval-Augmented Generation (RAG) systems typically rely on a single fixedretriever, despite growing evidence that no single retriever performs optimallyacross all query types. In this paper, we explore a query routing approach thatdynamically selects from a pool of retrievers based on the query, using bothtrain-free heuristics and learned routing models. We frame routing as alearning-to-rank (LTR) problem and introduce LTRR, a framework that learns torank retrievers by their expected utility gain to downstream LLM performance.Our experiments, conducted on synthetic QA data with controlled query typevariations, show that routing-based RAG systems can outperform the bestsingle-retriever-based systems. Performance gains are especially pronounced inmodels trained with the Answer Correctness (AC) metric and with pairwiselearning approaches, especially with XGBoost. We also observe improvements ingeneralization to out-of-distribution queries. As part of the SIGIR 2025LiveRAG challenge, our submitted system demonstrated the practical viability ofour approach, achieving competitive performance in both answer correctness andfaithfulness. These findings highlight the importance of both trainingmethodology and metric selection in query routing for RAG systems.