CORE-RAG: Lossless Compression for Retrieval-Augmented LLMs via Reinforcement Learning

  • 2025-09-19 09:16:41
  • Ziqiang Cui, Yunpeng Weng, Xing Tang, Peiyang Liu, Shiwei Li, Bowei He, Jiamin Chen, Yansen Zhang, Xiuqiang He, Chen Ma

Abstract

Retrieval-Augmented Generation (RAG) has emerged as a promising approach to enhance the timeliness of knowledge and the factual accuracy of responses in Large Language Models (LLMs). However, the inclusion of excessive retrieved documents substantially increases the input length, leading to higher computational costs. Previous studies have attempted to compress retrieved documents into shorter texts before in-context integration, but such methods often compromise end-task performance. The lack of well-defined compression targets forces many approaches to rely on fixed heuristics, which cannot guarantee that the compressed content will effectively support the end task. To address these limitations, we propose CORE, a novel method designed to achieve lossless context compression for RAG. CORE employs reinforcement learning to optimize the compression process without relying on predefined compression labels, which enables the compressor to generate summaries that maximize the accuracy of answers generated by the LLM. Extensive experiments on four datasets demonstrate the superiority of our approach. With a high compression ratio of 3%, our method not only avoids performance degradation compared to prepending full documents across all datasets but also improves the average Exact Match (EM) score by 3.3 points. The code will be released soon.
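
The two quantities the abstract reports, Exact Match and compression ratio, can be illustrated with a minimal Python sketch. This is not the authors' code, which has not been released. The function names are hypothetical, the answer normalization follows the common open-domain QA convention (lowercasing, stripping punctuation and English articles), and whitespace token counting is an assumption made only to keep the example self-contained; the paper may count tokens differently.

```python
import re
import string


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold_answers: list[str]) -> float:
    """Return 1.0 if the normalized prediction equals any normalized gold answer."""
    pred = normalize_answer(prediction)
    return float(any(pred == normalize_answer(gold) for gold in gold_answers))


def compression_ratio(compressed: str, original: str) -> float:
    """Compressed length divided by original length.

    Assumption: lengths are whitespace-token counts, not model tokens.
    """
    original_len = len(original.split())
    if original_len == 0:
        raise ValueError("original text is empty")
    return len(compressed.split()) / original_len
```

Under these assumptions, a 3-token summary of a 100-token retrieved context has a compression ratio of 3%. An end-task signal such as `exact_match` applied to the LLM's answer is the kind of quantity a compressor could be rewarded on when no compression labels exist; the exact reward used by CORE is not specified in the abstract.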

 
