Decomposed Reasoning with Reinforcement Learning for Relevance Assessment in UGC Platforms

Abstract

Retrieval-augmented generation (RAG) plays a critical role in user-generatedcontent (UGC) platforms, but its effectiveness depends heavily on accuraterelevance assessment of query-document pairs. Despite recent advances inapplying large language models (LLMs) to relevance modeling, UGC platformspresent unique challenges: 1) ambiguous user intent due to sparse user feedbackin RAG scenarios, and 2) substantial noise introduced by informal andunstructured language. To address these issues, we propose the ReinforcedReasoning Model for Relevance Assessment (R3A), which introduces a decomposedreasoning framework over queries and candidate documents before scoring. R3Afirst leverages auxiliary high-ranked documents within the platform to inferlatent query intent. It then performs verbatim fragment extraction to justifyrelevance decisions, thereby reducing errors caused by noisy UGC. Based on areinforcement learning framework, R3A is optimized to mitigate distortionsarising from ambiguous queries and unstructured content. Experimental resultsshow that R3A significantly outperforms existing baseline methods in terms ofrelevance accuracy, across both offline benchmarks and online experiments.

Quick Read (beta)

loading the full paper ...