Abstract
Leveraging the autonomous decision-making capabilities of large language models (LLMs) has yielded superior performance on reasoning tasks. However, despite the success of iterative and agentic retrieval-augmented generation (RAG) techniques, these methods are often constrained to a single solution space when confronted with complex problems. In this paper, we propose AirRAG, a novel thinking pattern for RAG that integrates autonomous strategic planning with efficient reasoning actions. AirRAG significantly activates the intrinsic reasoning capabilities of LLMs and expands the solution space of specific tasks via Monte Carlo Tree Search (MCTS). Specifically, we design five fundamental reasoning actions, which MCTS expands into a broad tree-based reasoning space. The approach also incorporates self-consistency verification to explore potential reasoning paths and the inference scaling law. Additionally, computationally optimal strategies allocate more inference resources to key actions, thereby enhancing overall performance. Experimental results demonstrate the effectiveness of AirRAG, showing significant performance gains on complex question-answering datasets. Furthermore, AirRAG is flexible and lightweight, making it easy to integrate with other advanced technologies and models.