Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models

Abstract

Recently, Large Reasoning Models (LRMs) have become a research hotspot due to their outstanding performance on complex tasks. Among them, DeepSeek R1 has garnered significant attention for its exceptional performance and open-source nature, driving advances in research on R1-style LRMs. Unlike traditional Large Language Models (LLMs), these models enhance logical deduction and decision-making capabilities during reasoning by incorporating mechanisms such as long chain-of-thought and self-reflection through reinforcement learning. However, with the widespread application of these models, the problem of overthinking has emerged. Specifically, when generating answers, these models often construct excessively long reasoning chains with redundant or repetitive steps, which reduces reasoning efficiency and may affect the accuracy of the final answer. To this end, various efficient reasoning methods have been proposed, aiming to reduce the length of reasoning paths without compromising model performance and reasoning capability. By systematically reviewing current research on efficient reasoning methods, we categorize existing works into two main directions through the lens of single-model optimization versus model collaboration: (1) Efficient Reasoning with Single Model, which focuses on improving the reasoning efficiency of individual models; and (2) Efficient Reasoning with Model Collaboration, which explores optimizing reasoning paths through collaboration among multiple models. In addition, we maintain a public GitHub repository that tracks the latest progress in efficient reasoning methods.
