Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models

Abstract

Recent advancements in reasoning with large language models (RLLMs), such asOpenAI-O1 and DeepSeek-R1, have demonstrated their impressive capabilities incomplex domains like mathematics and coding. A central factor in their successlies in the application of long chain-of-thought (Long CoT) characteristics,which enhance reasoning abilities and enable the solution of intricateproblems. However, despite these developments, a comprehensive survey on LongCoT is still lacking, limiting our understanding of its distinctions fromtraditional short chain-of-thought (Short CoT) and complicating ongoing debateson issues like "overthinking" and "test-time scaling." This survey seeks tofill this gap by offering a unified perspective on Long CoT. (1) We firstdistinguish Long CoT from Short CoT and introduce a novel taxonomy tocategorize current reasoning paradigms. (2) Next, we explore the keycharacteristics of Long CoT: deep reasoning, extensive exploration, andfeasible reflection, which enable models to handle more complex tasks andproduce more efficient, coherent outcomes compared to the shallower Short CoT.(3) We then investigate key phenomena such as the emergence of Long CoT withthese characteristics, including overthinking, and test-time scaling, offeringinsights into how these processes manifest in practice. (4) Finally, weidentify significant research gaps and highlight promising future directions,including the integration of multi-modal reasoning, efficiency improvements,and enhanced knowledge frameworks. By providing a structured overview, thissurvey aims to inspire future research and further the development of logicalreasoning in artificial intelligence.

Quick Read (beta)

loading the full paper ...