Abstract
Achieving human-level intelligence requires refining the transition from the fast, intuitive System 1 to the slower, more deliberate System 2 reasoning. While System 1 excels in quick, heuristic decisions, System 2 relies on logical reasoning for more accurate judgments and reduced biases. Foundational Large Language Models (LLMs) excel at fast decision-making but lack the depth for complex reasoning, as they have not yet fully embraced the step-by-step analysis characteristic of true System 2 thinking. Recently, reasoning LLMs like OpenAI's o1/o3 and DeepSeek's R1 have demonstrated expert-level performance in fields such as mathematics and coding, closely mimicking the deliberate reasoning of System 2 and showcasing human-like cognitive abilities. This survey begins with a brief overview of the progress in foundational LLMs and the early development of System 2 technologies, exploring how their combination has paved the way for reasoning LLMs. Next, we discuss how to construct reasoning LLMs, analyzing their features, the core methods enabling advanced reasoning, and the evolution of various reasoning LLMs. Additionally, we provide an overview of reasoning benchmarks, offering an in-depth comparison of the performance of representative reasoning LLMs. Finally, we explore promising directions for advancing reasoning LLMs and maintain a real-time \href{https://github.com/zzli2022/Awesome-Slow-Reason-System}{GitHub Repository} to track the latest developments. We hope this survey will serve as a valuable resource to inspire innovation and drive progress in this rapidly evolving field.