Abstract
Large language models (LLMs) demonstrate exceptional capabilities, yet still face the hallucination issue. Typical text generation approaches adopt auto-regressive generation without deliberate reasoning, which often results in untrustworthy and factually inaccurate responses. In this paper, we propose HaluSearch, a novel framework that incorporates tree search-based algorithms (e.g., MCTS) to enable an explicit slow-thinking generation process for mitigating hallucinations of LLMs during inference. Specifically, HaluSearch frames text generation as a step-by-step reasoning process, using a self-evaluation reward model to score each generation step and guide the tree search towards the most reliable generation pathway, thereby fully exploiting the internal knowledge of LLMs. To balance efficiency and quality, we introduce a hierarchical thinking system switch mechanism inspired by the dual process theory in cognitive science, which dynamically alternates between fast and slow thinking modes at both the instance and step levels, adapting to the complexity of questions and reasoning states. We conduct extensive experiments on both English and Chinese datasets, and the results show that our approach significantly outperforms baseline approaches.
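To make the idea of reward-guided, step-level search concrete, the following is a minimal illustrative sketch and not the HaluSearch implementation. It substitutes a simple best-first search for MCTS and shows only an instance-level switch between fast and slow modes; the step-level switch is omitted. All names (`propose`, `reward`, `is_hard`, `fast_generate`, `slow_generate`, `generate`) are hypothetical stand-ins: `propose` plays the role of the LLM proposing candidate next steps, `reward` the self-evaluation reward model, and `is_hard` the switch that decides which mode to use.

```python
import heapq
from typing import Callable, List, Tuple

Path = Tuple[str, ...]  # a partial response, as a sequence of generation steps


def fast_generate(propose: Callable[[Path], List[str]], max_steps: int) -> Path:
    """Fast mode: take the first proposed step at every depth (no search)."""
    path: Path = ()
    for _ in range(max_steps):
        candidates = propose(path)
        if not candidates:
            break
        path += (candidates[0],)
    return path


def slow_generate(propose: Callable[[Path], List[str]],
                  reward: Callable[[Path], float],
                  max_steps: int, budget: int = 50) -> Path:
    """Slow mode: best-first search over partial paths, guided by step rewards.

    Assumption: a path is ranked by the mean reward of its steps.
    """
    # Heap entries: (negative mean reward, tie-breaker, path, total reward).
    frontier = [(0.0, 0, (), 0.0)]
    counter = 1
    best_path: Path = ()
    best_score = float("-inf")
    while frontier and budget > 0:
        _, _, path, total = heapq.heappop(frontier)
        budget -= 1
        candidates = propose(path) if len(path) < max_steps else []
        if not candidates:  # no further steps: treat the path as complete
            score = total / max(len(path), 1)
            if score > best_score:
                best_path, best_score = path, score
            continue
        for step in candidates:
            new_path = path + (step,)
            new_total = total + reward(new_path)  # score the newest step
            mean = new_total / len(new_path)
            heapq.heappush(frontier, (-mean, counter, new_path, new_total))
            counter += 1
    return best_path


def generate(question: str,
             is_hard: Callable[[str], bool],
             propose: Callable[[Path], List[str]],
             reward: Callable[[Path], float],
             max_steps: int = 3) -> Path:
    """Instance-level switch: search only when the question is judged hard."""
    if is_hard(question):
        return slow_generate(propose, reward, max_steps)
    return fast_generate(propose, max_steps)
```

In this sketch the fast path costs one proposal call per step, while the slow path spends up to `budget` node expansions, which is the efficiency and quality trade-off the switch is meant to manage.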