Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

  • 2025-05-20 18:59:31
  • Haolei Xu, Yuchen Yan, Yongliang Shen, Wenqi Zhang, Guiyang Hou, Shengpei Jiang, Kaitao Song, Weiming Lu, Jun Xiao, Yueting Zhuang
  • 0

Abstract

Large language models (LLMs) have achieved remarkable progress onmathemati-cal tasks through Chain-of-Thought (CoT) reasoning. However, existingmathematical CoT datasets often suffer from Thought Leaps due to expertsomitting intermediate steps, which negatively impacts model learning andgeneralization. We propose the CoT Thought Leap Bridge Task, which aims toautomatically detect leaps and generate missing intermediate reasoning steps torestore the completeness and coherence of CoT. To facilitate this, weconstructed a specialized training dataset called ScaleQM+, based on thestructured ScaleQuestMath dataset, and trained CoT-Bridge to bridge thoughtleaps. Through comprehensive experiments on mathematical reasoning benchmarks,we demonstrate that models fine-tuned on bridged datasets consistentlyoutperform those trained on original datasets, with improvements of up to+5.87% on NuminaMath. Our approach effectively enhances distilled data (+3.02%)and provides better starting points for reinforcement learning (+3.1%),functioning as a plug-and-play module compatible with existing optimizationtechniques. Furthermore, CoT-Bridge demonstrate improved generalization toout-of-domain logical reasoning tasks, confirming that enhancing reasoningcompleteness yields broadly applicable benefits.

 

Quick Read (beta)

loading the full paper ...