Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

Abstract

Large language models (LLMs) have achieved remarkable progress onmathemati-cal tasks through Chain-of-Thought (CoT) reasoning. However, existingmathematical CoT datasets often suffer from Thought Leaps due to expertsomitting intermediate steps, which negatively impacts model learning andgeneralization. We propose the CoT Thought Leap Bridge Task, which aims toautomatically detect leaps and generate missing intermediate reasoning steps torestore the completeness and coherence of CoT. To facilitate this, weconstructed a specialized training dataset called ScaleQM+, based on thestructured ScaleQuestMath dataset, and trained CoT-Bridge to bridge thoughtleaps. Through comprehensive experiments on mathematical reasoning benchmarks,we demonstrate that models fine-tuned on bridged datasets consistentlyoutperform those trained on original datasets, with improvements of up to+5.87% on NuminaMath. Our approach effectively enhances distilled data (+3.02%)and provides better starting points for reinforcement learning (+3.1%),functioning as a plug-and-play module compatible with existing optimizationtechniques. Furthermore, CoT-Bridge demonstrate improved generalization toout-of-domain logical reasoning tasks, confirming that enhancing reasoningcompleteness yields broadly applicable benefits.

Quick Read (beta)

loading the full paper ...