Abstract
The effectiveness of instruction fine-tuning for Large Language Models is fundamentally constrained by the quality and efficiency of training datasets. This work introduces Low-Confidence Gold (LCG), a novel filtering framework that employs centroid-based clustering and confidence-guided selection to identify valuable instruction pairs. Through a semi-supervised approach using a lightweight classifier trained on representative samples, LCG curates high-quality subsets while preserving data diversity. Experimental evaluation demonstrates that models fine-tuned on LCG-filtered subsets of 6K samples achieve superior performance compared to existing methods, with substantial improvements on MT-bench and consistent gains across comprehensive evaluation metrics. The framework's data efficiency, achieved without sacrificing model performance, establishes a promising direction for efficient instruction tuning.
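
To make the pipeline concrete, the following is a minimal illustrative sketch of centroid-based clustering followed by confidence-guided selection. It is not the authors' implementation. Several choices are assumptions made purely for illustration: instruction pairs are taken to be already embedded as fixed-length vectors, plain k-means stands in for the clustering step, a nearest-class-mean softmax stands in for the lightweight classifier, and, as the method's name suggests, the lowest-confidence samples in each cluster are the ones retained. All function and parameter names (`kmeans`, `confidences`, `select_low_confidence`, `n_repr`, `per_cluster`, `temperature`) are hypothetical.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def nearest(p, centroids):
    """Index of the centroid closest to point p."""
    return min(range(len(centroids)), key=lambda c: sq_dist(p, centroids[c]))


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means (assumed clustering step); returns (centroids, assignments)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        assign = [nearest(p, centroids) for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old centroid if a cluster empties
                centroids[c] = mean(members)
    # Final assignment consistent with the returned centroids.
    return centroids, [nearest(p, centroids) for p in points]


def confidences(points, class_means, temperature=1.0):
    """Max softmax probability over negative squared distances to class means.

    Stand-in for a lightweight classifier trained on pseudo-labeled samples.
    """
    out = []
    for p in points:
        logits = [-sq_dist(p, m) / temperature for m in class_means]
        top = max(logits)
        exps = [math.exp(l - top) for l in logits]
        out.append(max(exps) / sum(exps))
    return out


def select_low_confidence(points, k, per_cluster, n_repr=3, temperature=1.0, seed=0):
    """Return sorted indices of the lowest-confidence samples in each cluster."""
    centroids, assign = kmeans(points, k, seed=seed)
    # Representative samples: the n_repr points closest to each centroid,
    # pseudo-labeled with their cluster id and summarized by their mean.
    class_means = []
    for c in range(k):
        members = [p for p, a in zip(points, assign) if a == c]
        members.sort(key=lambda p: sq_dist(p, centroids[c]))
        class_means.append(mean(members[:n_repr]) if members else centroids[c])
    conf = confidences(points, class_means, temperature)
    selected = []
    for c in range(k):
        # Selecting per cluster keeps every cluster represented (diversity).
        idx = [i for i, a in enumerate(assign) if a == c]
        idx.sort(key=lambda i: conf[i])
        selected.extend(idx[:per_cluster])
    return sorted(selected)
```

In this sketch, drawing a fixed quota from every cluster is what preserves diversity, while ranking by classifier confidence within each cluster is what favors samples the classifier finds hard. A real pipeline would replace the toy classifier with a trained model and would likely tune the quota to reach a target subset size such as the 6K samples reported above.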