OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Abstract

Machine writing with large language models often relies onretrieval-augmented generation. However, these approaches remain confinedwithin the boundaries of the model's predefined scope, limiting the generationof content with rich information. Specifically, vanilla-retrieved informationtends to lack depth, utility, and suffers from redundancy, which negativelyimpacts the quality of generated articles, leading to shallow, repetitive, andunoriginal outputs. To address these issues, we propose OmniThink, a machinewriting framework that emulates the human-like process of iterative expansionand reflection. The core idea behind OmniThink is to simulate the cognitivebehavior of learners as they progressively deepen their knowledge of thetopics. Experimental results demonstrate that OmniThink improves the knowledgedensity of generated articles without compromising metrics such as coherenceand depth. Human evaluations and expert feedback further highlight thepotential of OmniThink to address real-world challenges in the generation oflong-form articles.

Quick Read (beta)

loading the full paper ...