A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models

Abstract

Large language models (LLMs) demonstrate impressive performance on a widevariety of tasks, but they often struggle with tasks that require multi-stepreasoning or goal-directed planning. To address this, we take inspiration fromthe human brain, in which planning is accomplished via the recurrentinteraction of specialized modules in the prefrontal cortex (PFC). Thesemodules perform functions such as conflict monitoring, state prediction, stateevaluation, task decomposition, and task coordination. We find that LLMs aresometimes capable of carrying out these functions in isolation, but struggle toautonomously coordinate them in the service of a goal. Therefore, we propose ablack box architecture with multiple LLM-based (GPT-4) modules. Thearchitecture improves planning through the interaction of specializedPFC-inspired modules that break down a larger problem into multiple briefautomated calls to the LLM. We evaluate the combined architecture on threechallenging planning tasks -- graph traversal, Tower of Hanoi, and logistics --finding that it yields significant improvements over standard LLM methods(e.g., zero-shot prompting, in-context learning, and chain-of-thought). Theseresults demonstrate the benefit of utilizing knowledge from cognitiveneuroscience to improve planning in LLMs.

Quick Read (beta)

loading the full paper ...