Failure Modes of LLMs for Causal Reasoning on Narratives

Abstract

The ability to robustly identify causal relationships is essential forautonomous decision-making and adaptation to novel scenarios. However,accurately inferring causal structure requires integrating both world knowledgeand abstract logical reasoning. In this work, we investigate the interactionbetween these two capabilities through the representative task of causalreasoning over narratives. Through controlled synthetic, semi-synthetic, andreal-world experiments, we find that state-of-the-art large language models(LLMs) often rely on superficial heuristics -- for example, inferring causalityfrom event order or recalling memorized world knowledge without attending tocontext. Furthermore, we show that simple reformulations of the task can elicitmore robust reasoning behavior. Our evaluation spans a range of causalstructures, from linear chains to complex graphs involving colliders and forks.These findings uncover systematic patterns in how LLMs perform causal reasoningand lay the groundwork for developing methods that better align LLM behaviorwith principled causal inference.

Quick Read (beta)

loading the full paper ...