Abstract
Multilingual reasoning remains a significant challenge for large language models (LLMs), with performance disproportionately favoring high-resource languages. Drawing inspiration from cognitive neuroscience, which suggests that human reasoning functions largely independently of language processing, we hypothesize that LLMs similarly encode reasoning and language as separable components that can be disentangled to enhance multilingual reasoning. To evaluate this, we perform a causal intervention by ablating language-specific representations at inference time. Experiments on 10 open-source LLMs spanning 11 typologically diverse languages show that this language-specific ablation consistently boosts multilingual reasoning performance. Layer-wise analyses further confirm that language and reasoning representations can be effectively decoupled throughout the model, yielding improved multilingual reasoning capabilities, while preserving top-layer language features remains essential for maintaining linguistic fidelity. Compared to post-training such as supervised fine-tuning or reinforcement learning, our training-free ablation achieves comparable or superior results with minimal computational overhead. These findings shed light on the internal mechanisms underlying multilingual reasoning in LLMs and suggest a lightweight and interpretable strategy for improving cross-lingual generalization.
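As an illustration of what an inference-time ablation of language-specific representations could look like, the following is a minimal sketch only. It assumes, purely for illustration, that the language-specific component of a hidden state lies in a linear subspace spanned by a few known direction vectors, and it removes that component by orthogonal projection. The abstract does not specify the procedure, so the function names (`orthonormalize`, `ablate`), the `strength` parameter, and the toy vectors are all hypothetical.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def orthonormalize(directions, eps=1e-12):
    """Gram-Schmidt: return an orthonormal basis for the span of `directions`."""
    basis = []
    for d in directions:
        w = list(d)
        for b in basis:
            c = dot(w, b)
            w = [wi - c * bi for wi, bi in zip(w, b)]
        norm = math.sqrt(dot(w, w))
        if norm > eps:  # skip directions already covered by the basis
            basis.append([wi / norm for wi in w])
    return basis


def ablate(hidden, directions, strength=1.0):
    """Subtract the part of `hidden` that lies in the span of `directions`.

    Assumption: `directions` span a (hypothetical) language-specific subspace.
    With strength=1.0 the component in that subspace is removed entirely.
    """
    out = list(hidden)
    for b in orthonormalize(directions):
        c = dot(hidden, b)
        out = [oi - strength * c * bi for oi, bi in zip(out, b)]
    return out


# Toy example: the assumed language subspace is the plane of the first two axes.
hidden_state = [3.0, 1.0, 2.0]
language_directions = [[1.0, 0.0, 0.0], [1.0, 1.0, 0.0]]
ablated = ablate(hidden_state, language_directions)
```

In this toy case only the third coordinate survives, since the first two axes are treated as language-specific. In a real model, the same projection would be applied to hidden states at chosen layers during the forward pass, and how the subspace is estimated is a separate question not covered by this sketch.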