One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models

Abstract

Large language models (LLMs) have learned vast amounts of factual knowledge through self-supervised pre-training on large-scale corpora. LLMs have also demonstrated excellent multilingual capabilities, which allow them to express the learned knowledge in multiple languages. However, the knowledge storage mechanism in LLMs remains mysterious. Some researchers have attempted to demystify the factual knowledge in LLMs from the perspective of knowledge neurons, and subsequently discovered language-agnostic knowledge neurons that store factual knowledge in a form that transcends language barriers. However, this preliminary finding suffers from two limitations: 1) High Uncertainty in Localization Results. The existing study uses only a prompt-based probe to localize knowledge neurons for each fact, while LLMs cannot provide consistent answers for semantically equivalent queries. This leads to inaccurate localization results with high uncertainty. 2) Lack of Analysis in More Languages. The study analyzes language-agnostic knowledge neurons only on English and Chinese data, without exploring more language families and languages, which limits the generalizability of the findings. To address these problems, we first construct a new benchmark called Rephrased Multilingual LAMA (RML-LAMA), which contains high-quality cloze-style multilingual parallel queries for each fact. Then, we propose a novel method named Multilingual Integrated Gradients with Uncertainty Estimation (MATRICE), which quantifies the uncertainty across queries and languages during knowledge localization. Extensive experiments show that our method can accurately localize language-agnostic knowledge neurons. We further investigate the role of language-agnostic knowledge neurons in cross-lingual knowledge editing, knowledge enhancement, and new knowledge injection.
