Abstract
While understanding the knowledge boundaries of LLMs is crucial to prevent hallucination, research on knowledge boundaries of LLMs has predominantly focused on English. In this work, we present the first study to analyze how LLMs recognize knowledge boundaries across different languages by probing their internal representations when processing known and unknown questions in multiple languages. Our empirical studies reveal three key findings: 1) LLMs' perceptions of knowledge boundaries are encoded in the middle to middle-upper layers across different languages; 2) Language differences in knowledge boundary perception follow a linear structure, which motivates our proposal of a training-free alignment method that effectively transfers knowledge boundary perception ability across languages, thereby helping reduce hallucination risk in low-resource languages; 3) Fine-tuning on bilingual question pair translation further enhances LLMs' recognition of knowledge boundaries across languages. Given the absence of standard testbeds for cross-lingual knowledge boundary analysis, we construct a multilingual evaluation suite comprising three representative types of knowledge boundary data. Our code and datasets are publicly available at https://github.com/DAMO-NLP-SG/LLM-Multilingual-Knowledge-Boundaries.