Abstract
Understanding how styles differ across languages is advantageous for trainingboth humans and computers to generate culturally appropriate text. We introducean explanation framework to extract stylistic differences from multilingual LMsand compare styles across languages. Our framework (1) generates comprehensivestyle lexica in any language and (2) consolidates feature importances from LMsinto comparable lexical categories. We apply this framework to comparepoliteness, creating the first holistic multilingual politeness dataset andexploring how politeness varies across four languages. Our approach enables aneffective evaluation of how distinct linguistic categories contribute tostylistic variations and provides interpretable insights into how peoplecommunicate differently around the world.