Abstract
Most social media users come from non-English speaking countries in the Global South, where much of the harmful content appears in local languages. Yet current AI-driven moderation systems struggle with the low-resource languages spoken in these regions. This work examines the systemic challenges in building automated moderation tools for these languages. We conducted semi-structured interviews with 22 AI experts working on detecting harmful content in four low-resource languages: Tamil (South Asia), Swahili (East Africa), Maghrebi Arabic (North Africa), and Quechua (South America). Our findings show that, beyond the well-known data scarcity in local languages, technical issues also undermine moderation efforts, including outdated machine translation systems, sentiment and toxicity models grounded in Western values, and unreliable language detection technologies. Even with more data, current language models and preprocessing pipelines, which are primarily designed for English, struggle with the morphological richness, linguistic complexity, and code-mixing of these languages. As a result, automated moderation in Tamil, Swahili, Maghrebi Arabic, and Quechua remains fraught with inaccuracies and blind spots. Based on our findings, we argue that these limitations are not just technical gaps but reflect deeper structural inequities that continue to reproduce historical power imbalances. We conclude by discussing multi-stakeholder approaches to improve automated moderation for low-resource languages.