Abstract
Many educational technologies use artificial intelligence (AI) that presents generated or produced language to the learner. We contend that all language, including all AI communication, encodes information about the identity of the human or humans who contributed to crafting the language. With AI communication, however, the user may index identity information that does not match the source. This can lead to representational harms if language associated with one cultural group is presented as "standard" or "neutral", if the language advantages one group over another, or if the language reinforces negative stereotypes. In this work, we discuss a case study using a Visual Question Generation (VQG) task that involves gathering crowdsourced data from targeted demographic groups. Generated questions will be presented to human evaluators to understand how they index the identity behind the language, whether and how they perceive any representational harms, and how they would ideally address any such harms caused by AI communication. We reflect on the educational applications of this work as well as the implications for equality, diversity, and inclusion (EDI).