Abstract
Kenyan Sign Language (KSL) is the primary language used by the deaf communityin Kenya. It is the medium of instruction from Pre-primary 1 to universityamong deaf learners, facilitating their education and academic achievement.Kenyan Sign Language is used for social interaction, expression of needs,making requests and general communication among persons who are deaf in Kenya.However, there exists a language barrier between the deaf and the hearingpeople in Kenya. Thus, the innovation on AI4KSL is key in eliminating thecommunication barrier. Artificial intelligence for KSL is a two-year researchproject (2023-2024) that aims to create a digital open-access AI of spontaneousand elicited data from a representative sample of the Kenyan deaf community.The purpose of this study is to develop AI assistive technology dataset thattranslates English to KSL as a way of fostering inclusion and bridging languagebarriers among deaf learners in Kenya. Specific objectives are: Build KSLdataset for spoken English and video recorded Kenyan Sign Language and to buildtranscriptions of the KSL signs to a phonetic-level interface of the signlanguage. In this paper, the methodology for building the dataset is described.Data was collected from 48 teachers and tutors of the deaf learners and 400learners who are Deaf. Participants engaged mainly in sign language elicitationtasks through reading and singing. Findings of the dataset consisted of about14,000 English sentences with corresponding KSL Gloss derived from a pool ofabout 4000 words and about 20,000 signed KSL videos that are either signedwords or sentences. The second level of data outcomes consisted of 10,000 splitand segmented KSL videos. The third outcome of the dataset consists of 4,000transcribed words into five articulatory parameters according to HamNoSyssystem.