AmaSQuAD: A Benchmark for Amharic Extractive Question Answering

Abstract

This research presents a novel framework for translating extractivequestion-answering datasets into low-resource languages, as demonstrated by thecreation of the AmaSQuAD dataset, a translation of SQuAD 2.0 into Amharic. Themethodology addresses challenges related to misalignment between translatedquestions and answers, as well as the presence of multiple answer instances inthe translated context. For this purpose, we used cosine similarity utilizingembeddings from a fine-tuned BERT-based model for Amharic and Longest CommonSubsequence (LCS). Additionally, we fine-tune the XLM-R model on the AmaSQuADsynthetic dataset for Amharic Question-Answering. The results show animprovement in baseline performance, with the fine-tuned model achieving anincrease in the F1 score from 36.55% to 44.41% and 50.01% to 57.5% on theAmaSQuAD development dataset. Moreover, the model demonstrates improvement onthe human-curated AmQA dataset, increasing the F1 score from 67.80% to 68.80%and the exact match score from 52.50% to 52.66%.The AmaSQuAD dataset ispublicly available Datasets

Quick Read (beta)

loading the full paper ...