Abstract
With the proliferation of algorithmic decision-making, increased scrutiny has been placed on these systems. This paper explores the relationship between the quality of the training data and the overall fairness of the models trained with such data in the context of supervised classification. We measure key fairness metrics across a range of algorithms over multiple image classification datasets that have varying levels of noise in both the labels and the training data itself. We define noise in the labels as inaccuracies in the labelling of the training set, and noise in the data as distortions of the training inputs themselves. By adding noise to the original datasets, we can explore the relationship between the quality of the training data and the fairness of the output of the models trained on that data.
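As an illustration of the two kinds of noise distinguished above, the following minimal sketch corrupts a set of labels and a set of pixel values. The specific noise models are assumptions made for this example only: labels are reassigned uniformly at random to a different class, and inputs receive zero-mean Gaussian noise. The abstract does not state which noise models were actually used, and the function names `add_label_noise` and `add_data_noise` are hypothetical.

```python
import random

def add_label_noise(labels, noise_rate, num_classes, rng):
    """Return a copy of `labels` with a fraction `noise_rate` reassigned to a different class.

    Assumed noise model: uniform flipping to any other class (requires num_classes >= 2).
    """
    noisy = list(labels)
    n_flip = round(noise_rate * len(noisy))
    for i in rng.sample(range(len(noisy)), n_flip):
        # An offset in [1, num_classes - 1] guarantees the new label differs from the old one.
        noisy[i] = (noisy[i] + rng.randrange(1, num_classes)) % num_classes
    return noisy

def add_data_noise(pixels, sigma, rng):
    """Add zero-mean Gaussian noise to pixel values in [0, 1], clipping the result to [0, 1].

    Assumed noise model: independent Gaussian noise per pixel with standard deviation `sigma`.
    """
    return [min(1.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in pixels]

# Example: corrupt 20% of the labels in a 10-class problem and perturb one flattened 8x8 image.
rng = random.Random(0)
labels = [i % 10 for i in range(100)]
noisy_labels = add_label_noise(labels, 0.2, 10, rng)
noisy_pixels = add_data_noise([0.5] * 64, 0.1, rng)
```

Varying `noise_rate` and `sigma` independently would give a grid of degraded training sets on which fairness metrics could then be compared.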