Abstract
Trained machine learning models are increasingly used to perform high-impacttasks in areas such as law enforcement, medicine, education, and employment. Inorder to clarify the intended use cases of machine learning models and minimizetheir usage in contexts for which they are not well suited, we recommend thatreleased models be accompanied by documentation detailing their performancecharacteristics. In this paper, we propose a framework that we call modelcards, to encourage such transparent model reporting. Model cards are shortdocuments accompanying trained machine learning models that provide benchmarkedevaluation in a variety of conditions, such as across different cultural,demographic, or phenotypic groups (e.g., race, geographic location, sex,Fitzpatrick skin type) and intersectional groups (e.g., age and race, or sexand Fitzpatrick skin type) that are relevant to the intended applicationdomains. Model cards also disclose the context in which models are intended tobe used, details of the performance evaluation procedures, and other relevantinformation. While we focus primarily on human-centered machine learning modelsin the application fields of computer vision and natural language processing,this framework can be used to document any trained machine learning model. Tosolidify the concept, we provide cards for two supervised models: One trainedto detect smiling faces in images, and one trained to detect toxic comments intext. We propose model cards as a step towards the responsible democratizationof machine learning and related AI technology, increasing transparency into howwell AI technology works. We hope this work encourages those releasing trainedmachine learning models to accompany model releases with similar detailedevaluation numbers and other relevant documentation.