Abstract
The paper analyzes the accuracy of publicly available object-recognition systems on a geographically diverse dataset. This dataset contains household items and was designed to have a more representative geographical coverage than commonly used image datasets in object recognition. We find that the systems perform relatively poorly on household items that commonly occur in countries with a low household income. Qualitative analyses suggest the drop in performance is primarily due to appearance differences within an object class (e.g., dish soap) and due to items appearing in a different context (e.g., toothbrushes appearing outside of bathrooms). The results of our study suggest that further work is needed to make object-recognition systems work equally well for people across different countries and income levels.