Abstract
Recently, there has been growing interest in developing saliency methods that provide visual explanations of network predictions. Still, the usability of existing methods is limited to image classification models. To overcome this limitation, we extend the existing approaches to generate grid saliencies, which provide spatially coherent visual explanations for (pixel-level) dense prediction networks. As the proposed grid saliency makes it possible to spatially disentangle the object and its context, we specifically explore its potential to produce context explanations for semantic segmentation networks, discovering which context most influences the class predictions inside a target object area. We investigate the effectiveness of grid saliency on a synthetic dataset with an artificially induced bias between objects and their context, as well as on the real-world Cityscapes dataset using state-of-the-art segmentation networks. Our results show that grid saliency can be successfully used to provide easily interpretable context explanations and, moreover, can be employed for detecting and localizing contextual biases present in the data.