Abstract
Recent research has investigated the shape and texture biases of deep neural networks (DNNs) in image classification, which influence their generalization capabilities and robustness. It has been shown that, in comparison to regular DNN training, training with stylized images reduces texture biases in image classification and improves robustness with respect to image corruptions. In an effort to advance this line of research, we examine whether style transfer can likewise deliver these two effects in semantic segmentation. To this end, we perform style transfer with style varying across artificial image areas. Those random areas are formed by a chosen number of Voronoi cells. The resulting style-transferred data is then used to train semantic segmentation DNNs with the objective of reducing their dependence on texture cues while enhancing their reliance on shape-based features. Our experiments show that, in semantic segmentation, style transfer augmentation reduces texture bias and strongly increases robustness with respect to common image corruptions as well as adversarial attacks. These observations hold for convolutional neural networks and transformer architectures on the Cityscapes dataset as well as on PASCAL Context, showing the generality of the proposed method.
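
To illustrate the idea of style varying across Voronoi cells, the following is a minimal sketch and not the implementation used in this work. It assumes that the image has already been stylized once per style by some style-transfer model (not shown), and that the per-cell result is obtained by compositing those versions. The function names `voronoi_label_map` and `compose_by_cells`, the uniform sampling of cell sites, and the compositing order are all assumptions made for illustration.

```python
import numpy as np


def voronoi_label_map(height, width, num_cells, seed=0):
    """Assign every pixel to the nearest of `num_cells` random sites.

    Hypothetical helper: sites are drawn uniformly over the image plane.
    Returns an integer array of shape (height, width) with values in
    [0, num_cells).
    """
    rng = np.random.default_rng(seed)
    sites = np.column_stack([
        rng.uniform(0, height, num_cells),  # row coordinates of the sites
        rng.uniform(0, width, num_cells),   # column coordinates of the sites
    ])
    rows, cols = np.mgrid[0:height, 0:width]
    pixels = np.stack([rows, cols], axis=-1).astype(float)  # (H, W, 2)
    # Squared Euclidean distance from every pixel to every site: (H, W, K)
    d2 = ((pixels[:, :, None, :] - sites[None, None, :, :]) ** 2).sum(axis=-1)
    return d2.argmin(axis=-1)


def compose_by_cells(stylized_versions, labels):
    """Pick, for each pixel, the stylized version indexed by its cell label.

    stylized_versions: array-like of shape (K, H, W, C), one fully stylized
    copy of the image per cell (assumed to be produced elsewhere).
    labels: integer array of shape (H, W) from `voronoi_label_map`.
    """
    stacked = np.asarray(stylized_versions)
    rows, cols = np.indices(labels.shape)
    return stacked[labels, rows, cols]  # (H, W, C)


# Toy usage: constant-valued "stylized" images stand in for real outputs.
labels = voronoi_label_map(32, 48, num_cells=4)
fake_styles = np.stack([np.full((32, 48, 3), k) for k in range(4)])
mixed = compose_by_cells(fake_styles, labels)
```

Because the cell boundaries are unrelated to object boundaries, the segmentation labels of the original image can be kept unchanged for the composited image; whether styles are applied before or after partitioning is a design choice that the abstract does not specify.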