Abstract
Recent studies have shown remarkable success in image-to-image translation between two domains. However, existing approaches have limited scalability and robustness when handling more than two domains, since a separate model must be built independently for every pair of image domains. To address this limitation, we propose StarGAN, a novel and scalable approach that can perform image-to-image translation across multiple domains using only a single model. This unified model architecture allows simultaneous training on multiple datasets with different domains within a single network. This leads to StarGAN's superior quality of translated images compared to existing models, as well as the novel capability of flexibly translating an input image to any desired target domain. We empirically demonstrate the effectiveness of our approach on facial attribute transfer and facial expression synthesis tasks.