Abstract
Zero-shot artistic style transfer is an important image synthesis problemaiming at transferring arbitrary style into content images. However, thetrade-off between the generalization and efficiency in existing methods impedesa high quality zero-shot style transfer in real-time. In this paper, we resolvethis dilemma and propose an efficient yet effective Avatar-Net that enablesvisually plausible multi-scale transfer for arbitrary style. The key ingredientof our method is a style decorator that makes up the content features bysemantically aligned style features from an arbitrary style image, which doesnot only holistically match their feature distributions but also preservedetailed style patterns in the decorated features. By embedding this moduleinto an image reconstruction network that fuses multi-scale style abstractions,the Avatar-Net renders multi-scale stylization for any style image in onefeed-forward pass. We demonstrate the state-of-the-art effectiveness andefficiency of the proposed method in generating high-quality stylized images,with a series of applications include multiple style integration, videostylization and etc.