Guided Image Generation with Conditional Invertible Neural Networks

Abstract

In this work, we address the task of natural image generation guided by aconditioning input. We introduce a new architecture called conditionalinvertible neural network (cINN). The cINN combines the purely generative INNmodel with an unconstrained feed-forward network, which efficientlypreprocesses the conditioning input into useful features. All parameters of thecINN are jointly optimized with a stable, maximum likelihood-based trainingprocedure. By construction, the cINN does not experience mode collapse andgenerates diverse samples, in contrast to e.g. cGANs. At the same time ourmodel produces sharp images since no reconstruction loss is required, incontrast to e.g. VAEs. We demonstrate these properties for the tasks of MNISTdigit generation and image colorization. Furthermore, we take advantage of ourbi-directional cINN architecture to explore and manipulate emergent propertiesof the latent space, such as changing the image style in an intuitive way.

Quick Read (beta)

loading the full paper ...