Cycle Text-To-Image GAN with BERT

Abstract

We explore novel approaches to the task of generating images from their respective captions, building on state-of-the-art GAN architectures. In particular, we take as our baseline Attention-based GANs, which learn attention mappings from words to image features. To better capture the features of the descriptions, we then build a novel cyclic design that learns an inverse function mapping the image back to the original caption. Additionally, we incorporate recently developed BERT pretrained word embeddings as our initial text featurizer and observe a noticeable improvement in qualitative and quantitative performance compared to the Attention GAN baseline.
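The cyclic design can be illustrated with a minimal sketch. It rests on assumptions that the abstract does not confirm: that the inverse (image-to-caption) model outputs one probability distribution over the vocabulary per caption position, that the cycle term is the mean negative log-likelihood of the original caption tokens, and that it is added to the adversarial loss with a weight. The names `caption_cycle_loss`, `generator_loss`, and `cycle_weight` are hypothetical.

```python
import math


def caption_cycle_loss(token_probs, caption_ids, eps=1e-12):
    """Mean negative log-likelihood of the original caption tokens.

    token_probs: one probability distribution over the vocabulary per
        caption position, as a hypothetical image-to-caption model might
        produce when applied to the generated image (assumption).
    caption_ids: token ids of the original caption, same length.
    """
    if not caption_ids:
        raise ValueError("caption_ids must not be empty")
    if len(token_probs) != len(caption_ids):
        raise ValueError("token_probs and caption_ids must have equal length")
    nll = 0.0
    for probs, token_id in zip(token_probs, caption_ids):
        # eps guards against log(0) when a token gets zero probability
        nll -= math.log(probs[token_id] + eps)
    return nll / len(caption_ids)


def generator_loss(adversarial_loss, cycle_loss, cycle_weight=1.0):
    # cycle_weight is an assumed hyperparameter, not a value from the paper
    return adversarial_loss + cycle_weight * cycle_loss
```

Under these assumptions, a generated image from which the caption can be recovered exactly incurs almost no cycle penalty, while an image that leaves the inverse model guessing uniformly over a vocabulary of size V incurs a penalty of about log V per token.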
