Abstract
We explore novel approaches to the task of generating images from their respective captions, building on state-of-the-art GAN architectures. In particular, we use as our baseline the attention-based GAN, which learns attention mappings from words to image features. To better capture the features of the descriptions, we build a novel cyclic design that learns an inverse function mapping the image back to its original caption. Additionally, we incorporate recently developed BERT pretrained word embeddings as our initial text featurizer and observe a noticeable improvement in qualitative and quantitative performance compared to the attention GAN baseline.