Can Giraffes Become Birds? An Evaluation of Image-to-image Translation for Data Generation

  • 2020-01-10 19:29:11
  • Daniel V. Ruiz, Gabriel Salomon, Eduardo Todt
  • 25

Abstract

There is an increasing interest in image-to-image translation withapplications ranging from generating maps from satellite images to creatingentire clothes' images from only contours. In the present work, we investigateimage-to-image translation using Generative Adversarial Networks (GANs) forgenerating new data, taking as a case study the morphing of giraffes imagesinto bird images. Morphing a giraffe into a bird is a challenging task, as theyhave different scales, textures, and morphology. An unsupervised cross-domaintranslator entitled InstaGAN was trained on giraffes and birds, along withtheir respective masks, to learn translation between both domains. A dataset ofsynthetic bird images was generated using translation from originally giraffeimages while preserving the original spatial arrangement and background. It isimportant to stress that the generated birds do not exist, being only theresult of a latent representation learned by InstaGAN. Two subsets of commonliterature datasets were used for training the GAN and generating thetranslated images: COCO and Caltech-UCSD Birds 200-2011. To evaluate therealness and quality of the generated images and masks, qualitative andquantitative analyses were made. For the quantitative analysis, a pre-trainedMask R-CNN was used for the detection and segmentation of birds on Pascal VOC,Caltech-UCSD Birds 200-2011, and our new dataset entitled FakeSet. Thegenerated dataset achieved detection and segmentation results close to the realdatasets, suggesting that the generated images are realistic enough to bedetected and segmented by a state-of-the-art deep neural network.

 

Quick Read (beta)

loading the full paper ...