Abstract
We introduce a new dataset of 293,008 high definition (1360 x 1360 pixels) fashion images paired with item descriptions provided by professional stylists. Each item is photographed from a variety of angles. We provide baseline results on 1) high-resolution image generation, and 2) image generation conditioned on the given text descriptions. We invite the community to improve upon these baselines. In this paper, we also outline the details of a challenge that we are launching based upon this dataset.