GPS as a Control Signal for Image Generation

  • 2025-01-22 05:07:28
  • Chao Feng, Ziyang Chen, Aleksander Holynski, Alexei A. Efros, Andrew Owens
  • 0

Abstract

We show that the GPS tags contained in photo metadata provide a usefulcontrol signal for image generation. We train GPS-to-image models and use themfor tasks that require a fine-grained understanding of how images vary within acity. In particular, we train a diffusion model to generate images conditionedon both GPS and text. The learned model generates images that capture thedistinctive appearance of different neighborhoods, parks, and landmarks. Wealso extract 3D models from 2D GPS-to-image models through score distillationsampling, using GPS conditioning to constrain the appearance of thereconstruction from each viewpoint. Our evaluations suggest that ourGPS-conditioned models successfully learn to generate images that vary based onlocation, and that GPS conditioning improves estimated 3D structure.

 

Quick Read (beta)

loading the full paper ...