Abstract
Conditional image generative models hold considerable promise to produceinfinite amounts of synthetic training data. Yet, recent progress in generationquality has come at the expense of generation diversity, limiting the utilityof these models as a source of synthetic training data. Although guidance-basedapproaches have been introduced to improve the utility of generated data byfocusing on quality or diversity, the (implicit or explicit) utility functionsoftentimes disregard the potential distribution shift between synthetic andreal data. In this work, we introduce Chamfer Guidance: a training-freeguidance approach which leverages a handful of real exemplar images tocharacterize the quality and diversity of synthetic data. We show that byleveraging the proposed Chamfer Guidance, we can boost the diversity of thegenerations w.r.t. a dataset of real images while maintaining or improving thegeneration quality on ImageNet-1k and standard geo-diversity benchmarks. Ourapproach achieves state-of-the-art few-shot performance with as little as 2exemplar real images, obtaining 96.4% in terms of precision, and 86.4% in termsof distributional coverage, which increase to 97.5% and 92.7%, respectively,when using 32 real images. We showcase the benefits of the Chamfer Guidancegeneration by training downstream image classifiers on synthetic data,achieving accuracy boost of up to 15% for in-distribution over the baselines,and up to 16% in out-of-distribution. Furthermore, our approach does notrequire using the unconditional model, and thus obtains a 31% reduction inFLOPs w.r.t. classifier-free-guidance-based approaches at sampling time.