Dynamic Classifier-Free Diffusion Guidance via Online Feedback

  • 2025-09-19 16:27:19
  • Pinelopi Papalampidi, Olivia Wiles, Ira Ktena, Aleksandar Shtedritski, Emanuele Bugliarello, Ivana Kajic, Isabela Albuquerque, Aida Nematzadeh
  • 0

Abstract

Classifier-free guidance (CFG) is a cornerstone of text-to-image diffusionmodels, yet its effectiveness is limited by the use of static guidance scales.This "one-size-fits-all" approach fails to adapt to the diverse requirements ofdifferent prompts; moreover, prior solutions like gradient-based correction orfixed heuristic schedules introduce additional complexities and fail togeneralize. In this work, we challeng this static paradigm by introducing aframework for dynamic CFG scheduling. Our method leverages online feedback froma suite of general-purpose and specialized small-scale latent-spaceevaluations, such as CLIP for alignment, a discriminator for fidelity and ahuman preference reward model, to assess generation quality at each step of thereverse diffusion process. Based on this feedback, we perform a greedy searchto select the optimal CFG scale for each timestep, creating a unique guidanceschedule tailored to every prompt and sample. We demonstrate the effectivenessof our approach on both small-scale models and the state-of-the-art Imagen 3,showing significant improvements in text alignment, visual quality, textrendering and numerical reasoning. Notably, when compared against the defaultImagen 3 baseline, our method achieves up to 53.8% human preference win-ratefor overall preference, a figure that increases up to to 55.5% on promptstargeting specific capabilities like text rendering. Our work establishes thatthe optimal guidance schedule is inherently dynamic and prompt-dependent, andprovides an efficient and generalizable framework to achieve it.

 

Quick Read (beta)

loading the full paper ...