Accelerated Diffusion Models via Speculative Sampling

  • 2025-01-09 16:50:16
  • Valentin De Bortoli, Alexandre Galashov, Arthur Gretton, Arnaud Doucet

Abstract

Speculative sampling is a popular technique for accelerating inference in Large Language Models by generating candidate tokens using a fast draft model and accepting or rejecting them based on the target model's distribution. While speculative sampling was previously limited to discrete sequences, we extend it to diffusion models, which generate samples via continuous, vector-valued Markov chains. In this context, the target model is a high-quality but computationally expensive diffusion model. We propose various drafting strategies, including a simple and effective approach that does not require training a draft model and is applicable out of the box to any diffusion model. Our experiments demonstrate significant generation speedup on various diffusion models, halving the number of function evaluations, while generating exact samples from the target model.
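
For readers unfamiliar with the accept/reject mechanism the abstract refers to, the following is a minimal sketch of the standard speculative sampling step in the discrete-token (language model) setting. It is not the continuous diffusion extension proposed in the paper, whose details are not given here. The function name `speculative_accept` and the arguments `p` (target distribution) and `q` (draft distribution) are illustrative assumptions, not names from the paper.

```python
import random


def speculative_accept(p, q, rng):
    """One accept/reject step for a single discrete token (illustrative sketch).

    p: target model probabilities over the vocabulary (list of floats)
    q: draft model probabilities over the same vocabulary
    rng: a random.Random instance
    Returns (token, accepted); the token is distributed according to p.
    """
    vocab = range(len(q))
    # The cheap draft model proposes a candidate token.
    x = rng.choices(vocab, weights=q, k=1)[0]
    # Accept the candidate with probability min(1, p[x] / q[x]).
    if rng.random() < min(1.0, p[x] / q[x]):
        return x, True
    # On rejection, resample from the residual max(0, p - q);
    # rng.choices normalizes the weights internally.
    residual = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
    x = rng.choices(vocab, weights=residual, k=1)[0]
    return x, False
```

Under these assumptions, the returned token follows the target distribution `p` exactly, and the candidate is accepted with probability equal to the sum over tokens of `min(p[i], q[i])`, so a draft model closer to the target is accepted more often.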

 
