Abstract
In this paper, targeting to understand the underlying explainable factorsbehind observations and modeling the conditional generation process on thesefactors, we propose a new task, disentanglement of diffusion probabilisticmodels (DPMs), to take advantage of the remarkable modeling ability of DPMs. Totackle this task, we further devise an unsupervised approach named DisDiff. Forthe first time, we achieve disentangled representation learning in theframework of diffusion probabilistic models. Given a pre-trained DPM, DisDiffcan automatically discover the inherent factors behind the image data anddisentangle the gradient fields of DPM into sub-gradient fields, eachconditioned on the representation of each discovered factor. We propose a novelDisentangling Loss for DisDiff to facilitate the disentanglement of therepresentation and sub-gradients. The extensive experiments on synthetic andreal-world datasets demonstrate the effectiveness of DisDiff.