Abstract
We develop amortized population Gibbs (APG) samplers, a new class of autoencoding variational methods for deep probabilistic models. APG samplers construct high-dimensional proposals by iterating over updates to lower-dimensional blocks of variables. Each conditional update is a neural proposal, which we train by minimizing the inclusive KL divergence relative to the conditional posterior. To appropriately account for the size of the input data, we develop a new parameterization in terms of neural sufficient statistics, resulting in quasi-conjugate variational approximations. Experiments demonstrate that learned proposals converge to the known analytical conditional posterior in conjugate models, and that APG samplers can learn inference networks for highly-structured deep generative models when the conditional posteriors are intractable. Here APG samplers offer a path toward scaling up stochastic variational methods to models in which standard autoencoding architectures fail to produce accurate samples.