Autoencoders and Probabilistic Inference with Missing Data: An Exact Solution for The Factor Analysis Case

Abstract

Latent variable models can be used to probabilistically "fill in" missing data entries. The variational autoencoder architecture (Kingma and Welling, 2014; Rezende et al., 2014) includes a "recognition" or "encoder" network that infers the latent variables given the data variables. However, it is not clear how to handle missing data variables in this network. We show how to calculate exactly the latent posterior distribution for the factor analysis (FA) model in the presence of missing data, and note that this solution exhibits a non-trivial dependence on the pattern of missingness. Experiments compare the effectiveness of various approaches to filling in the missing data.
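
As an illustration of the kind of computation the abstract refers to, the following is a minimal sketch of the textbook Gaussian posterior for a factor analysis model when some data entries are missing. It assumes the standard FA model z ~ N(0, I), x | z ~ N(Wz + mu, diag(psi)) and marks missing entries with NaN; the function names `fa_posterior` and `impute`, the notation, and the NaN convention are our own assumptions and are not taken from the paper.

```python
import numpy as np

def fa_posterior(x, W, mu, psi):
    """Posterior mean and covariance of z given the observed entries of x.

    Assumed model (standard FA; notation is illustrative, not the paper's):
        z ~ N(0, I_K),  x | z ~ N(W z + mu, diag(psi)).
    Missing entries of x are marked with NaN.
    """
    obs = ~np.isnan(x)               # boolean mask of observed dimensions
    W_o = W[obs]                     # rows of W for observed dims, shape (D_o, K)
    psi_o = psi[obs]                 # noise variances of observed dims
    K = W.shape[1]
    # Posterior precision: I + W_o^T Psi_o^{-1} W_o
    precision = np.eye(K) + W_o.T @ (W_o / psi_o[:, None])
    cov = np.linalg.inv(precision)
    # Posterior mean: cov @ W_o^T Psi_o^{-1} (x_o - mu_o)
    mean = cov @ (W_o.T @ ((x[obs] - mu[obs]) / psi_o))
    return mean, cov

def impute(x, W, mu, psi):
    """Fill missing entries with their predictive mean W E[z | x_o] + mu."""
    mean, _ = fa_posterior(x, W, mu, psi)
    x_hat = W @ mean + mu
    return np.where(np.isnan(x), x_hat, x)
```

In this sketch only the rows of W belonging to observed dimensions enter the posterior, so both the mean and the covariance change with the missingness mask, not merely with the observed values. When every entry is missing, the posterior reduces to the prior N(0, I).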
