Abstract
We consider the problem of modelling the effects of unseen perturbations suchas gene knockdowns or drug combinations on low-level measurements such as RNAsequencing data. Specifically, given data collected under some perturbations,we aim to predict the distribution of measurements for new perturbations. Toaddress this challenging extrapolation task, we posit that perturbations actadditively in a suitable, unknown embedding space. More precisely, we formulatethe generative process underlying the observed data as a latent variable model,in which perturbations amount to mean shifts in latent space and can becombined additively. Unlike previous work, we prove that, given sufficientlydiverse training perturbations, the representation and perturbation effects areidentifiable up to affine transformation, and use this to characterize theclass of unseen perturbations for which we obtain extrapolation guarantees. Toestimate the model from data, we propose a new method, the perturbationdistribution autoencoder (PDAE), which is trained by maximising thedistributional similarity between true and predicted perturbationdistributions. The trained model can then be used to predict previously unseenperturbation distributions. Empirical evidence suggests that PDAE comparesfavourably to existing methods and baselines at predicting the effects ofunseen perturbations.