Causal Representation Learning from Multiple Distributions: A General Setting

Abstract

In many problems, the measured variables (e.g., image pixels) are justmathematical functions of the hidden causal variables (e.g., the underlyingconcepts or objects). For the purpose of making predictions in changingenvironments or making proper changes to the system, it is helpful to recoverthe hidden causal variables $Z_i$ and their causal relations represented bygraph $\mathcal{G}_Z$. This problem has recently been known as causalrepresentation learning. This paper is concerned with a general, completelynonparametric setting of causal representation learning from multipledistributions (arising from heterogeneous data or nonstationary time series),without assuming hard interventions behind distribution changes. We aim todevelop general solutions in this fundamental case; as a by product, this helpssee the unique benefit offered by other assumptions such as parametric causalmodels or hard interventions. We show that under the sparsity constraint on therecovered graph over the latent variables and suitable sufficient changeconditions on the causal influences, interestingly, one can recover themoralized graph of the underlying directed acyclic graph, and the recoveredlatent variables and their relations are related to the underlying causal modelin a specific, nontrivial way. In some cases, each latent variable can even berecovered up to component-wise transformations. Experimental results verify ourtheoretical claims.

Quick Read (beta)

loading the full paper ...