Learning Neural Causal Models from Unknown Interventions

  • 2019-10-02 16:50:15
  • Nan Rosemary Ke, Olexa Bilaniuk, Anirudh Goyal, Stefan Bauer, Hugo Larochelle, Chris Pal, Yoshua Bengio
  • 60

Abstract

Meta-learning over a set of distributions can be interpreted as learningdifferent types of parameters corresponding to short-term vs long-term aspectsof the mechanisms underlying the generation of data. These are respectivelycaptured by quickly-changing parameters and slowly-changing meta-parameters. Wepresent a new framework for meta-learning causal models where the relationshipbetween each variable and its parents is modeled by a neural network, modulatedby structural meta-parameters which capture the overall topology of a directedgraphical model. Our approach avoids a discrete search over models in favour ofa continuous optimization procedure. We study a setting where interventionaldistributions are induced as a result of a random intervention on a singleunknown variable of an unknown ground truth causal model, and the observationsarising after such an intervention constitute one meta-example. To disentanglethe slow-changing aspects of each conditional from the fast-changingadaptations to each intervention, we parametrize the neural network into fastparameters and slow meta-parameters. We introduce a meta-learning objectivethat favours solutions robust to frequent but sparse interventionaldistribution change, and which generalize well to previously unseeninterventions. Optimizing this objective is shown experimentally to recover thestructure of the causal graph.

 

Quick Read (beta)

loading the full paper ...