Relational inductive biases, deep learning, and graph networks

  • 2018-06-04 17:58:18
  • Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals, Yujia Li, Razvan Pascanu
  • 296

Abstract

Artificial intelligence (AI) has undergone a renaissance recently, makingmajor progress in key domains such as vision, language, control, anddecision-making. This has been due, in part, to cheap data and cheap computeresources, which have fit the natural strengths of deep learning. However, manydefining characteristics of human intelligence, which developed under muchdifferent pressures, remain out of reach for current approaches. In particular,generalizing beyond one's experiences--a hallmark of human intelligence frominfancy--remains a formidable challenge for modern AI. The following is part position paper, part review, and part unification. Weargue that combinatorial generalization must be a top priority for AI toachieve human-like abilities, and that structured representations andcomputations are key to realizing this objective. Just as biology uses natureand nurture cooperatively, we reject the false choice between"hand-engineering" and "end-to-end" learning, and instead advocate for anapproach which benefits from their complementary strengths. We explore howusing relational inductive biases within deep learning architectures canfacilitate learning about entities, relations, and rules for composing them. Wepresent a new building block for the AI toolkit with a strong relationalinductive bias--the graph network--which generalizes and extends variousapproaches for neural networks that operate on graphs, and provides astraightforward interface for manipulating structured knowledge and producingstructured behaviors. We discuss how graph networks can support relationalreasoning and combinatorial generalization, laying the foundation for moresophisticated, interpretable, and flexible patterns of reasoning.

 

Introduction (beta)

None

 

Conclusion (beta)

None