ProLoNets: Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning

Abstract

Deep reinforcement learning has seen great success across a breadth of tasks,such as in game playing and robotic manipulation. However, the modern practiceof attempting to learn tabula rasa disregards the logical structure of manydomains and the wealth of readily available domain experts' knowledge thatcould help "warm start" the learning process. Further, learning fromdemonstration techniques are not yet efficient enough to infer this knowledgethrough sampling-based mechanisms in large state and action spaces. We presenta new reinforcement learning architecture that can encode expert knowledge, inthe form of propositional logic, directly into a neural, tree-like structure offuzzy propositions amenable to gradient descent and show that our novelarchitecture is able to outperform reinforcement and imitation learningtechniques across an array of reinforcement learning challenges.

Quick Read (beta)

loading the full paper ...