Reinforcement Learning for Improving Agent Design

  • 2018-10-09 02:32:37
  • David Ha

Abstract

In many reinforcement learning tasks, the goal is to learn a policy to manipulate an agent, whose design is fixed, to maximize some notion of cumulative reward. The design of the agent's physical structure is rarely optimized for the task at hand. In this work, we explore the possibility of learning a version of the agent's design that is better suited for its task, jointly with the policy. We propose a minor alteration to the OpenAI Gym framework, where we parameterize parts of an environment, and allow an agent to jointly learn to modify these environment parameters along with its policy. We demonstrate that an agent can learn a better structure of its body that is not only better suited for the task, but also facilitates policy learning. Joint learning of policy and structure may even uncover design principles that are useful for assisted-design applications. Videos of results are available at https://designrl.github.io/
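To make the idea of a parameterized environment concrete, here is a minimal sketch in plain Python. It is a toy illustration, not the paper's implementation: the names `ParamEnv`, `augment`, `rollout`, and `hill_climb` are hypothetical, the environment only mimics the Gym `reset`/`step` convention without importing Gym, and the optimizer is a simple random-search hill climber chosen for brevity (the abstract does not specify the learning algorithm). The point it shows is that design parameters and policy parameters can sit in one vector and be optimized together.

```python
import random


class ParamEnv:
    """Toy Gym-style environment whose 'body' depends on a design parameter."""

    def __init__(self, target=10.0, horizon=20):
        self.target = target
        self.horizon = horizon
        self.scale = 1.0  # design parameter: distance moved per unit action

    def augment(self, scale):
        # Hypothetical hook: apply design parameters before a rollout.
        # The design is clamped to a bounded range around the original.
        self.scale = min(max(scale, 0.5), 1.5)

    def reset(self):
        self.x = 0.0
        self.t = 0
        return self.x

    def step(self, action):
        action = max(-1.0, min(1.0, action))  # bounded actuator
        self.x += self.scale * action
        self.t += 1
        reward = -abs(self.target - self.x)   # closer to target is better
        done = self.t >= self.horizon
        return self.x, reward, done, {}


def rollout(env, params):
    """Run one episode; params = [policy gain, body scale]."""
    gain, scale = params
    env.augment(scale)
    obs = env.reset()
    total, done = 0.0, False
    while not done:
        action = gain * (env.target - obs)    # linear policy
        obs, reward, done, _ = env.step(action)
        total += reward
    return total


def hill_climb(env, params, iters=200, sigma=0.1, seed=0):
    """Jointly perturb policy and design; keep a candidate only if it scores higher."""
    rng = random.Random(seed)
    best = rollout(env, params)
    for _ in range(iters):
        cand = [p + rng.gauss(0.0, sigma) for p in params]
        score = rollout(env, cand)
        if score > best:
            params, best = cand, score
    return params, best


if __name__ == "__main__":
    env = ParamEnv()
    start = [0.1, 1.0]  # weak policy gain, original body scale
    print("initial return:", rollout(env, start))
    learned, best = hill_climb(env, start)
    print("learned [gain, scale]:", learned, "return:", best)
```

Because a candidate is accepted only when its return is strictly higher, the final return can never fall below the starting one. Whether the search ends up changing the body, the policy, or both depends on the task and on the bounds placed on the design parameters.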

 
