Wield: Systematic Reinforcement Learning With Progressive Randomization

  • 2019-09-15 17:36:26
  • Michael Schaarschmidt, Kai Fricke, Eiko Yoneki
  • 2

Abstract

Reinforcement learning frameworks have introduced abstractions to implementand execute algorithms at scale. They assume standardized simulator interfacesbut are not concerned with identifying suitable task representations. Wepresent Wield, a first-of-its kind system to facilitate task design forpractical reinforcement learning. Through software primitives, Wield enablespractitioners to decouple system-interface and deployment-specificconfiguration from state and action design. To guide experimentation, Wieldfurther introduces a novel task design protocol and classification schemecentred around staged randomization to incrementally evaluate modelcapabilities.

 

Quick Read (beta)

loading the full paper ...