Wield: Systematic Reinforcement Learning With Progressive Randomization

Abstract

Reinforcement learning frameworks have introduced abstractions to implementand execute algorithms at scale. They assume standardized simulator interfacesbut are not concerned with identifying suitable task representations. Wepresent Wield, a first-of-its kind system to facilitate task design forpractical reinforcement learning. Through software primitives, Wield enablespractitioners to decouple system-interface and deployment-specificconfiguration from state and action design. To guide experimentation, Wieldfurther introduces a novel task design protocol and classification schemecentred around staged randomization to incrementally evaluate modelcapabilities.

Quick Read (beta)

loading the full paper ...