Abstract
Reinforcement learning frameworks have introduced abstractions to implementand execute algorithms at scale. They assume standardized simulator interfacesbut are not concerned with identifying suitable task representations. Wepresent Wield, a first-of-its kind system to facilitate task design forpractical reinforcement learning. Through software primitives, Wield enablespractitioners to decouple system-interface and deployment-specificconfiguration from state and action design. To guide experimentation, Wieldfurther introduces a novel task design protocol and classification schemecentred around staged randomization to incrementally evaluate modelcapabilities.
Quick Read (beta)
loading the full paper ...