Abstract
We consider the problem of transferring policies to the real world by training on a distribution of simulated scenarios. Rather than manually tuning the randomization of simulations, we adapt the simulation parameter distribution using a few real world roll-outs interleaved with policy training. In doing so, we are able to change the distribution of simulations to improve the policy transfer by matching the policy behavior in simulation and the real world. We show that policies trained with our method are able to reliably transfer to different robots in two real world tasks: swing-peg-in-hole and opening a cabinet drawer. The video of our experiments can be found at https://sites.google.com/view/simopt