Learning to Grasp from 2.5D images: a Deep Reinforcement Learning Approach

Abstract

In this paper, we propose a deep reinforcement learning (DRL) solution to the grasping problem using 2.5D images as the only source of information. In particular, we developed a simulated environment in which a robot equipped with a vacuum gripper must reach blocks with planar surfaces. These blocks can have different dimensions, shapes, positions, and orientations. Unity3D allowed us to simulate a real-world setup, where a depth camera is placed in a fixed position and the stream of images is used by our policy network to learn how to solve the task. We explored different DRL algorithms and problem configurations. The experiments demonstrated the effectiveness of the proposed DRL algorithm applied to grasp tasks guided by depth camera inputs. When using the proper policy, the proposed method estimates a robot tool configuration that reaches the object surface with negligible position and orientation errors. This is, to the best of our knowledge, the first successful attempt at using only 2.5D images as the input of a DRL algorithm to solve the grasping problem by regressing 3D world coordinates.
