Abstract
Landing an unmanned aerial vehicle (UAV) on a ground marker is an openproblem despite the effort of the research community. Previous attempts mostlyfocused on the analysis of hand-crafted geometric features and the use ofexternal sensors in order to allow the vehicle to approach the land-pad. Inthis article, we propose a method based on deep reinforcement learning thatonly requires low-resolution images taken from a down-looking camera in orderto identify the position of the marker and land the UAV on it. The proposedapproach is based on a hierarchy of Deep Q-Networks (DQNs) used as high-levelcontrol policy for the navigation toward the marker. We implemented differenttechnical solutions, such as the combination of vanilla and double DQNs trainedusing a partitioned buffer replay.The results show that policies trained onuniform textures can accomplish autonomous landing on a large variety ofsimulated environments. The overall performance is comparable with astate-of-the-art algorithm and human pilots.