Multi-focus Attention Network for Efficient Deep Reinforcement Learning

Abstract

Deep reinforcement learning (DRL) has shown remarkable performance in learning various tasks at the human level. However, unlike human perception, current DRL models connect the entire low-level sensory input to the state-action values rather than exploiting the relationships between and among the entities that constitute the sensory input. Because of this difference, DRL needs a vast number of experience samples to learn. In this paper, we propose a Multi-focus Attention Network (MANet), which mimics the human ability to spatially abstract the low-level sensory input into multiple entities and attend to them simultaneously. The proposed method first divides the low-level input into several segments, which we refer to as partial states. After this segmentation, parallel attention layers attend to the partial states relevant to solving the task. Our model estimates state-action values using these attended partial states. In our experiments, MANet attains the highest scores with significantly fewer experience samples. Additionally, the model outperforms the Deep Q-network and the single-attention model used as benchmarks. Furthermore, we extend our model to an attentive communication model for performing multi-agent cooperative tasks. In multi-agent cooperative task experiments, our model shows 20% faster learning than the existing state-of-the-art model.
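The pipeline described in the abstract (segment the input into partial states, attend to them with parallel attention layers, estimate state-action values from the attended result) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the input is segmented or how attention scores are computed, so the non-overlapping square patches, the dot-product scoring against one query vector per attention layer, the linear output layer, and every name below (`segment`, `attend`, `q_values`, `queries`, `out_weights`) are assumptions made for illustration, with random values standing in for learned parameters.

```python
import math
import random


def segment(grid, patch):
    """Split a 2-D grid into flattened, non-overlapping patch x patch partial states.

    Assumption: grid height and width are multiples of `patch`.
    """
    h, w = len(grid), len(grid[0])
    parts = []
    for r in range(0, h, patch):
        for c in range(0, w, patch):
            parts.append([grid[r + i][c + j]
                          for i in range(patch) for j in range(patch)])
    return parts


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attend(parts, query):
    """One attention layer: weight each partial state by its score against `query`.

    Assumption: plain dot-product scoring; the paper may use a different form.
    Returns the attention weights and the weighted sum of partial states.
    """
    weights = softmax([dot(p, query) for p in parts])
    dim = len(parts[0])
    context = [sum(w * p[k] for w, p in zip(weights, parts)) for k in range(dim)]
    return weights, context


def q_values(grid, patch, queries, out_weights):
    """Estimate one value per action from the concatenated attended partial states."""
    parts = segment(grid, patch)
    features = []
    for query in queries:  # one query per parallel attention layer
        _, context = attend(parts, query)
        features.extend(context)
    return [dot(features, w) for w in out_weights]


# Toy usage with random (untrained) parameters.
rng = random.Random(0)
grid = [[rng.random() for _ in range(4)] for _ in range(4)]            # 4x4 input
queries = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(2)]      # 2 attention layers
out_weights = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(3)]  # 3 actions
q = q_values(grid, 2, queries, out_weights)
```

In this sketch each attention layer can place its weight on a different partial state, which is the sense in which the model attends to several entities at once; in a trained model the queries and output weights would be learned from reward rather than sampled.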
