Amplifying the Imitation Effect for Reinforcement Learning of UCAV's Mission Execution

  • 2019-01-17 15:47:12
  • Gyeong Taek Lee, Chang Ouk Kim
  • 1

Abstract

This paper proposes a new reinforcement learning (RL) algorithm that enhancesexploration by amplifying the imitation effect (AIE). This algorithm consistsof self-imitation learning and random network distillation algorithms. We arguethat these two algorithms complement each other and that combining these twoalgorithms can amplify the imitation effect for exploration. In addition, byadding an intrinsic penalty reward to the state that the RL agent frequentlyvisits and using replay memory for learning the feature state when using anexploration bonus, the proposed approach leads to deep exploration and deviatesfrom the current converged policy. We verified the exploration performance ofthe algorithm through experiments in a two-dimensional grid environment. Inaddition, we applied the algorithm to a simulated environment of unmannedcombat aerial vehicle (UCAV) mission execution, and the empirical results showthat AIE is very effective for finding the UCAV's shortest flight path to avoidan enemy's missiles.

 

Quick Read (beta)

loading the full paper ...