Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics

Abstract

The learning process of a reinforcement learning (RL) agent remains poorlyunderstood beyond the mathematical formulation of its learning algorithm. Toaddress this gap, we introduce attention-oriented metrics (ATOMs) toinvestigate the development of an RL agent's attention during training. In acontrolled experiment, we tested ATOMs on three variations of a Pong game, eachdesigned to teach the agent distinct behaviours, complemented by a behaviouralassessment. ATOMs successfully delineate the attention patterns of an agenttrained on each game variation, and that these differences in attentionpatterns translate into differences in the agent's behaviour. Throughcontinuous monitoring of ATOMs during training, we observed that the agent'sattention developed in phases, and that these phases were consistent acrossgame variations. Overall, we believe that ATOM could help improve ourunderstanding of the learning processes of RL agents and better understand therelationship between attention and learning.

Quick Read (beta)

loading the full paper ...