Self-Organizing Maps as a Storage and Transfer Mechanism in Reinforcement Learning

Abstract

The idea of reusing information from previously learned tasks (source tasks) for the learning of new tasks (target tasks) has the potential to significantly improve the sample efficiency of reinforcement learning agents. In this work, we describe an approach to concisely store and represent learned task knowledge, and reuse it by allowing it to guide the exploration of an agent while it learns new tasks. In order to do so, we use a measure of similarity that is defined directly in the space of parameterized representations of the value functions. This similarity measure is also used as a basis for a variant of the growing self-organizing map algorithm, which is simultaneously used to enable the storage of previously acquired task knowledge in an adaptive and scalable manner. We empirically validate our approach in a simulated navigation environment and discuss possible extensions to this approach along with potential applications where it could be particularly useful.
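
As a rough illustration of the storage idea described above, the following minimal sketch keeps a growing set of value-function weight vectors and decides whether to add a node or adapt an existing one based on similarity. It rests on several assumptions not stated in the abstract: cosine similarity stands in for the similarity measure, and the class `GrowingTaskMap` with its `grow_threshold` and `learning_rate` parameters is hypothetical. It also omits the neighbourhood update of a true self-organizing map, so it should be read as a simplified growing codebook rather than the authors' algorithm.

```python
import numpy as np


def cosine_similarity(w_a, w_b):
    """Similarity between two value-function weight vectors.

    Cosine similarity is an assumption made for this sketch.
    """
    denom = np.linalg.norm(w_a) * np.linalg.norm(w_b)
    return float(w_a @ w_b / denom) if denom > 0 else 0.0


class GrowingTaskMap:
    """Hypothetical store with one node per group of similar weight vectors."""

    def __init__(self, grow_threshold=0.5, learning_rate=0.1):
        self.nodes = []                       # stored weight vectors
        self.grow_threshold = grow_threshold  # below this similarity, add a node
        self.learning_rate = learning_rate    # step size when adapting a node

    def best_match(self, w):
        """Return (index, similarity) of the stored node most similar to w."""
        sims = [cosine_similarity(w, node) for node in self.nodes]
        idx = int(np.argmax(sims))
        return idx, sims[idx]

    def store(self, w):
        """Insert task knowledge w; return the index of the node that holds it."""
        w = np.asarray(w, dtype=float)
        if not self.nodes:
            self.nodes.append(w.copy())
            return 0
        idx, sim = self.best_match(w)
        if sim < self.grow_threshold:
            # Sufficiently novel task: grow the map with a new node.
            self.nodes.append(w.copy())
            return len(self.nodes) - 1
        # Familiar task: move the best-matching node towards w.
        self.nodes[idx] += self.learning_rate * (w - self.nodes[idx])
        return idx
```

In a setup like this, the node returned by `best_match` for a partially learned target task could be the one whose stored knowledge is used to guide exploration, although how that guidance is carried out is not specified here.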
