Abstract
Intrinsically motivated goal exploration processes enable agents toautonomously sample goals to explore efficiently complex environments withhigh-dimensional continuous actions. They have been applied successfully toreal world robots to discover repertoires of policies producing a widediversity of effects. Often these algorithms relied on engineered goal spacesbut it was recently shown that one can use deep representation learningalgorithms to learn an adequate goal space in simple environments. However, inthe case of more complex environments containing multiple objects ordistractors, an efficient exploration requires that the structure of the goalspace reflects the one of the environment. In this paper we show that using adisentangled goal space leads to better exploration performances than anentangled goal space. We further show that when the representation isdisentangled, one can leverage it by sampling goals that maximize learningprogress in a modular manner. Finally, we show that the measure of learningprogress, used to drive curiosity-driven exploration, can be usedsimultaneously to discover abstract independently controllable features of theenvironment.