Abstract
In reinforcement learning, an agent learns to reach a set of goals by meansof an external reward signal. In the natural world, intelligent organisms learnfrom internal drives, bypassing the need for external signals, which isbeneficial for a wide range of tasks. Motivated by this observation, we proposeto formulate an intrinsic objective as the mutual information between the goalstates and the controllable states. This objective encourages the agent to takecontrol of its environment. Subsequently, we derive a surrogate objective ofthe proposed reward function, which can be optimized efficiently. Lastly, weevaluate the developed framework in different robotic manipulation andnavigation tasks and demonstrate the efficacy of our approach. A video showingexperimental results is available at \url{https://youtu.be/CT4CKMWBYz0}.