Abstract
Visual Reinforcement Learning is a popular and powerful framework that takesfull advantage of the Deep Learning breakthrough. It is known that variationsin input domains (e.g., different panorama colors due to seasonal changes) ortask domains (e.g., altering the target speed of a car) can disrupt agentperformance, necessitating new training for each variation. Recent advancementsin the field of representation learning have demonstrated the possibility ofcombining components from different neural networks to create new models in azero-shot fashion. In this paper, we build upon relative representations, aframework that maps encoder embeddings to a universal space. We adapt thisframework to the Visual Reinforcement Learning setting, allowing to combineagents components to create new agents capable of effectively handling novelvisual-task pairs not encountered during training. Our findings highlight thepotential for model reuse, significantly reducing the need for retraining and,consequently, the time and computational resources required.