Abstract
We study offline multitask representation learning in reinforcement learning(RL), where a learner is provided with an offline dataset from different tasksthat share a common representation and is asked to learn the sharedrepresentation. We theoretically investigate offline multitask low-rank RL, andpropose a new algorithm called MORL for offline multitask representationlearning. Furthermore, we examine downstream RL in reward-free, offline andonline scenarios, where a new task is introduced to the agent that shares thesame representation as the upstream offline tasks. Our theoretical resultsdemonstrate the benefits of using the learned representation from the upstreamoffline task instead of directly learning the representation of the low-rankmodel.