Abstract
The capacity of meta-learning algorithms to quickly adapt to a variety of tasks, including ones they did not experience during meta-training, has been a key factor in the recent success of these methods on few-shot learning problems. This advantage of meta-learning over standard supervised or reinforcement learning is well founded only under the assumption that the adaptation phase does improve the performance of our model on the task of interest. However, in the classical framework of meta-learning, this constraint is only weakly enforced, if at all, and we only see an improvement on average over a distribution of tasks. In this paper, we show that the adaptation in an algorithm like MAML can significantly decrease the performance of an agent in a meta-reinforcement learning setting, even on a range of meta-training tasks.