Meta-Learning Transferable Active Learning Policies by Deep Reinforcement Learning

  • 2018-06-12 23:52:08
  • Kunkun Pang, Mingzhi Dong, Yang Wu, Timothy Hospedales
  • 2

Abstract

Active learning (AL) aims to enable training high performance classifierswith low annotation cost by predicting which subset of unlabelled instanceswould be most beneficial to label. The importance of AL has motivated extensiveresearch, proposing a wide variety of manually designed AL algorithms withdiverse theoretical and intuitive motivations. In contrast to this body ofresearch, we propose to treat active learning algorithm design as ameta-learning problem and learn the best criterion from data. We model anactive learning algorithm as a deep neural network that inputs the base learnerstate and the unlabelled point set and predicts the best point to annotatenext. Training this active query policy network with reinforcement learning,produces the best non-myopic policy for a given dataset. The key challenge inachieving a general solution to AL then becomes that of learner generalisation,particularly across heterogeneous datasets. We propose a multi-taskdataset-embedding approach that allows dataset-agnostic active learners to betrained. Our evaluation shows that AL algorithms trained in this way candirectly generalise across diverse problems.

 

Quick Read (beta)

loading the full paper ...