A Benchmark and Comparison of Active Learning for Logistic Regression

  • 2018-06-21 12:49:47
  • Yazhou Yang, Marco Loog
  • 0

Abstract

Logistic regression is by far the most widely used classifier in real-worldapplications. In this paper, we benchmark the state-of-the-art active learningmethods for logistic regression and discuss and illustrate their underlyingcharacteristics. Experiments are carried out on three synthetic datasets and 44real-world datasets, providing insight into the behaviors of these activelearning methods with respect to the area of the learning curve (which plotsclassification accuracy as a function of the number of queried examples) andtheir computational costs. Surprisingly, one of the earliest and simplestsuggested active learning methods, i.e., uncertainty sampling, performsexceptionally well overall. Another remarkable finding is that random sampling,which is the rudimentary baseline to improve upon, is not overwhelmed byindividual active learning techniques in many cases.

 

Quick Read (beta)

loading the full paper ...