Classification from Pairwise Similarity and Unlabeled Data

  • 2018-02-12 22:35:38
  • Han Bao, Gang Niu, Masashi Sugiyama

Abstract

One of the biggest bottlenecks in supervised learning is its high labeling cost. To overcome this problem, we propose a new weakly-supervised learning setting called SU classification, where only similar (S) data pairs (two examples belong to the same class) and unlabeled (U) data are needed, instead of fully-supervised data. We show that an unbiased estimator of the classification risk can be obtained only from SU data, and its empirical risk minimizer achieves the optimal parametric convergence rate. Finally, we demonstrate the effectiveness of the proposed method through experiments.
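To make the SU setting concrete, the following is a minimal sketch of how such data might be simulated. It is not the authors' code: the function name `make_su_data`, the two-Gaussian class model, and the default class prior of 0.7 are all assumptions chosen for illustration. The only fact it relies on is that two independent draws share a class with probability π₊² + π₋², so a similar pair is positive-positive with probability π₊² / (π₊² + π₋²).

```python
import numpy as np


def make_su_data(n_similar, n_unlabeled, prior_pos=0.7, seed=0):
    """Simulate similar (S) pairs and unlabeled (U) points for a binary problem.

    Hypothetical setup: class +1 is a 2-D Gaussian centered at (+2, +2),
    class -1 is centered at (-2, -2), both with unit variance.
    """
    rng = np.random.default_rng(seed)

    def sample(labels):
        # Pick the class mean per example, then add unit Gaussian noise.
        means = np.where(labels[:, None] == 1, 2.0, -2.0)
        return means + rng.standard_normal((labels.size, 2))

    # Probability that two independent draws share a class.
    prior_similar = prior_pos**2 + (1.0 - prior_pos) ** 2
    # Given that a pair is similar, probability that both points are positive.
    p_pair_pos = prior_pos**2 / prior_similar

    # Each similar pair has one hidden class shared by both of its members.
    pair_labels = np.where(rng.random(n_similar) < p_pair_pos, 1, -1)
    x_s = sample(pair_labels)
    x_s_prime = sample(pair_labels)

    # Unlabeled points come from the full class mixture.
    u_labels = np.where(rng.random(n_unlabeled) < prior_pos, 1, -1)
    x_u = sample(u_labels)

    # pair_labels is returned for sanity checks only; an SU learner never sees it.
    return (x_s, x_s_prime), x_u, pair_labels
```

A learner in this setting would receive only `(x_s, x_s_prime)` and `x_u`; the hidden labels are returned purely so the simulation can be inspected.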

 
