Teach me how to Label: Labeling Functions from Natural Language with Text-to-text Transformers

  • 2021-01-18 16:04:15
  • Yannis Papanikolaou

Abstract

Annotated data has become the most important bottleneck in training accurate machine learning models, especially for areas that require domain expertise. A recent approach to deal with the above issue proposes using natural language explanations instead of labeling individual data points, thereby increasing human annotators' efficiency as well as decreasing costs substantially. This paper focuses on the task of turning these natural language descriptions into Python labeling functions by following a novel approach to semantic parsing with pre-trained text-to-text Transformers. In a series of experiments our approach achieves a new state of the art on the semantic parsing benchmark CoNaLa, surpassing the previous best approach by 3.7 BLEU points. Furthermore, on a manually constructed dataset of natural language descriptions-labeling functions pairs we achieve a BLEU of 0.39. Our approach can be regarded as a stepping stone towards models that are taught how to label in natural language, instead of being provided specific labeled samples. Our code, constructed dataset and models are available at https://github.com/ypapanik/t5-for-code-generation.
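
To make the task concrete, the following is a minimal sketch of what one description-labeling function pair might look like. It is an illustration only and is not taken from the paper's dataset: the description text, the label constants (`SPAM`, `ABSTAIN`) and the function name `lf_contains_link` are all hypothetical.

```python
import re

# Hypothetical label values; the paper's dataset may use a different convention.
SPAM = 1
ABSTAIN = -1

# Hypothetical natural language description (the model input):
DESCRIPTION = "Label the message as spam if it contains a link."


def lf_contains_link(text: str) -> int:
    """Hypothetical target labeling function for DESCRIPTION.

    Returns SPAM when the text contains an http(s) URL, otherwise abstains.
    """
    return SPAM if re.search(r"https?://", text) else ABSTAIN
```

Under this framing, a text-to-text model would receive the description as its input sequence and be trained to emit the source code of the function as its output sequence.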

 
