Structured Prediction as Translation between Augmented Natural Languages

Abstract

We propose a new framework, Translation between Augmented Natural Languages (TANL), to solve many structured prediction language tasks including joint entity and relation extraction, nested named entity recognition, relation classification, semantic role labeling, event extraction, coreference resolution, and dialogue state tracking. Instead of tackling the problem by training task-specific discriminative classifiers, we frame it as a translation task between augmented natural languages, from which the task-relevant information can be easily extracted. Our approach can match or outperform task-specific models on all tasks, and in particular, achieves new state-of-the-art results on joint entity and relation extraction (CoNLL04, ADE, NYT, and ACE2005 datasets), relation classification (FewRel and TACRED), and semantic role labeling (CoNLL-2005 and CoNLL-2012). We accomplish this while using the same architecture and hyperparameters for all tasks and even when training a single model to solve all tasks at the same time (multi-task learning). Finally, we show that our framework can also significantly improve the performance in a low-resource regime, thanks to better use of label semantics.
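
To illustrate what extracting task-relevant information from an augmented natural language output could look like, the following is a minimal sketch and not the paper's implementation. The abstract does not specify the output format, so the bracketed `[ span | label ]` syntax, the `decode` function, and the example sentence are all assumptions made here for illustration; nested annotations and relations are not handled.

```python
import re

# Assumed (hypothetical) annotation syntax: [ span | label ]
# The abstract does not define the format; this pattern is illustrative only.
ANNOTATION = re.compile(r"\[\s*([^\[\]|]+?)\s*\|\s*([^\[\]|]+?)\s*\]")


def decode(augmented: str):
    """Return (plain_text, entities) recovered from an augmented sentence.

    entities is a list of (span, label) pairs in order of appearance.
    """
    entities = []

    def strip_annotation(match):
        span, label = match.group(1), match.group(2)
        entities.append((span, label))
        return span  # keep only the surface text in the plain sentence

    plain = ANNOTATION.sub(strip_annotation, augmented)
    return plain, entities


# Invented example sentence, standing in for a model's output.
augmented = "[ Boston | location ] is home to [ Jane Doe | person ]."
plain, entities = decode(augmented)
print(plain)
print(entities)
```

Under these assumptions, decoding is a single pass over the generated text: the labels are read off the annotations and the original sentence is recovered by dropping the markup.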
