proScript: Partially Ordered Scripts Generation via Pre-trained Language Models

  • 2021-04-16 17:35:10
  • Keisuke Sakaguchi, Chandra Bhagavatula, Ronan Le Bras, Niket Tandon, Peter Clark, Yejin Choi
  • 0

Abstract

Scripts - standardized event sequences describing typical everyday activities- have been shown to help understand narratives by providing expectations,resolving ambiguity, and filling in unstated information. However, to date theyhave proved hard to author or extract from text. In this work, we demonstratefor the first time that pre-trained neural language models (LMs) can be befinetuned to generate high-quality scripts, at varying levels of granularity,for a wide range of everyday scenarios (e.g., bake a cake). To do this, wecollected a large (6.4k), crowdsourced partially ordered scripts (namedproScript), which is substantially larger than prior datasets, and developedmodels that generate scripts with combining language generation and structureprediction. We define two complementary tasks: (i) edge prediction: given ascenario and unordered events, organize the events into a valid (possiblypartial-order) script, and (ii) script generation: given only a scenario,generate events and organize them into a (possibly partial-order) script. Ourexperiments show that our models perform well (e.g., F1=75.7 in task (i)),illustrating a new approach to overcoming previous barriers to scriptcollection. We also show that there is still significant room for improvementtoward human level performance. Together, our tasks, dataset, and models offera new research direction for learning script knowledge.

 

Quick Read (beta)

loading the full paper ...