Semantic WordRank: Generating Finer Single-Document Summarizations

  • 2018-09-12 19:53:56
  • Hao Zhang, Jie Wang
  • 16

Abstract

We present Semantic WordRank (SWR), an unsupervised method for generating anextractive summary of a single document. Built on a weighted word graph withsemantic and co-occurrence edges, SWR scores sentences using anarticle-structure-biased PageRank algorithm with a Softplus functionadjustment, and promotes topic diversity using spectral subtopic clusteringunder the Word-Movers-Distance metric. We evaluate SWR on the DUC-02 andSummBank datasets and show that SWR produces better summaries than thestate-of-the-art algorithms over DUC-02 under common ROUGE measures. We thenshow that, under the same measures over SummBank, SWR outperforms each of thethree human annotators (aka. judges) and compares favorably with the combinedperformance of all judges.

 

Quick Read (beta)

loading the full paper ...