Planning with Entity Chains for Abstractive Summarization

Abstract

Pre-trained transformer-based sequence-to-sequence models have become thego-to solution for many text generation tasks, including summarization.However, the results produced by these models tend to contain significantissues such as hallucinations and irrelevant passages. One solution to mitigatethese problems is to incorporate better content planning in neuralsummarization. We propose to use entity chains (i.e., chains of entitiesmentioned in the summary) to better plan and ground the generation ofabstractive summaries. In particular, we augment the target by prepending itwith its entity chain. We experimented with both pre-training and finetuningwith this content planning objective. When evaluated on CNN/DailyMail, SAMSumand XSum, models trained with this objective improved on entity correctness andsummary conciseness, and achieved state-of-the-art performance on ROUGE forSAMSum and XSum.

Quick Read (beta)

loading the full paper ...