Abstract
Neural abstractive summarization models are able to generate summaries whichhave high overlap with human references. However, existing models are notoptimized for factual correctness, a critical metric in real-worldapplications. In this work, we propose to evaluate the factual correctness of agenerated summary by fact-checking it against its reference using aninformation extraction module. We further propose a training strategy whichoptimizes a neural summarization model with a factual correctness reward viareinforcement learning. We apply the proposed method to the summarization ofradiology reports, where factual correctness is a key requirement. On twoseparate datasets collected from real hospitals, we show via both automatic andhuman evaluation that the proposed approach substantially improves the factualcorrectness and overall quality of outputs from a competitive neuralsummarization system.