Reducing Hallucinations in Summarization via Reinforcement Learning with Entity Hallucination Index

Abstract

Reducing hallucinations in abstractive summarization remains a criticalchallenge for deploying language models (LMs) in real-world settings. In thiswork, we introduce a rewarddriven fine-tuning framework that explicitlyoptimizes for Entity Hallucination Index (EHI), a metric designed to quantifythe presence, correctness, and grounding of named entities in generatedsummaries. Given a corpus of meeting transcripts, we first generate baselinesummaries using a pre-trained LM and compute EHI scores via automatic entityextraction and matching. We then apply reinforcement learning to fine-tune themodel parameters, using EHI as a reward signal to bias generation towardentity-faithful outputs. Our approach does not rely on human-written factualityannotations, enabling scalable fine-tuning. Experiments demonstrate consistentimprovements in EHI across datasets, with qualitative analysis revealing asignificant reduction in entity-level hallucinations without degradation influency or informativeness. We release a reproducible Colab pipeline,facilitating further research on hallucination-aware model fine-tuning usinglightweight, hallucintion metrics like EHI.

Quick Read (beta)

loading the full paper ...