Abstract
Objective: The reading level of health educational materials significantly influences the understandability and accessibility of the information, particularly for minoritized populations. Many patient educational resources surpass the reading level and complexity of widely accepted standards. There is a critical need for high-performing text simplification models in health information to enhance dissemination and literacy. This need is particularly acute in cancer education, where effective prevention and screening education can substantially reduce morbidity and mortality.

Methods: We introduce Simplified Digestive Cancer (SimpleDC), a parallel corpus of cancer education materials tailored for health text simplification research, comprising educational content from the American Cancer Society, Centers for Disease Control and Prevention, and National Cancer Institute. Utilizing SimpleDC alongside the existing Med-EASi corpus, we explore Large Language Model (LLM)-based simplification methods, including fine-tuning, reinforcement learning (RL), reinforcement learning with human feedback (RLHF), domain adaptation, and prompt-based approaches. Our experimentation encompasses Llama 2 and GPT-4. A novel RLHF reward function is introduced, featuring a lightweight model adept at distinguishing between original and simplified texts, thereby enhancing the model's effectiveness with unlabeled data.

Results: Fine-tuned Llama 2 models demonstrated high performance across various metrics. Our innovative RLHF reward function surpassed existing RL text simplification reward functions in effectiveness. The results underscore that RL/RLHF can augment fine-tuning, facilitating model training on unlabeled text and improving performance.
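To make the idea of a classifier-based reward concrete, the following is a minimal sketch and not the reward model used in this work. It assumes a tiny logistic-regression classifier over two hand-picked surface features (mean word length and mean sentence length) trained on a few invented toy sentences. The function names (`features`, `train_classifier`, `reward`), the features, and the toy data are all hypothetical. The score it returns is the classifier's estimated probability that a text is "simplified", which an RL loop could use as a reward on outputs that have no reference simplification.

```python
import math
import re


def features(text):
    """Two illustrative surface features (an assumption, not the paper's inputs)."""
    words = re.findall(r"[A-Za-z']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    mean_word_len = sum(len(w) for w in words) / max(len(words), 1)
    mean_sent_len = len(words) / max(len(sentences), 1)
    # Rough scaling so both features are of similar magnitude.
    return [mean_word_len / 10.0, mean_sent_len / 20.0]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_classifier(examples, steps=2000, lr=0.5):
    """Fit logistic regression by batch gradient descent; label 1 = simplified."""
    data = [(features(text), label) for text, label in examples]
    weights, bias = [0.0, 0.0], 0.0
    for _ in range(steps):
        grad_w, grad_b = [0.0, 0.0], 0.0
        for x, y in data:
            err = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias) - y
            grad_w = [g + err * xi for g, xi in zip(grad_w, x)]
            grad_b += err
        weights = [w - lr * g / len(data) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / len(data)
    return weights, bias


def reward(text, model):
    """Reward = estimated probability that `text` reads as simplified."""
    weights, bias = model
    x = features(text)
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


# Invented toy training pairs (hypothetical, for illustration only).
TOY_EXAMPLES = [
    ("Colorectal carcinoma screening modalities substantially diminish "
     "mortality through identification of precancerous adenomatous polyps.", 0),
    ("Hepatocellular malignancies frequently necessitate multidisciplinary "
     "therapeutic interventions including surgical resection.", 0),
    ("Screening tests can find colon cancer early. They can also find "
     "growths before they turn into cancer.", 1),
    ("Liver cancer often needs more than one kind of treatment. "
     "Surgery is one option.", 1),
]

model = train_classifier(TOY_EXAMPLES)
```

Under these assumptions, calling `reward(candidate, model)` on a generated simplification yields a value between 0 and 1, with plainer text scoring higher. A realistic version would replace the hand-picked features with a learned text encoder and train on a parallel corpus.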