Empathy-R1: A Chain-of-Empathy and Reinforcement Learning Framework for Long-Form Mental Health Support

  • 2025-09-19 07:24:59
  • Xianrong Yao, Dong She, Chenxu Zhang, Yimeng Zhang, Yueru Sun, Noman Ahmed, Yang Gao, Zhanpeng Jin
  • 0

Abstract

Empathy is critical for effective mental health support, especially whenaddressing Long Counseling Texts (LCTs). However, existing Large LanguageModels (LLMs) often generate replies that are semantically fluent but lack thestructured reasoning necessary for genuine psychological support, particularlyin a Chinese context. To bridge this gap, we introduce Empathy-R1, a novelframework that integrates a Chain-of-Empathy (CoE) reasoning process withReinforcement Learning (RL) to enhance response quality for LCTs. Inspired bycognitive-behavioral therapy, our CoE paradigm guides the model to sequentiallyreason about a help-seeker's emotions, causes, and intentions, making itsthinking process both transparent and interpretable. Our framework is empoweredby a new large-scale Chinese dataset, Empathy-QA, and a two-stage trainingprocess. First, Supervised Fine-Tuning instills the CoE's reasoning structure.Subsequently, RL, guided by a dedicated reward model, refines the therapeuticrelevance and contextual appropriateness of the final responses. Experimentsshow that Empathy-R1 achieves strong performance on key automatic metrics. Moreimportantly, human evaluations confirm its superiority, showing a clearpreference over strong baselines and achieving a Win@1 rate of 44.30% on ournew benchmark. By enabling interpretable and contextually nuanced responses,Empathy-R1 represents a significant advancement in developing responsible andgenuinely beneficial AI for mental health support.

 

Quick Read (beta)

loading the full paper ...