An Interactive Evaluation Framework for Empathetic Response Generation

Xixi Lei, Changqun Li, Liang He, Xin Lin*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

Empathetic response generation is a significant domain in Natural Language Processing (NLP). Its development is a critical step toward achieving humanized AI systems. However, current evaluations of empathetic dialogue models are primarily single-turn and static, leading to bias between evaluation results and real-world multi-turn interaction performance. To overcome this longstanding challenge, we propose a novel Interactive Empathy Evaluation Framework (IEEF). It eliminates the bias by facilitating human-free multi-turn interactive evaluation. Specifically, for human-free interaction, we design a user simulator trained with reinforcement learning, leveraging a reward model based on LLM scoring. For evaluation, we introduce a series of LLM-based empathy-related metrics. The experiments show that IEEF's evaluation results are highly correlated with real-world multi-turn interaction performance, demonstrating its alignment with human preferences in empathy evaluation.

Original language: English
Title of host publication: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Proceedings
Editors: Bhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798350368741
State: Published - 2025
Event: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Hyderabad, India
Duration: 6 Apr 2025 to 11 Apr 2025

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print): 1520-6149

Conference

Conference: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Country/Territory: India
City: Hyderabad
Period: 6/04/25 to 11/04/25

Keywords

  • Dialogue System
  • Empathetic Dialogue
  • Empathy Metrics
  • LLM Scoring
  • Natural Language Processing
