I Understand How You Feel: Enhancing Deeper Emotional Support Through Multilingual Emotional Validation in Dialogue System
Zi Haur Pang, Yahui Fu, Koji Inoue, Tatsuya Kawahara
- Published
- Jun 10, 2026 — 09:59 UTC
Problem
The paper addresses the lack of computational approaches to emotional validation in dialogue systems, a critical aspect of providing deeper emotional support. Despite its therapeutic value, emotional validation has not been extensively explored in the literature, particularly in multilingual contexts. The authors present this work as a preprint, indicating that it has not yet undergone peer review.
Method
The authors propose a three-part framework for emotional validation in dialogue systems, which includes: (i) validating response identification, (ii) validation timing detection, and (iii) validating response generation. To facilitate research in these areas, they introduce two novel datasets: M-EDESConv, a 120k English-Japanese multilingual corpus created through a combination of manual and automatic annotation, and M-TESC, a multilingual spoken-dialogue test set. For the validation timing detection task, they develop MEGUMI (Multilingual Emotion-aware Gated Unit for Mutual Integration), which integrates frozen XLM-RoBERTa embeddings with language-specific emotion encoders using cross-modal attention and gated fusion mechanisms. This architecture allows for effective fusion of semantic and emotional information across languages.
Results
MEGUMI demonstrates superior performance on both the M-EDESConv and M-TESC datasets, achieving significant improvements over baseline models. The authors report that MEGUMI outperforms traditional methods in both objective metrics (e.g., accuracy, F1 score) and subjective evaluations (user satisfaction ratings). Additionally, their EmoValidBench benchmarks reveal that current large language models (LLMs) like GPT-4.1 Nano and Llama-3.1 8B can generate contextually relevant and diverse validating responses, but they still struggle with nuanced emotional understanding, indicating a gap in the models’ capabilities.
Limitations
The authors acknowledge several limitations, including the potential biases in the manually annotated datasets and the reliance on existing LLMs, which may not fully capture the complexities of emotional validation. They also note that while MEGUMI shows promise, its performance may vary across different languages and cultural contexts, which could affect generalizability. Furthermore, the subjective nature of emotional validation poses challenges in evaluation metrics, as emotional responses can be highly context-dependent.
Why it matters
This work has significant implications for the development of emotionally intelligent dialogue systems, particularly in multilingual settings. By providing a structured approach to emotional validation, it paves the way for more empathetic AI interactions, which are crucial in applications such as mental health support and customer service. The introduction of M-EDESConv and M-TESC datasets, along with the MEGUMI architecture, offers valuable resources for future research in this domain. As highlighted in the paper, improving emotional understanding in LLMs is essential for advancing the field of affective computing, as discussed in arXiv.
By Turing Wire editorial staff · Jun 10, 2026 · Editorial standards →
Source: arXiv cs.CL