Deep Residual Network with D-S Evidence Theory for Bimodal Emotion Recognition

Yulong Liu; Luefeng Chen; Min Li; Min Wu; Witold Pedrycz; Kaoru Hirota

doi:10.1109/cac53003.2021.9727443

Abstract

1 min read

In this paper, the Deep Residual Network (ResNet) with Dempster-Shafer (D-S) evidence theory is presented for bimodal emotion recognition through applying facial expression and speech emotion information. By acquiring discriminative emotion features and performing bimodal fusion of emotions, this method can overcome the limitations of single modal emotion recognition and obtain higher recognition accuracy. The key areas of emotional features and spectrograms are firstly used to acquire low-level characteristics of emotion. Moreover, two ResNets are designed to select high-level emotion semantic features. Furthermore, under the structure of D-S evidence theory, the output probability values are used for achieving emotion fusion to improve the effectiveness of bimodal emotion recognition. The experimental studies on the eNTERFACE’05 database demonstrate a recognition accuracy of 88.67%, which is a noteworthy improvement of 23.11% and 9.32% compared to an individual mode of facial expressions and speech, respectively.

Deep Residual Network with D-S Evidence Theory for Bimodal Emotion Recognition

Abstract

Discussion(0)

Related publications

Two-Stage Fuzzy Fusion Based-Convolution Neural Network for Dynamic Emotion Recognition

<i>K</i>-Means Clustering-Based Kernel Canonical Correlation Analysis for Multimodal Emotion Recognition in Human–Robot Interaction

Coupled Multimodal Emotional Feature Analysis Based on Broad-Deep Fusion Networks in Human–Robot Interaction

Convolutional Features-Based Broad Learning With LSTM for Multidimensional Facial Emotion Recognition in Human–Robot Interaction

CNN-based Broad Learning with Efficient Incremental Reconstruction Model for Facial Emotion Recognition

Related publications

Article2020
Two-Stage Fuzzy Fusion Based-Convolution Neural Network for Dynamic Emotion Recognition
Article2020

Article2022
<i>K</i>-Means Clustering-Based Kernel Canonical Correlation Analysis for Multimodal Emotion Recognition in Human–Robot Interaction
Article2022

Article2023
Coupled Multimodal Emotional Feature Analysis Based on Broad-Deep Fusion Networks in Human–Robot Interaction
Article2023

Article2023
Convolutional Features-Based Broad Learning With LSTM for Multidimensional Facial Emotion Recognition in Human–Robot Interaction
Article2023

Article2020
CNN-based Broad Learning with Efficient Incremental Reconstruction Model for Facial Emotion Recognition
Article2020