A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis.

Journal: Scientific reports

Published Date: Aug 5, 2025

Abstract

Speech is one of the most efficient methods of communication among humans, inspiring advancements in machine speech processing under Natural Language Processing (NLP). This field aims to enable computers to analyze, comprehend, and generate human language naturally. Speech processing, as a subset of artificial intelligence, is rapidly expanding due to its applications in emotion recognition, human-computer interaction, and sentiment analysis. This study introduces a novel algorithm for emotion recognition from speech using deep learning techniques. The proposed model achieves up to a 15% improvement compared to state-of-the-art deep learning methods in speech emotion recognition. It employs advanced supervised learning algorithms and deep neural network architectures, including Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units. These models are trained on labeled datasets to accurately classify emotions such as happiness, sadness, anger, fear, surprise, and neutrality. The research highlights the system's real-time application potential, such as analyzing audience emotional responses during live television broadcasts. By leveraging advancements in deep learning, the model achieves high accuracy in understanding and predicting emotional states, offering valuable insights into user behavior. This approach contributes to diverse domains, including media analysis, customer feedback systems, and human-machine interaction, showcasing the transformative potential of combining speech processing with neural networks.

Authors

Qingqing Hu

School of Nursing, Zhejiang Chinese Medical University, 548 Binwen Road, Binjiang District, Hangzhou, 310053, Zhejiang Province, People's Republic of China.
Yiran Peng

Faculty of Innovation Engineering, Macau University of Science and Technology, Avenida Wai Long, Taipa, Macau, 999078, China. 3230002514@student.must.edu.mo.
Zhong Zheng

National Research Center of Intelligent Equipment for Agriculture, Beijing 100097, China.

Keywords

Algorithms Deep Learning Emotions Female Humans Male Natural Language Processing Neural Networks, Computer Speech

External Resources

View on PubMed Access via DOI PubMed (40764384)

A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis.

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals