Human-Computer Interaction with Detection of Speaker Emotions Using Convolution Neural Networks.

Journal: Computational Intelligence and Neuroscience
Published Date:

Abstract

Emotions play an essential role in human relationships, and many real-time applications rely on interpreting a speaker's emotion from their words. Speech emotion recognition (SER) modules aid human-computer interface (HCI) applications, but they are challenging to implement because balanced training data are scarce and it is unclear which features are sufficient for categorization. This research examines how the classification approach, the choice of feature combination, and data augmentation affect speech emotion detection accuracy. Selecting the right combination of handcrafted features for the classifier plays an integral part in reducing computational complexity. The proposed classification model, a 1D convolutional neural network (1D CNN), outperforms traditional machine learning approaches in classification. Unlike most earlier studies, which examined emotions primarily through a single-language lens, our analysis covers data sets in multiple languages. With the most discriminating features and data augmentation, our technique achieves 97.09%, 96.44%, and 83.33% accuracy on the BAVED, ANAD, and SAVEE data sets, respectively.
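
To illustrate the kind of operation a 1D CNN applies to a vector of handcrafted features, the following is a minimal sketch in plain Python of one convolution step followed by ReLU and max pooling. It is not the authors' model: the feature values, the kernel weights, and the function names (`conv1d`, `relu`, `max_pool1d`) are hypothetical and chosen only for illustration, and a real network would learn many kernels across several layers.

```python
from typing import List


def conv1d(signal: List[float], kernel: List[float], stride: int = 1) -> List[float]:
    """Valid (no padding) 1D cross-correlation, the core operation of a 1D CNN layer."""
    if stride < 1:
        raise ValueError("stride must be >= 1")
    k = len(kernel)
    if k == 0 or k > len(signal):
        raise ValueError("kernel must be non-empty and no longer than the signal")
    out = []
    # Slide the kernel along the feature vector and take a dot product per window.
    for start in range(0, len(signal) - k + 1, stride):
        window = signal[start:start + k]
        out.append(sum(w * x for w, x in zip(kernel, window)))
    return out


def relu(values: List[float]) -> List[float]:
    """Element-wise rectified linear unit."""
    return [max(0.0, v) for v in values]


def max_pool1d(values: List[float], size: int = 2) -> List[float]:
    """Non-overlapping max pooling; a trailing partial window is dropped."""
    return [max(values[i:i + size]) for i in range(0, len(values) - size + 1, size)]


# Hypothetical handcrafted feature vector for one utterance (illustrative values only).
features = [0.0, 1.0, 2.0, 1.0, 0.0, -1.0, -2.0]
# Hypothetical kernel; in a trained network these weights would be learned.
kernel = [1.0, 0.0, -1.0]

activations = max_pool1d(relu(conv1d(features, kernel)))
print(activations)
```

In a full classifier, the pooled activations from many such kernels would be flattened and passed to dense layers that output one score per emotion class.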

Authors

  • Abeer Ali Alnuaim
    Department of Computer Science and Engineering, College of Applied Studies and Community Services, King Saud University, P.O. Box 22459, Riyadh 11495, Saudi Arabia.
  • Mohammed Zakariah
    College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia.
  • Aseel Alhadlaq
    Department of Computer Science and Engineering, College of Applied Studies and Community Services, King Saud University, P.O. Box 22459, Riyadh 11495, Saudi Arabia.
  • Chitra Shashidhar
    Department of Commerce and Management, Seshadripuram College, Seshadripuram, Bengaluru-20, India.
  • Wesam Atef Hatamleh
    Department of Computer Science, College of Computer and Information Sciences, King Saud University, P.O. Box 51178, Riyadh 11543, Saudi Arabia.
  • Hussam Tarazi
    Department of Computer Science and Informatics, School of Engineering and Computer Science, Oakland University, 318 Meadow Brook Rd, Rochester MI 48309, USA.
  • Prashant Kumar Shukla
    Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh, India.
  • Rajnish Ratna
    Gedu College of Business Studies, Royal University of Bhutan, Bhutan.