Enhanced heart sound classification using Mel frequency cepstral coefficients and comparative analysis of single vs. ensemble classifier strategies.

Journal: PloS one
PMID:

Abstract

This paper seeks to enhance the performance of Mel Frequency Cepstral Coefficients (MFCCs) for detecting abnormal heart sounds. Heart sounds are first pre-processed to remove noise and then segmented into S1, systole, S2, and diastole intervals, with thirteen MFCCs estimated from each segment, yielding 52 MFCCs per beat. Finally, MFCCs are used for heart sound classification. For that purpose, a single classifier and an innovative ensemble classifier strategy are presented and compared. In the single classifier strategy, the MFCCs from nine consecutive beats are averaged to classify heart sounds by a single classifier (either a support vector machine (SVM), the k nearest neighbors (kNN), or a decision tree (DT)). Conversely, the ensemble classifier strategy employs nine classifiers (either nine SVMs, nine kNN classifiers, or nine DTs) to individually assess beats as normal or abnormal, with the overall classification based on the majority vote. Both methods were tested on a publicly available phonocardiogram database. The heart sound classification accuracy was 91.95% for the SVM, 91.9% for the kNN, and 87.33% for the DT in the single classifier strategy. Also, the accuracy was 93.59% for the SVM, 91.84% for the kNN, and 92.22% for the DT in the ensemble classifier strategy. Overall, the results demonstrated that MFCCs were more effective than other features, including time, time-frequency, and statistical features, evaluated in similar studies. In addition, the ensemble classifier strategy improved the accuracies of the DT and the SVM by 4.89% and 1.64%, implying that the averaging of MFCCs across multiple phonocardiogram beats in the single classifier strategy degraded the important cues that are required for detecting the abnormal heart sounds, and therefore should be avoided.

Authors

  • Mehdi Hosseinzadeh
    School of Computer Science, Duy Tan University, Da Nang, 550000, Viet Nam; Jadara Research Center, Jadara University, Irbid 21110, Jordan. Electronic address: mehdihosseinzadeh@duytan.edu.vn.
  • Amir Haider
    Department of AI and Robotics, Sejong University, Seoul, Republic of Korea.
  • Mazhar Hussain Malik
    School of Computing and Creative Technologies College of Arts, Technology and Environment (CATE) University of the West of England Frenchay Campus, Bristol, United Kingdom.
  • Mohammad Adeli
    Department of Biomedical Engineering, Dezful Branch, Islamic Azad University, Dezful, Iran.
  • Olfa Mzoughi
    Department of Computer Sciences, College of Computer Engineering & Sciences, Prince Sattam bin Abdulaziz University, Al-Kharj, Saudi Arabia.
  • Entesar Gemeay
    Department of Computer Engineering, Computer and Information Technology College, Taif University, Taif, Saudi Arabia.
  • Mokhtar Mohammadi
    Department of Information Technology, College of Engineering and Computer Science, Lebanese French University, Kurdistan Region, Iraq.
  • Hamid Alinejad-Rokny
    Systems Biology and Health Data Analytics Lab, The Graduate School of Biomedical Engineering, UNSW Sydney, 2052 Sydney, Australia; School of Computer Science and Engineering, The University of New South Wales (UNSW Sydney), 2052 Sydney, Australia; Health Data Analytics Program Leader, AI-enabled Processes (AIP) Research Centre, Macquarie University, Sydney 2109, Australia.
  • Parisa Khoshvaght
    DTU AI and Data Science Hub (DAIDASH), Duy Tan University, Da Nang, Vietnam.
  • Thantrira Porntaveetus
    Faculty of Dentistry, Department of Physiology, Center of Excellence in Genomics and Precision Dentistry, Clinical Research Center, Geriatric Dentistry and Special Patients Care International Program, Chulalongkorn University, Bangkok, Thailand.
  • Amir Masoud Rahmani
    Future Technology Research Center, National Yunlin University of Science and Technology, Yunlin, Taiwan. rahmania@yuntech.edu.tw.