Inter classifier comparison to detect voice pathologies.

Journal: Mathematical biosciences and engineering : MBE
Published Date:

Abstract

Voice pathologies are irregular vibrations produced due to vocal folds and various factors malfunctioning. In medical science, novel machine learning algorithms are applied to construct a system to identify disorders that occur invoice. This study aims to extract the features from the audio signals of four chosen diseases from the SVD dataset, such as laryngitis, cyst, non-fluency syndrome, and dysphonia, and then compare the four results of machine learning algorithms, i.e., SVM, Naïve Byes, decision tree and ensemble classifier. In this project, we have used a comparative approach along with the new combination of features to detect voice pathologies which are laryngitis, cyst, non-fluency syndrome, and dysphonia from the SVD dataset. The combination of specific 13 MFCC (mel-frequency cepstral coefficients) features along with pitch, zero crossing rate (ZCR), spectral flux, spectral entropy, spectral centroid, spectral roll-off, and short term energy for more accurate detection of voice pathologies. It is proven that the combination of features extracted gives the best product on the audio, which split into 10 ms. Four machine learning classifiers, SVM, Naïve Bayes, decision tree and ensemble classifier for the inter classifier comparison, give 93.18, 99.45,100 and 51%, respectively. Out of these accuracies, both Naïve Bayes and the decision tree show the most promising results with a higher detection rate. Naïve Bayes and decision tree gives the highest reported outcomes on the selected set of features in the proposed methodology. The SVM has also been concluded to be the commonly used voice condition identification algorithm.

Authors

  • Sidra Abid Syed
    Faculty of Electrical and Computer Engineering, Biomedical Engineering Department, Sir Syed University of Engineering and Technology, Karachi, Pakistan.
  • Munaf Rashid
    Department of Software Engineering, Ziauddin University, Faculty of Engineering, Science, Technology and Management (ZUFESTM), Karachi, Pakistan.
  • Samreen Hussain
    Electronic Engineering Department, Faculty of Computer Systems Engineering, Electronic Engineering and Telecommunication Engineering, Dawood University of Engineering and Technology, Karachi, Pakistan.
  • Anoshia Imtiaz
    Biomedical Engineering Department, Ziauddin University Faculty of Engineering Science Technology and Management, Karachi, Pakistan.
  • Hamnah Abid
    Biomedical Engineering Department, Ziauddin University Faculty of Engineering Science Technology and Management, Karachi, Pakistan.
  • Hira Zahid
    Department of Biomedical Engineering, Ziauddin University, Faculty of Engineering, Science, Technology and Management (ZUFESTM), Karachi, Pakistan.