A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning.

Journal: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
PMID:

Abstract

Current deep learning (DL) based approaches to speech intelligibility enhancement in noisy environments are often trained to minimise the feature distance between noise-free speech and enhanced speech signals. Despite improving the speech quality, such approaches do not deliver required levels of speech intelligibility in everyday noisy environments. Intelligibility-oriented (I-O) loss functions have recently been developed to train DL approaches for robust speech enhancement. Here, we formulate, for the first time, a novel canonical correlation based I-O loss function to more effectively train DL algorithms. Specifically, we present a canonical-correlation based short-time objective intelligibility (CC-STOI) cost function to train a fully convolutional neural network (FCN) model. We carry out comparative simulation experiments to show that our CC-STOI based speech enhancement framework outperforms state-of-the-art DL models trained with conventional distance-based and STOI-based loss functions, using objective and subjective evaluation measures for case of both unseen speakers and noises. Ongoing future work is evaluating the proposed approach for design of robust hearing-assistive technology.

Authors

  • Tassadaq Hussain
  • Muhammad Diyan
    School of Computer Science and Engineering, Kyungpook National University, Daegu 41566, Korea.
  • Mandar Gogate
    School of Computing, Edinburgh Napier University, Edinburgh, United Kingdom.
  • Kia Dashtipour
    James Watt School of Engineering, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Ahsan Adeel
  • Yu Tsao
  • Amir Hussain
    Cognitive Signal-Image and Control Processing Research Laboratory, School of Natural Sciences, University of Stirling, Stirling, FK9 4LA, United Kingdom.