Deep Neural Network Driven Speech Classification for Relevance Detection in Automatic Medical Documentation.

Journal: Studies in health technology and informatics
Published Date:

Abstract

The automation of medical documentation is a highly desirable process, especially as it could avert significant temporal and monetary expenses in healthcare. With the help of complex modelling and high computational capability, Automatic Speech Recognition (ASR) and deep learning have made several promising attempts to this end. However, a factor that significantly determines the efficiency of these systems is the volume of speech that is processed in each medical examination. In the course of this study, we found that over half of the speech, recorded during follow-up examinations of patients treated with Intra-Vitreal Injections, was not relevant for medical documentation. In this paper, we evaluate the application of Convolutional and Long Short-Term Memory (LSTM) neural networks for the development of a speech classification module aimed at identifying speech relevant for medical report generation. In this regard, various topology parameters are tested and the effect of the model performance on different speaker attributes is analyzed. The results indicate that Convolutional Neural Networks (CNNs) are more successful than LSTM networks, and achieve a validation accuracy of 92.41%. Furthermore, on evaluation of the robustness of the model to gender, accent and unknown speakers, the neural network generalized satisfactorily.

Authors

  • Suhail Ahamed
    Fraunhofer Institute for Biomedical Engineering, Sulzbach, Germany.
  • Gabriele Weiler
    Fraunhofer Institute for Biomedical Engineering, St. Ingbert, Germany.
  • Karl Boden
    Klaus Heimann Eye Research Institute (KHERI), Sulzbach, Germany.
  • Kai Januschowski
    Klaus Heimann Eye Research Institute (KHERI), Sulzbach, Germany.
  • Matthias Stennes
    Fraunhofer Institute for Digital Media Technology, Oldenburg, Germany.
  • Patrick McCrae
    LangTec, Hamburg, Germany.
  • Cornelia Bock
    LangTec, Hamburg, Germany.
  • Carina Rawein
    LangTec, Hamburg, Germany.
  • Marco Petris
    LangTec, Hamburg, Germany.
  • Kilian Foth
    LangTec, Hamburg, Germany.
  • Kerstin Rohm
    Fraunhofer Institute for Biomedical Engineering, St. Ingbert, Germany.
  • Stephan Kiefer
    Fraunhofer Institute for Biomedical Engineering, St. Ingbert, Germany.