A study of time-frequency features for CNN-based automatic heart sound classification for pathology detection.

Journal: Computers in Biology and Medicine

Abstract

This study concerns the automatic detection of structural heart abnormality risk from digital phonocardiogram (PCG) signals, aimed at pediatric heart disease screening applications. Recently, various systems based on convolutional neural networks (CNNs) trained on time-frequency representations of segmental PCG frames have been presented that outperform systems using hand-crafted features. This study focuses on the segmentation and time-frequency representation components of the CNN-based designs. We consider the features most commonly used in state-of-the-art systems (MFCC and Mel-spectrogram) and, as an alternative feature, a time-frequency representation informed by domain knowledge, namely sub-band envelopes. Via tests carried out on two high-quality databases with a large set of possible settings, we show that sub-band envelopes are preferable to the most commonly used features and that period-synchronous windowing is preferable to asynchronous windowing.
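To make the two design choices named in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not specify the filter bank, band edges, frame sizes, or how heart cycle onsets are obtained, so everything here is an assumption for illustration: the ideal FFT-domain band masks, the band edges (25, 100, 200, 400 Hz), the helper names, and the hypothetical cycle onsets. Asynchronous windowing is taken to mean fixed-length frames at a fixed hop, and period-synchronous windowing is taken to mean one frame per heart cycle, resampled to a fixed length.

```python
import numpy as np


def sub_band_envelopes(x, fs, band_edges_hz):
    """Amplitude envelope of x in each band [edge[i], edge[i+1]) Hz.

    Each band is isolated with an ideal FFT-domain mask (an assumption of
    this sketch). Keeping only positive frequencies and doubling them gives
    the analytic signal, whose magnitude is the envelope.
    Band edges are assumed to satisfy 0 < edge <= fs / 2.
    """
    x = np.asarray(x, dtype=float)
    spectrum = np.fft.fft(x)
    freqs = np.fft.fftfreq(len(x), d=1.0 / fs)
    envelopes = []
    for f_lo, f_hi in zip(band_edges_hz[:-1], band_edges_hz[1:]):
        mask = (freqs >= f_lo) & (freqs < f_hi)
        envelopes.append(np.abs(np.fft.ifft(2.0 * spectrum * mask)))
    return np.stack(envelopes)  # shape: (n_bands, n_samples)


def asynchronous_frames(feat, frame_len, hop):
    """Fixed-length frames at a fixed hop, ignoring heart cycle boundaries."""
    starts = range(0, feat.shape[1] - frame_len + 1, hop)
    return np.stack([feat[:, s:s + frame_len] for s in starts])


def period_synchronous_frames(feat, onsets, frame_len):
    """One frame per heart cycle, linearly resampled to frame_len columns.

    `onsets` holds sample indices of consecutive cycle starts; how they are
    detected is outside the scope of this sketch.
    """
    positions = np.arange(feat.shape[1])
    frames = []
    for start, end in zip(onsets[:-1], onsets[1:]):
        src = np.linspace(start, end - 1, frame_len)
        frames.append(np.stack([np.interp(src, positions, band) for band in feat]))
    return np.stack(frames)


# Synthetic stand-in for a PCG recording: a 50 Hz tone, 3 s at 1 kHz.
fs = 1000
t = np.arange(3 * fs) / fs
x = np.sin(2 * np.pi * 50 * t)

env = sub_band_envelopes(x, fs, [25, 100, 200, 400])
onsets = np.array([0, 800, 1600, 2400])  # hypothetical cycle starts (samples)

async_frames = asynchronous_frames(env, frame_len=500, hop=250)
sync_frames = period_synchronous_frames(env, onsets, frame_len=64)
print(env.shape, async_frames.shape, sync_frames.shape)
```

In this sketch every period-synchronous frame covers exactly one cycle, so a CNN would see the cycle at a consistent position in each input, whereas asynchronous frames start at arbitrary phases of the cycle. Whether this accounts for the preference reported in the abstract is not stated there.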

Authors

  • Baris Bozkurt
    Electrical and Electronics Engineering Department, Izmir Democracy University, Turkey. Electronic address: baris.bozkurt@idu.edu.tr.
  • Ioannis Germanakis
    Faculty of Medicine, University of Crete, Greece. Electronic address: germjohn@med.uoc.gr.
  • Yannis Stylianou
    Computer Science Department, University of Crete, Greece. Electronic address: yannis@csd.uoc.gr.