Snore-GANs: Improving Automatic Snore Sound Classification With Synthesized Data.

Journal: IEEE journal of biomedical and health informatics
PMID:

Abstract

One of the frontier issues that severely hamper the development of automatic snore sound classification (ASSC) associates to the lack of sufficient supervised training data. To cope with this problem, we propose a novel data augmentation approach based on semi-supervised conditional generative adversarial networks (scGANs), which aims to automatically learn a mapping strategy from a random noise space to original data distribution. The proposed approach has the capability of well synthesizing "realistic" high-dimensional data, while requiring no additional annotation process. To handle the mode collapse problem of GANs, we further introduce an ensemble strategy to enhance the diversity of the generated data. The systematic experiments conducted on a widely used Munich-Passau snore sound corpus demonstrate that the scGANs-based systems can remarkably outperform other classic data augmentation systems, and are also competitive to other recently reported systems for ASSC.

Authors

  • Zixing Zhang
    Chair of Complex and Intelligent Systems, University of Passau, Innstr. 43, Passau 94032, Germany.
  • Jing Han
    Key Laboratory of Combinatorial Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education; School of Pharmaceutical Sciences, Wuhan University, Wuhan 430071, China.
  • Kun Qian
    Key Laboratory of Brain Health Intelligent Evaluation and Intervention (Beijing Institute of Technology), Ministry of Education, Beijing, China.
  • Christoph Janott
  • Yanan Guo
    School of Information Science and Engineering, Lanzhou University, No. 222, South Tianshui Road, Lanzhou, Gansu Province, 730000, People's Republic of China.
  • Björn Schuller
    Chair of Embedded Intelligence for Health Care & Wellbeing, University of Augsburg, 86159 Augsburg, Germany.