Imbalanced Power Spectral Generation for Respiratory Rate and Uncertainty Estimations Based on Photoplethysmography Signal.

Journal: Sensors (Basel, Switzerland)
PMID:

Abstract

Respiratory rate (RR) changes in the elderly can indicate serious diseases. Thus, accurate estimation of RRs for cardiopulmonary function is essential for home health monitoring systems. However, machine learning (ML) algorithm errors embedded in health monitoring systems can be problematic in medical decision-making because some data have much larger sample sizes in the training set than others. This difference in sample size implies biosignal data imbalance. Therefore, we propose a novel methodology that combines bootstrap-based imbalanced continuous power spectral generation (IPSG) with ML approaches to estimate RRs and uncertainty to address data imbalance. The sample differences between normal breathing (12-20 breaths per minute (brpm)), dyspnea (≥20 brpm), and hypopnea (<8 brpm) show significant data imbalance, which can affect the learning of ML algorithms. Hence, the normal breathing part with a large amount of data is well-trained. In contrast, the dyspnea and hypopnea parts with relatively fewer data are not well-trained, and this data imbalance makes it difficult to estimate the reference variables of the actual dyspnea and hypopnea data parts, thus generating significant errors. Hence, we apply ML models by mixing artificial feature curves generated using a bootstrap model with the original feature curves to estimate RRs and solve this problem. As a result, the nonparametric bootstrap approach significantly increases the number of artificial feature curves. The generated artificial feature curves are selectively utilized in the highly imbalanced parts. Therefore, we confirm that IPSG is efficiently trained to predict the complex nonlinear relationship between the feature vectors obtained from the photoplethysmography signal and the reference RR. The proposed methodology shows more accurate prediction performance and uncertainty. Combining the proposed Gaussian process regression (GPR) with IPSG based on the Beth Israel Deaconess Medical Center dataset, the mean absolute error of the RR is 0.79 and 1.47 brpm. Our approach achieves high stability and accuracy by randomly mixing original and artificial feature curves. The proposed GPR-IPSG model can improve the performance of clinical home-based monitoring systems and design a reliable framework.

Authors

  • Soojeong Lee
    School of Electronic Engineering, Hanyang University, 222 Wangsimni-ro, Seongdong, Seoul 133-791, Republic of Korea.
  • Mugahed A Al-Antari
    Department of Computer Science and Engineering, College of Software, Kyung Hee University, 1732, Deogyeong-daero, Giheung-gu, Yongin-si, Gyeonggi-do 17104 Republic of Korea.
  • Gyanendra Prasad Joshi
    Department of AI Software, Kangwon National University, Samcheok 10587, Kangwon State, Republic of Korea.
  • Yeong Hyeon Gu
    Department of Artificial Intelligence and Data Science, Sejong University, Seoul, Korea.