Channel attention pyramid network for remote physiological measurement.

Journal: Scientific reports
Published Date:

Abstract

Remote photoplethysmography (rPPG) is an emerging contactless physiological parameter detection method utilizing cameras, showing great promise as a forefront technology for remote health assessment. While traditional rPPG methods have substantially contributed to the development of affordable camera-based health monitoring systems, their dependence on specific assumptions may lead to performance degradation when confronted with real-time dynamic interferences. The article presents a comprehensive overview of CAP-rPPG, an innovative method for remote physiological measurement through video analysis. This method employs a multi-scale deep learning architecture with a Gaussian pyramid to capture facial features at different scales that are often overlooked in prior work. A channel attention module further emphasizes rPPG-rich channels, mitigating the issue of feature dilution caused by excessive channel depth and enhancing the accuracy of physiological signal extraction from facial videos. The uniqueness of CAP-rPPG lies in its hybrid loss function, adeptly balancing the short-term characteristics, long-term characteristics of the signal and the correlation between the predicted HR and the ground-truth HR. CAP-rPPG demonstrates outstanding robustness under various challenging conditions, such as varying lighting environments and physical motion. It consistently outperforms most state-of-the-art methods on both the UBFC-rPPG and PURE datasets. Its capability to non-invasively capture subtle physiological changes from video data represents a significant leap forward in the realm of remote health monitoring technologies.

Authors

  • Jing Zhang
    MOEMIL Laboratory, School of Optoelectronic Information, University of Electronic Science and Technology of China, Chengdu, China.
  • Haixin Sun
    Department of Information and Communication, School of Informatics, Xiamen University, Xiamen 316005, China.
  • Yuhao Hu
    School of Optoelectronic Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China.
  • Guanghao Zhu
    Shanghai University of Traditional Chinese Medicine, 201203 Shanghai City, China.
  • Fang Liu
    The First Clinical Medical College of Gannan Medical University, Ganzhou 341000, Jiangxi Province, China.
  • Boyun Yan
    School of Optoelectronic Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China.
  • Jiahao Pu
    School of Optoelectronic Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China.
  • Xiaohui Du
  • Juanxiu Liu
  • Lin Liu
    Institute of Natural Sciences, MOE-LSC, School of Mathematical Sciences, CMA-Shanghai, SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University; Shanghai Artificial Intelligence Laboratory.
  • Ruqian Hao
  • Xingguo Wang
    Collaborative Innovation Center of Food Safety and Quality Control in Jiangsu Province, National Engineering Research Center for Functional Food, School of Food Science and Technology, Jiangnan University.
  • Yong Liu
    Department of Critical care medicine, Shenzhen Hospital, Southern Medical University, Guangdong, Shenzhen, China.