Deep learning-based video-level view classification of two-dimensional transthoracic echocardiography.

Journal: Biomedical physics & engineering express
PMID:

Abstract

In recent years, deep learning (DL)-based automatic view classification of 2D transthoracic echocardiography (TTE) has demonstrated strong performance, but has not fully addressed key clinical requirements such as view coverage, classification accuracy, inference delay, and the need for thorough exploration of performance in real-world clinical settings. We proposed a clinical requirement-driven DL framework, TTESlowFast, for accurate and efficient video-level TTE view classification. This framework is based on the SlowFast architecture and incorporates both a sampling balance strategy and a data augmentation strategy to address class imbalance and the limited availability of labeled TTE videos, respectively. TTESlowFast achieved an overall accuracy of 0.9881, precision of 0.9870, recall of 0.9867, and F1 score of 0.9867 on the test set. After field deployment, the model's overall accuracy, precision, recall, and F1 score for view classification were 0.9607, 0.9586, 0.9499, and 0.9530, respectively. The inference time for processing a single TTE video was 105.0 ± 50.1 ms on a desktop GPU (NVIDIA RTX 3060) and 186.0 ± 5.2 ms on an edge computing device (Jetson Orin Nano), which basically meets the clinical demand for immediate processing following image acquisition. The TTESlowFast framework proposed in this study demonstrates effective performance in TTE view classification with low inference delay, making it well-suited for various medical scenarios and showing significant potential for practical application.

Authors

  • Hanlin Cheng
    School of Biological Science and Medical Engineering, Southeast University, Nanjing, People's Republic of China.
  • Zhongqing Shi
    Department of Ultrasound Medicine, The Affiliated Hospital of Nanjing University Medical School, Nanjing Drum Tower Hospital, Nanjing, People's Republic of China.
  • Zhanru Qi
    Department of Ultrasound Medicine, The Affiliated Hospital of Nanjing University Medical School, Nanjing Drum Tower Hospital, Nanjing, People's Republic of China.
  • Xiaoxian Wang
    Department of Ultrasound Medicine, The Affiliated Hospital of Nanjing University Medical School, Nanjing Drum Tower Hospital, Nanjing, People's Republic of China.
  • Guanjun Guo
    Department of Ultrasound Medicine, The Affiliated Hospital of Nanjing University Medical School, Nanjing Drum Tower Hospital, Nanjing, People's Republic of China.
  • Aijuan Fang
    Department of Ultrasound Medicine, The Affiliated Hospital of Nanjing University Medical School, Nanjing Drum Tower Hospital, Nanjing, People's Republic of China.
  • Zhibin Jin
    Department of Ultrasound Medicine, The Affiliated Hospital of Nanjing University Medical School, Nanjing Drum Tower Hospital, Nanjing, People's Republic of China.
  • Chunjie Shan
    School of Biological Science and Medical Engineering, Southeast University, Nanjing, People's Republic of China.
  • Ruiyang Chen
    School of Biological Science and Medical Engineering, Southeast University, Nanjing, People's Republic of China.
  • Yue Du
    Laboratory of Drug Metabolism and Pharmacokinetics, School of Pharmacy, Shenyang Pharmaceutical University, 103 Wenhua Road, Shenyang, 110016, PR China.
  • Sunnan Qian
    Department of Information Office, Jiangsu Province Official Hospital, Nanjing, People's Republic of China.
  • Shouhua Luo
  • Jing Yao
    Institute of Plant Protection, Jiangsu Academy of Agricultural Sciences, Nanjing, China.