[Constructing a cataplexy face prediction model for narcolepsy type 1 based on ResNet-18].

Journal: Zhonghua yi xue za zhi
Published Date:

Abstract

To establish a prediction model for the identifying of cataplexy facial features based on clinical shooting videos by using a deep learning image recognition network ResNet-18. A cross-sectional study. Twenty-five narcolepsy type 1 patients who were first diagnosed and never received treatment and 25 healthy controls recruited by advertisement in the Second Affiliated Hospital of Nanchang University from 2020 to 2023.After image preprocessing, a total of 1 180 images were obtained, including 583 cataplexy faces and 597 normal faces.90% were selected as the training set and validation set, and then expanded the data by 5 times.80% of the expanded data set was extracted as the training set and 20% as the validation set, that is, the number of the training set was (583+597)×0.9×0.8×5=4 248, the number of the validation set was (583+597)×0.9×0.2×5=1 062. The data sets for training and validation were used train parameters to establish the model and were trained through the five-fold cross-validation method, to establish the ResNet-18 cataplexy face recognition model via transfer learning.10% (118 images) of the original non-amplified images were extracted as the test set. The test set data did not participate in data enhancement and model training, and was only used to evaluate the final performance of the model. Finally, ResNet-18 was compared with VGG-16, ResNet-34 and Inception V3 deep learning models, and the receiver operating characteristic curve was used to evaluate the value of ResNet-18 image recognition network in cataplexy face recognition. Among 25 patients with narcolepsy type 1, 15 were males and 10 were females, aged [ (, )] of 14.0(11.0, 20.5) years.Among 25 healthy controls, 14 were males and 11 were females, with a median age of 16.0(14.4, 23.0) years.The overall accuracy of ResNet-18 image recognition network in the test set was 90.9%, the sensitivity was 96.4% and the specificity was 85.2%. The area under the ROC curve was 0.99(95%:0.96-1.00). The ResNet-18 model parameter amount was 11.69 M, the floating point operation amount was 1 824.03 M, and the single image recognition time was 5.9 ms. The cataplexy face prediction model built based on the deep learning image recognition network ResNet-18 has a high accuracy in identifying cataplexy faces.

Authors

  • Y Q Yuan
    Department of Neurology, the Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang 330006, China.
  • Y Zhu
    Department of Anesthesiology, Fudan University Shanghai Cancer Center; Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China.
  • B Z Chen
    Queen Mary University of London, Nanchang University, Nanchang 330006, China.
  • T T Yin
    Department of Neurology, the Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang 330006, China.
  • Q Ke
    Department of Neurology, the First Affiliated Hospital, Zhejiang University School of Medicine,Hangzhou 310003,China.
  • L Xie
    From the Department of Radiology, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania.