A Multi-Label Deep Learning Model with Interpretable Grad-CAM for Diabetic Retinopathy Classification.

Journal: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Published Date:

Abstract

The characteristics of diabetic retinopathy (DR) fundus images generally consist of multiple types of lesions which provided strong evidence for the ophthalmologists to make diagnosis. It is particularly significant to figure out an efficient method to not only accurately classify DR fundus images but also recognize all kinds of lesions on them. In this paper, a deep learning-based multi-label classification model with Gradient-weighted Class Activation Mapping (Grad-CAM) was proposed, which can both make DR classification and automatically locate the regions of different lesions. To reducing laborious annotation work and improve the efficiency of labeling, this paper innovatively considered different types of lesions as different labels for a fundus image so that this paper changed the task of lesion detection into that of image classification. A total of five labels were pre-defined and 3228 fundus images were collected for developing our model. The architecture of deep learning model was designed by ourselves based on ResNet. Through experiments on the test images, this method acquired a sensitive of 93.9% and a specificity of 94.4% on DR classification. Moreover, the corresponding regions of lesions were reasonably outlined on the DR fundus images.

Authors

  • Hongyang Jiang
    Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Jie Xu
    Second Affiliated Hospital Zhejiang University School of Medicine, Hangzhou, Zhejiang Province 310000, China.
  • Rongjie Shi
  • Kang Yang
    School of Pharmacy, Minzu University of China, Beijing 100081, China.
  • Dongdong Zhang
    Brain Korea 21 PLUS Project for Medical Science, Yonsei University, Seoul, Korea.
  • Mengdi Gao
  • He Ma
    Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang 110819, China and Key Laboratory of Medical Image Computing, Ministry of Education, Northeastern University, Shenyang 110819, China.
  • Wei Qian
    Department of Electrical and Computer Engineering, University of Texas at El Paso, 500 West University Avenue, El Paso, TX 79968, USA; Sino-Dutch Biomedical and Information Engineering School, Northeastern University, No.11, Lane 3, Wenhua Road, Heping District, Shenyang, Liaoning 110819, China. Electronic address: wqian@utep.edu.