Efficient and Effective Training of COVID-19 Classification Networks With Self-Supervised Dual-Track Learning to Rank.

Journal: IEEE journal of biomedical and health informatics
Published Date:

Abstract

Coronavirus Disease 2019 (COVID-19) has rapidly spread worldwide since first reported. Timely diagnosis of COVID-19 is crucial both for disease control and patient care. Non-contrast thoracic computed tomography (CT) has been identified as an effective tool for the diagnosis, yet the disease outbreak has placed tremendous pressure on radiologists for reading the exams and may potentially lead to fatigue-related mis-diagnosis. Reliable automatic classification algorithms can be really helpful; however, they usually require a considerable number of COVID-19 cases for training, which is difficult to acquire in a timely manner. Meanwhile, how to effectively utilize the existing archive of non-COVID-19 data (the negative samples) in the presence of severe class imbalance is another challenge. In addition, the sudden disease outbreak necessitates fast algorithm development. In this work, we propose a novel approach for effective and efficient training of COVID-19 classification networks using a small number of COVID-19 CT exams and an archive of negative samples. Concretely, a novel self-supervised learning method is proposed to extract features from the COVID-19 and negative samples. Then, two kinds of soft-labels ('difficulty' and 'diversity') are generated for the negative samples by computing the earth mover's distances between the features of the negative and COVID-19 samples, from which data 'values' of the negative samples can be assessed. A pre-set number of negative samples are selected accordingly and fed to the neural network for training. Experimental results show that our approach can achieve superior performance using about half of the negative samples, substantially reducing model training time.

Authors

  • Yuexiang Li
    Computer Vision Institute, College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China.
  • Dong Wei
    National Institute of Healthcare Data Science, Nanjing University, Nanjing, China.
  • Jiawei Chen
  • Shilei Cao
    Tencent Youtu Lab, Malata Building, Kejizhongyi Road, Nanshan District, Shenzhen, 518075, China.
  • Hongyu Zhou
    Institute for AI in Medicine and Faculty of Medicine, Macau University of Science and Technology, Macau, China; National Clinical Research Center for Ocular Diseases, Eye Hospital, Wenzhou Medical University, Wenzhou, China.
  • Yanchun Zhu
  • Jianrong Wu
  • Lan Lan
  • Wenbo Sun
  • Tianyi Qian
  • Kai Ma
    Tencent Jarvis Lab, Shenzhen, 518057, China.
  • Haibo Xu
    State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, Yunnan, China.
  • Yefeng Zheng