Efficient and Effective Training of COVID-19 Classification Networks With Self-Supervised Dual-Track Learning to Rank.

Journal: IEEE journal of biomedical and health informatics

Published Date: Aug 20, 2020

Abstract

Coronavirus Disease 2019 (COVID-19) has rapidly spread worldwide since first reported. Timely diagnosis of COVID-19 is crucial both for disease control and patient care. Non-contrast thoracic computed tomography (CT) has been identified as an effective tool for the diagnosis, yet the disease outbreak has placed tremendous pressure on radiologists for reading the exams and may potentially lead to fatigue-related mis-diagnosis. Reliable automatic classification algorithms can be really helpful; however, they usually require a considerable number of COVID-19 cases for training, which is difficult to acquire in a timely manner. Meanwhile, how to effectively utilize the existing archive of non-COVID-19 data (the negative samples) in the presence of severe class imbalance is another challenge. In addition, the sudden disease outbreak necessitates fast algorithm development. In this work, we propose a novel approach for effective and efficient training of COVID-19 classification networks using a small number of COVID-19 CT exams and an archive of negative samples. Concretely, a novel self-supervised learning method is proposed to extract features from the COVID-19 and negative samples. Then, two kinds of soft-labels ('difficulty' and 'diversity') are generated for the negative samples by computing the earth mover's distances between the features of the negative and COVID-19 samples, from which data 'values' of the negative samples can be assessed. A pre-set number of negative samples are selected accordingly and fed to the neural network for training. Experimental results show that our approach can achieve superior performance using about half of the negative samples, substantially reducing model training time.

Authors

Yuexiang Li

Computer Vision Institute, College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China.
Dong Wei

National Institute of Healthcare Data Science, Nanjing University, Nanjing, China.
Jiawei Chen
Shilei Cao

Tencent Youtu Lab, Malata Building, Kejizhongyi Road, Nanshan District, Shenzhen, 518075, China.
Hongyu Zhou

Institute for AI in Medicine and Faculty of Medicine, Macau University of Science and Technology, Macau, China; National Clinical Research Center for Ocular Diseases, Eye Hospital, Wenzhou Medical University, Wenzhou, China.
Yanchun Zhu
Jianrong Wu
Lan Lan
Wenbo Sun
Tianyi Qian
Kai Ma

Tencent Jarvis Lab, Shenzhen, 518057, China.
Haibo Xu

State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, Yunnan, China.
Yefeng Zheng

Keywords

Algorithms Betacoronavirus Clinical Laboratory Techniques Cohort Studies Computational Biology Coronavirus Infections COVID-19 COVID-19 Testing Deep Learning Diagnostic Errors Humans Neural Networks, Computer Pandemics Pneumonia, Viral Radiographic Image Interpretation, Computer-Assisted Retrospective Studies SARS-CoV-2 Supervised Machine Learning Tomography, X-Ray Computed

External Resources

View on PubMed Access via DOI PubMed (32816680)

Efficient and Effective Training of COVID-19 Classification Networks With Self-Supervised Dual-Track Learning to Rank.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals