Accurate recognition of colorectal cancer with semi-supervised deep learning on pathological images.

Journal: Nature communications
Published Date:

Abstract

Machine-assisted pathological recognition has been focused on supervised learning (SL) that suffers from a significant annotation bottleneck. We propose a semi-supervised learning (SSL) method based on the mean teacher architecture using 13,111 whole slide images of colorectal cancer from 8803 subjects from 13 independent centers. SSL (~3150 labeled, ~40,950 unlabeled; ~6300 labeled, ~37,800 unlabeled patches) performs significantly better than the SL. No significant difference is found between SSL (~6300 labeled, ~37,800 unlabeled) and SL (~44,100 labeled) at patch-level diagnoses (area under the curve (AUC): 0.980 ± 0.014 vs. 0.987 ± 0.008, P value = 0.134) and patient-level diagnoses (AUC: 0.974 ± 0.013 vs. 0.980 ± 0.010, P value = 0.117), which is close to human pathologists (average AUC: 0.969). The evaluation on 15,000 lung and 294,912 lymph node images also confirm SSL can achieve similar performance as that of SL with massive annotations. SSL dramatically reduces the annotations, which has great potential to effectively build expert-level pathological artificial intelligence platforms in practice.

Authors

  • Gang Yu
    The Children's Hospital, Zhejiang University School of Medicine, Hangzhou, China.
  • Kai Sun
    Department of Materials Science and Engineering, Jinan University.
  • Chao Xu
    Department of Neurology, the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310009, China;Department of Emergency, Zhejiang Hospital, Hangzhou 310013, China.
  • Xing-Hua Shi
    Department of Computer & Information Sciences, College of Science and Technology, Temple University, Philadelphia, PA, 19122, USA.
  • Chong Wu
    School of Management, Harbin, China. Electronic address: wuchong@hit.edu.cn.
  • Ting Xie
    Department of Biomedical Engineering, School of Basic Medical Science, Central South University, 410013, Changsha, Hunan, China.
  • Run-Qi Meng
    Electronic Information Science and Technology, School of Physics and Electronics, Central South University, 410083, Changsha, Hunan, China.
  • Xiang-He Meng
    Centers of System Biology, Data Information and Reproductive Health, School of Basic Medical Science, Central South University, Changsha, Hunan 410008, China.
  • Kuan-Song Wang
    Department of Pathology, Xiangya Hospital, School of Basic Medical Science, Central South University, 410078, Changsha, Hunan, China. 375527162@qq.com.
  • Hong-Mei Xiao
    Centers of System Biology, Data Information and Reproductive Health, School of Basic Medical Science, Central South University, Changsha, Hunan 410008, China.
  • Hong-Wen Deng
    Center for Bioinformatics and Genomics, Department of Global Biostatistics and Data Science, Tulane University, New Orleans, LA 70112, USA.