Robust deep learning from incomplete annotation for accurate lung nodule detection.

Journal: Computers in biology and medicine
Published Date:

Abstract

Deep learning plays a significant role in the detection of pulmonary nodules in low-dose computed tomography (LDCT) scans, contributing to the diagnosis and treatment of lung cancer. Nevertheless, its effectiveness often relies on the availability of extensive, meticulously annotated dataset. In this paper, we explore the utilization of an incompletely annotated dataset for pulmonary nodules detection and introduce the FULFIL (Forecasting Uncompleted Labels For Inexpensive Lung nodule detection) algorithm as an innovative approach. By instructing annotators to label only the nodules they are most confident about, without requiring complete coverage, we can substantially reduce annotation costs. Nevertheless, this approach results in an incompletely annotated dataset, which presents challenges when training deep learning models. Within the FULFIL algorithm, we employ Graph Convolution Network (GCN) to discover the relationships between annotated and unannotated nodules for self-adaptively completing the annotation. Meanwhile, a teacher-student framework is employed for self-adaptive learning using the completed annotation dataset. Furthermore, we have designed a Dual-Views loss to leverage different data perspectives, aiding the model in acquiring robust features and enhancing generalization. We carried out experiments using the LUng Nodule Analysis (LUNA) dataset, achieving a sensitivity of 0.574 at a False positives per scan (FPs/scan) of 0.125 with only 10% instance-level annotations for nodules. This performance outperformed comparative methods by 7.00%. Experimental comparisons were conducted to evaluate the performance of our model and human experts on test dataset. The results demonstrate that our model can achieve a comparable level of performance to that of human experts. The comprehensive experimental results demonstrate that FULFIL can effectively leverage an incomplete pulmonary nodule dataset to develop a robust deep learning model, making it a promising tool for assisting in lung nodule detection.

Authors

  • Zebin Gao
    Beijing XiaoBaiShiJi Network Technical Co., Ltd, Beijing, 100084, China.
  • Yuchen Guo
    Institute for Brain and Cognitive Sciences, Tsinghua University, Beijing, China.
  • Guoxin Wang
    Department of Medical Education, Ruijin Hospital Affifiliated to Shanghai Jiao Tong University School of Medicine, 197 Ruijin Rd. II, Shanghai, 200025, China.
  • Xiangru Chen
    Beijing XiaoBaiShiJi Network Technical Co., Ltd, Beijing, 100084, China.
  • Xuyang Cao
    JD Health International Inc, Beijing 100176, China.
  • Chao Zhang
    School of Information Engineering, Suqian University, Suqian, Jiangsu, China.
  • Shan An
    State Key Lab of Software Development Environment, Beihang University, Beijing 100191, China.
  • Feng Xu
    Orthopedics Department of the First Affiliated Hospital of Tsinghua University, Beijing, China.