Development of Debiasing Technique for Lung Nodule Chest X-ray Datasets to Generalize Deep Learning Models.

Journal: Sensors (Basel, Switzerland)
Published Date:

Abstract

Screening programs for early lung cancer diagnosis are uncommon, primarily due to the challenge of reaching at-risk patients located in rural areas far from medical facilities. To overcome this obstacle, a comprehensive approach is needed that combines mobility, low cost, speed, accuracy, and privacy. One potential solution lies in combining the chest X-ray imaging mode with federated deep learning, ensuring that no single data source can bias the model adversely. This study presents a pre-processing pipeline designed to debias chest X-ray images, thereby enhancing internal classification and external generalization. The pipeline employs a pruning mechanism to train a deep learning model for nodule detection, utilizing the most informative images from a publicly available lung nodule X-ray dataset. Histogram equalization is used to remove systematic differences in image brightness and contrast. Model training is then performed using combinations of lung field segmentation, close cropping, and rib/bone suppression. The resulting deep learning models, generated through this pre-processing pipeline, demonstrate successful generalization on an independent lung nodule dataset. By eliminating confounding variables in chest X-ray images and suppressing signal noise from the bone structures, the proposed deep learning lung nodule detection algorithm achieves an external generalization accuracy of 89%. This approach paves the way for the development of a low-cost and accessible deep learning-based clinical system for lung cancer screening.

Authors

  • Michael J Horry
    Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo, NSW 2007, Australia.
  • Subrata Chakraborty
    Center for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and Information Technology, University of Technology Sydney, Australia.
  • Biswajeet Pradhan
    School of Systems, Management, and Leadership, Faculty of Engineering and IT, University of Technology Sydney, New South Wales, Australia; Department of Energy and Mineral Resources Engineering, Choongmu-gwan, Sejong University, 209 Neungdong-ro Gwangjin-gu, 05006 Seoul, South Korea.
  • Manoranjan Paul
    Machine Vision and Digital Health (MaViDH), School of Computing, Mathematics, and Engineering, Charles Sturt University, Australia.
  • Jing Zhu
    College of Environmental Science and Engineering, Beijing Forestry University, Beijing 100083, China.
  • Hui Wen Loh
    School of Science and Technology, Singapore University of Social Sciences, Singapore, Singapore.
  • Prabal Datta Barua
    Cogninet Australia, Sydney, NSW 2010 Australia.
  • U Rajendra Acharya
    School of Business (Information Systems), Faculty of Business, Education, Law & Arts, University of Southern Queensland, Darling Heights, Australia.