Towards safe and reliable deep learning for lung nodule malignancy estimation using out-of-distribution detection.

Journal: Computers in biology and medicine
Published Date:

Abstract

Artificial Intelligence (AI) models may fail or suffer from reduced performance when applied to unseen data that differs from the training data distribution, referred to as dataset shift. Automatic detection of out-of-distribution (OOD) data contributes to safe and reliable clinical implementation of AI models. In this study, we propose a recognized OOD detection method that utilizes the Mahalanobis distance (MD) and compare its performance to widely known classical methods. The MD measures the similarity between features of an unseen sample and the distribution of development samples features of intermediate model layers. We integrate our proposed method in an existing deep learning (DL) model for lung nodule malignancy risk estimation on chest CT and validate it across four dataset shifts known to reduce AI model performance. The results show that our proposed method outperforms the classical methods and can effectively detect near- and far-OOD samples across all datasets with different data distribution shifts. Additionally, we demonstrate that our proposed method can seamlessly incorporate additional In-distribution (ID) data while maintaining the ability to accurately differentiate between the remaining OOD cases. Lastly, we searched for the optimal OOD threshold in the OOD dataset where the performance of the DL model stays reliable, however no decline in DL performance was revealed as the OOD score increased.

Authors

  • DrĂ© Peeters
    Diagnostic Imaging Analysis Group, Medical Imaging Department, Radboud University Medical Center, Geert Grooteplein Zuid 10, 6525 GA, Nijmegen, the Netherlands. dre.peeters@radboudumc.nl.
  • Kiran V Venkadesh
    Diagnostic Imaging Analysis Group, Medical Imaging Department, Radboud University Medical Center, Geert Grooteplein Zuid 10, 6525 GA, Nijmegen, the Netherlands.
  • Renate Dinnessen
    Diagnostic Imaging Analysis Group, Medical Imaging Department, Radboud University Medical Center, Geert Grooteplein Zuid 10, 6525 GA, Nijmegen, the Netherlands.
  • Zaigham Saghir
  • Ernst T Scholten
    From the Diagnostic Image Analysis Group, Radboud University Medical Center, Geert Groteplein 10, Nijmegen 6500 HB, the Netherlands (K.M., E.T.S., S.S., C.M.S., B.v.G.); Department of Radiology, Bernhoven Hospital, Uden, the Netherlands (H.S.); Department of Radiology, Jeroen Bosch Hospital, 's-Hertogenbosch, the Netherlands (A.J.G.K., M.B.J.M.K., T.S., M.R.); Department of Radiology, Meander Medisch Centrum, Amersfoort, the Netherlands (C.M.S.); and Thirona, Nijmegen, the Netherlands (R.H.H.M.P., A.M., J.M.).
  • Rozemarijn Vliegenthart
    University of Groningen, University Medical Center Groningen, Department of Radiology, Hanzeplein 1, 9713 GZ Groningen, The Netherlands.
  • Mathias Prokop
    Department of Radiology, Radboud University Medical Center, Nijmegen, The Netherlands.
  • Colin Jacobs
    Diagnostic Image Analysis Group, Radboud University Medical Center, Nijmegen, The Netherlands.