Navigating prevalence shifts in image analysis algorithm deployment.

Journal: Medical image analysis
PMID:

Abstract

Domain gaps are significant obstacles to the clinical implementation of machine learning (ML) solutions for medical image analysis. Although current research emphasizes new training methods and network architectures, the specific impact of prevalence shifts on algorithms in real-world applications is often overlooked. Differences in class frequencies between development and deployment data are crucial, particularly for the widespread adoption of artificial intelligence (AI), as disease prevalence can vary greatly across different times and locations. Our contribution is threefold. Based on a diverse set of 30 medical classification tasks (1) we demonstrate that lack of prevalence shift handling can have severe consequences on the quality of calibration, decision threshold, and performance assessment. Furthermore, (2) we show that prevalences can be accurately and reliably estimated in a data-driven manner. Finally, (3) we propose a new workflow for prevalence-aware image classification that uses estimated deployment prevalences to adjust a trained classifier to a new environment, without requiring additional annotated deployment data. Comprehensive experiments indicate that our proposed approach could contribute to generating better classifier decisions and more reliable performance estimates compared to current practice.

Authors

  • Patrick Godau
    Division of Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany; National Center for Tumor Diseases (NCT), Heidelberg, Germany.
  • Piotr Kalinowski
    Department of General, Transplant and Liver Surgery, Medical University of Warsaw, Warsaw, Poland.
  • Evangelia Christodoulou
    Department of Development & Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000 Belgium.
  • Annika Reinke
    German Cancer Research Center DKFZ, Division of Computer Assisted Medical Interventions, Heidelberg, Germany. Electronic address: a.reinke@dkfz.de.
  • Minu Tizabi
    Division of Intelligent Medical Systems, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Luciana Ferrer
    Instituto de Investigación en Ciencias de la Computación (ICC), CONICET-UBA, Ciudad Autónoma de Buenos Aires, Buenos Aires, Argentina.
  • Paul Jager
  • Lena Maier-Hein
    German Cancer Research Center (DKFZ), Computer Assisted Medical Interventions, Heidelberg, Germany.