Longitudinal interpretability of deep learning based breast cancer risk prediction.

Journal: Physics in medicine and biology
Published Date:

Abstract

Deep-learning-based models have achieved state-of-the-art breast cancer risk (BCR) prediction performance. However, these models are highly complex, and the underlying mechanisms of BCR prediction are not fully understood. Key questions include whether these models can detect breast morphologic changes that lead to cancer. These findings would boost confidence in utilizing BCR models in practice and provide clinicians with new perspectives. In this work, we aimed to determine when oncogenic processes in the breast provide sufficient signal for the models to detect these changes.In total, 1210 screening mammograms were collected for patients screened at different times before the cancer was screen-detected and 2400 mammograms for patients with at least ten years of follow-up. MIRAI, a BCR risk prediction model, was used to estimate the BCR. Attribution heterogeneity was defined as the relative difference between the attributions obtained from the right and left breasts using one of the eight interpretability techniques. Model reliance on the side of the breast with cancer was quantified with AUC. The Mann-Whitney U test was used to check for significant differences in median absolute Attribution Heterogeneity between cancer patients and healthy individuals.All tested attribution methods showed a similar longitudinal trend, where the model reliance on the side of the breast with cancer was the highest for the 0-1 years-to-cancer interval (AUC = 0.85-0.95), dropped for the 1-3 years-to-cancer interval (AUC = 0.64-0.71), and remained above the threshold for random performance for the 3-5 years-to-cancer interval (AUC = 0.51-0.58). For all eight attribution methods, the median values of absolute attribution heterogeneity were significantly larger for patients diagnosed with cancer at one point (< 0.01).Interpretability of BCR prediction has revealed that long-term predictions (beyond three years) are most likely based on typical breast characteristics, such as breast density; for mid-term predictions (one to three years), the model appears to detect early signs of tumor development, while for short-term predictions (up to a year), the BCR model essentially functions as a breast cancer detection model.

Authors

  • Zan Klanecek
    University of Ljubljana, Faculty of Mathematics and Physics, Medical Physics, Ljubljana, Slovenia.
  • Yao-Kuan Wang
    KU Leuven, Department of Imaging and Pathology, Division of Medical Physics & Quality Assessment, Leuven, Belgium.
  • Tobias Wagner
    KU Leuven, Department of Imaging and Pathology, Division of Medical Physics & Quality Assessment, Leuven, Belgium.
  • Lesley Cockmartin
    UZ Leuven, Department of Radiology, Leuven, Belgium.
  • Nicholas Marshall
    KU Leuven, Department of Imaging and Pathology, Division of Medical Physics & Quality Assessment, Leuven, Belgium.
  • Brayden Schott
    Department of Radiation Oncology, Washington University School of Medicine, 4921 Parkview Place, Campus Box 8224, St. Louis, MO, 63110, USA.
  • Ali Deatsch
    University of Wisconsin-Madison, Department of Medical Physics, Madison, United States of America.
  • Andrej Studen
    University of Ljubljana, Faculty of Mathematics and Physics, Medical Physics, Ljubljana, Slovenia.
  • Katja Jarm
    Institute of Oncology Ljubljana, Ljubljana, Slovenia.
  • Mateja Krajc
    Institute of Oncology Ljubljana, Ljubljana, Slovenia.
  • Miloš Vrhovec
    Institute of Oncology Ljubljana, Ljubljana, Slovenia.
  • Hilde Bosmans
    Department of Imaging & Pathology, Biomedical Sciences Group, Catholic University of Leuven, Leuven, Belgium.
  • Robert Jeraj