Resolving challenges in deep learning-based analyses of histopathological images using explanation methods.

Journal: Scientific reports
Published Date:

Abstract

Deep learning has recently gained popularity in digital pathology due to its high prediction quality. However, the medical domain requires explanation and insight for a better understanding beyond standard quantitative performance evaluation. Recently, many explanation methods have emerged. This work shows how heatmaps generated by these explanation methods allow to resolve common challenges encountered in deep learning-based digital histopathology analyses. We elaborate on biases which are typically inherent in histopathological image data. In the binary classification task of tumour tissue discrimination in publicly available haematoxylin-eosin-stained images of various tumour entities, we investigate three types of biases: (1) biases which affect the entire dataset, (2) biases which are by chance correlated with class labels and (3) sampling biases. While standard analyses focus on patch-level evaluation, we advocate pixel-wise heatmaps, which offer a more precise and versatile diagnostic instrument. This insight is shown to not only be helpful to detect but also to remove the effects of common hidden biases, which improves generalisation within and across datasets. For example, we could see a trend of improved area under the receiver operating characteristic (ROC) curve by 5% when reducing a labelling bias. Explanation techniques are thus demonstrated to be a helpful and highly relevant tool for the development and the deployment phases within the life cycle of real-world applications in digital pathology.

Authors

  • Miriam Hägele
    TU Berlin, Machine Learning Group, Berlin, 10587, Germany.
  • Philipp Seegerer
    Machine-Learning Group, Department of Software Engineering and Theoretical Computer Science, Technical University of Berlin, 10623 Berlin, Germany.
  • Sebastian Lapuschkin
    Department of Video Coding & Analytics, Fraunhofer Heinrich Hertz Institute, Berlin, Germany.
  • Michael Bockmayr
    Institute of Pathology, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany.
  • Wojciech Samek
    Machine Learning Group, Fraunhofer Heinrich Hertz Institute, Berlin, Germany.
  • Frederick Klauschen
    Pathologisches Institut, Ludwig-Maximilians-Universität München, Thalkirchner Str. 36, 80337, München, Deutschland. f.klauschen@lmu.de.
  • Klaus-Robert Müller
    Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Deutschland.
  • Alexander Binder
    Berlin Institute of Technology, Machine Learning Laboratory, Berlin, Germany.