Privacy risks of whole-slide image sharing in digital pathology.

Journal: Nature communications
Published Date:

Abstract

Access to large volumes of so-called whole-slide images-high-resolution scans of complete pathological slides-has become a cornerstone of the development of novel artificial intelligence methods in pathology for diagnostic use, education/training of pathologists, and research. Nevertheless, a methodology based on risk analysis for evaluating the privacy risks associated with sharing such imaging data and applying the principle "as open as possible and as closed as necessary" is still lacking. In this article, we develop a model for privacy risk analysis for whole-slide images which focuses primarily on identity disclosure attacks, as these are the most important from a regulatory perspective. We introduce a taxonomy of whole-slide images with respect to privacy risks and mathematical model for risk assessment and design . Based on this risk assessment model and the taxonomy, we conduct a series of experiments to demonstrate the risks using real-world imaging data. Finally, we develop guidelines for risk assessment and recommendations for low-risk sharing of whole-slide image data.

Authors

  • Petr Holub
    Institute of Computer Science, Masaryk University, Brno, Czech Republic.
  • Heimo Müller
    Medical University of Graz, Graz, Austria.
  • Tomáš Bíl
    Institute of Computer Science, Masaryk University, Brno, Czech Republic.
  • Luca Pireddu
    Visual and Data-intensive Computing Group, CRS4, Pula, Italy.
  • Markus Plass
    Medical University of Graz, Graz, Austria.
  • Fabian Prasser
    Berlin Institute of Health (BIH), Berlin, Germany.
  • Irene Schlünder
    BBMRI-ERIC, Graz, Austria.
  • Kurt Zatloukal
    Medical University of Graz, Graz, Austria.
  • Rudolf Nenutil
    Department of Pathology, Masaryk Memorial Cancer Institute, Brno, Czech Republic.
  • Tomáš Brázdil
    Faculty of Informatics, Masaryk University, Brno, Czech Republic.