How Data Infrastructure Deals with Bias Problems in Medical Imaging.

Journal: Studies in health technology and informatics
Published Date:

Abstract

The paper discusses biases in medical imaging analysis, particularly focusing on the challenges posed by the development of machine learning algorithms and generative models. It introduces a taxonomy of bias problems and addresses them through a data infrastructure initiative: the PADME (Platform for Analytics and Distributed Machine-Learning for Enterprises), which is a part of the National Research Data Infrastructure for Personal Health Data (NFDI4Health) project. The PADME facilitates the structuring and sharing of health data while ensuring privacy and adherence to FAIR principles. The paper presents experimental results that show that generative methods can be effective in data augmentation. Complying with PADME infrastructure, this work proposes a solution framework to deal with bias in the different data stations and preserve privacy when transferring images. It highlights the importance of standardized data infrastructure in mitigating biases and promoting FAIR, reusable, and privacy-preserving research environments in healthcare.

Authors

  • Feifei Li
    School of Chinese Pharmacy, Beijing University of Chinese Medicine, Beijing 102488, China.
  • Ekaterina Kutafina
    Department of Medical Informatics, Uniklinik RWTH Aachen, Aachen, Germany.
  • Mirjam Schoneck
    Institute for Diagnostic and Interventional Radiology, University of Cologne, Faculty of Medicine and University Hospital Cologne, Germany.
  • Liliana Lourenco Caldeira
    Institute for Diagnostic and Interventional Radiology, University of Cologne, Faculty of Medicine and University Hospital Cologne, Germany.
  • Oya Beyan
    Institute for Biomedical Informatics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany.