Multi-modal dataset creation for federated learning with DICOM-structured reports.

Journal: International journal of computer assisted radiology and surgery
PMID:

Abstract

Purpose: Federated training is often challenging on heterogeneous datasets due to divergent data storage options, inconsistent naming schemes, varied annotation procedures, and disparities in label quality. This is particularly evident in emerging multi-modal learning paradigms, where dataset harmonization, including a uniform data representation and filtering options, is of paramount importance.

Methods: DICOM-structured reports enable the standardized linkage of arbitrary information beyond the imaging domain and can be used within Python deep learning pipelines with highdicom. Building on this, we developed an open platform for data integration with interactive filtering capabilities, which simplifies creating patient cohorts with consistent multi-modal data across several sites.

Results: In this study, we extend our prior work by showing the platform's applicability to additional, more diverse data types and by streamlining datasets for federated training within an established consortium of eight university hospitals in Germany. We demonstrate its concurrent filtering ability by creating harmonized multi-modal datasets across all locations for predicting the outcome after minimally invasive heart valve replacement. The data include imaging and waveform data (i.e., computed tomography images, electrocardiography scans), annotations (i.e., calcification segmentations and point sets), and metadata (i.e., prostheses and pacemaker dependency).

Conclusion: Structured reports bridge the traditional gap between imaging systems and information systems. Using the inherent DICOM reference system, arbitrary data types can be queried concurrently to create meaningful cohorts for multi-centric data analysis. The graphical interface as well as example structured report templates are available at https://github.com/Cardio-AI/fl-multi-modal-dataset-creation .
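The concurrent filtering idea from the abstract can be illustrated with a minimal Python sketch. It is not the platform's implementation and does not use highdicom: it assumes a hypothetical flat index (RECORDS) in which each DICOM object has already been reduced to its site, patient identifier, and DICOM Modality value (CT, ECG, SEG, SR). The function name build_cohort and all record contents are invented for illustration.

```python
from collections import defaultdict

# Hypothetical index: one entry per DICOM object found at a site.
# Field names and values are assumptions made for this sketch only.
RECORDS = [
    {"site": "A", "patient_id": "P1", "modality": "CT"},
    {"site": "A", "patient_id": "P1", "modality": "ECG"},
    {"site": "A", "patient_id": "P1", "modality": "SEG"},
    {"site": "A", "patient_id": "P1", "modality": "SR"},
    {"site": "A", "patient_id": "P2", "modality": "CT"},
    {"site": "A", "patient_id": "P2", "modality": "SR"},
    {"site": "B", "patient_id": "P3", "modality": "CT"},
    {"site": "B", "patient_id": "P3", "modality": "ECG"},
    {"site": "B", "patient_id": "P3", "modality": "SEG"},
    {"site": "B", "patient_id": "P3", "modality": "SR"},
    {"site": "B", "patient_id": "P4", "modality": "ECG"},
    {"site": "B", "patient_id": "P4", "modality": "SEG"},
]


def build_cohort(records, required):
    """Return sorted (site, patient_id) keys that have every required modality."""
    available = defaultdict(set)
    for rec in records:
        # Collect the set of modalities available per patient and site.
        available[(rec["site"], rec["patient_id"])].add(rec["modality"])
    required = set(required)
    # Keep only patients whose available modalities cover the requirement.
    return sorted(key for key, mods in available.items() if required <= mods)


if __name__ == "__main__":
    # Patients with complete multi-modal data across both hypothetical sites.
    print(build_cohort(RECORDS, {"CT", "ECG", "SEG", "SR"}))
```

Requiring all modalities at once, rather than filtering each data type separately and intersecting afterwards, mirrors the idea that a cohort should only contain patients with consistent multi-modal data at every site.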

Authors

  • Malte Tölle
    Department of Internal Medicine III, Heidelberg University Hospital, Germany (S.E., M.T.).
  • Lukas Burger
    DZHK (German Centre for Cardiovascular Research, All Partner Sites), Munich, Germany.
  • Halvar Kelm
    DZHK (German Centre for Cardiovascular Research, All Partner Sites), Munich, Germany.
  • Florian André
    University of Heidelberg, Department of Cardiology, Angiology and Pneumology, Im Neuenheimer Feld 410, Heidelberg, 69120, Germany. Electronic address: florian.andre@med.uni-heidelberg.de.
  • Peter Bannas
    Department of Diagnostic and Interventional Radiology and Nuclear Medicine, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
  • Gerhard Diller
    Clinic for Cardiology III, University Hospital Münster, Münster, Germany.
  • Norbert Frey
    DZHK (German Centre for Cardiovascular Research, All Partner Sites), Munich, Germany.
  • Philipp Garthe
    Clinic for Cardiology III, University Hospital Münster, Münster, Germany.
  • Stefan Gross
    German Centre for Cardiovascular Research (DZHK), Partner Site Greifswald, Greifswald, Germany.
  • Anja Hennemuth
    Charité - Universitätsmedizin Berlin, Berlin, Germany; Fraunhofer MEVIS, Bremen, Germany; German Centre for Cardiovascular Research, Berlin, Germany.
  • Lars Kaderali
  • Nina Krüger
    DZHK (German Centre for Cardiovascular Research, All Partner Sites), Munich, Germany.
  • Andreas Leha
    Department of Medical Statistics, University Medical Center Göttingen, Humboldtallee 32, 37073 Göttingen, Germany.
  • Simon Martin
    Division of Cardiovascular Imaging, Department of Radiology and Radiological Science, Medical University of South Carolina, Charleston, South Carolina; Department of Diagnostic and Interventional Radiology, University Hospital Frankfurt, Frankfurt, Germany.
  • Alexander Meyer
    Department of Cardiothoracic and Vascular Surgery, Deutsches Herzzentrum Berlin, Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, Berlin, Germany; Berlin Institute of Health, Berlin, Germany. Electronic address: meyera@dhzb.de.
  • Eike Nagel
    Institute for Experimental and Translational Cardiovascular Imaging, DZHK Centre for Cardiovascular Imaging, Goethe University Frankfurt, Frankfurt am Main, Germany.
  • Stefan Orwat
    Department of Cardiology III - Adult Congenital and Valvular Heart Disease, University Hospital Muenster, Albert-Schweitzer-Campus 1, 48149, Münster, Germany.
  • Clemens Scherer
    Department of Medicine I, LMU University Hospital, LMU Munich, Germany (C.S.).
  • Moritz Seiffert
    DZHK (German Centre for Cardiovascular Research, All Partner Sites), Munich, Germany.
  • Jan Moritz Seliger
    DZHK (German Centre for Cardiovascular Research, All Partner Sites), Munich, Germany.
  • Stefan Simm
    Institute of Bioinformatics, University Medicine Greifswald, 17475 Greifswald, Germany.
  • Tim Friede
    Department of Medical Statistics, University Medical Center Göttingen, Humboldtallee 32, 37073 Göttingen, Germany.
  • Tim Seidler
    DZHK (German Center for Cardiovascular Research), Partner Site Göttingen, Robert-Koch-Str. 40, 37075 Göttingen, Germany.
  • Sandy Engelhardt