Federated machine learning for a facilitated implementation of Artificial Intelligence in healthcare - a proof of concept study for the prediction of coronary artery calcification scores.

Journal: Journal of integrative bioinformatics
Published Date:

Abstract

The implementation of Artificial Intelligence (AI) still faces significant hurdles and one key factor is the access to data. One approach that could support that is federated machine learning (FL) since it allows for privacy preserving data access. For this proof of concept, a prediction model for coronary artery calcification scores (CACS) has been applied. The FL was trained based on the data in the different institutions, while the centralized machine learning model was trained on one allocation of data. Both algorithms predict patients with risk scores ≥5 based on age, biological sex, waist circumference, dyslipidemia and HbA1c. The centralized model yields a sensitivity of c. 66% and a specificity of c. 70%. The FL slightly outperforms that with a sensitivity of 67% while slightly underperforming it with a specificity of 69%. It could be demonstrated that CACS prediction is feasible via both, a centralized and an FL approach, and that both show very comparable accuracy. In order to increase accuracy, additional and a higher volume of patient data is required and for that FL is utterly necessary. The developed "CACulator" serves as proof of concept, is available as research tool and shall support future research to facilitate AI implementation.

Authors

  • Justus Wolff
    TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany.
  • Julian Matschinske
    Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany.
  • Dietrich Baumgart
    Preventicum Essen, Theodor-Althoff-Str. 47 45133 Essen, Germany.
  • Anne Pytlik
    Preventicum Essen, Theodor-Althoff-Str. 47 45133 Essen, Germany.
  • Andreas Keck
    Strategy Institute for Digital Health, Hamburg, Germany.
  • Arunakiry Natarajan
    Independent Researcher, Digital Health, Informatics and Data Science, Lower Saxony, Germany.
  • Claudio E von Schacky
    From the Department of Radiology and Biomedical Imaging (C.E.v.S., J.H.S., E.O., P.M.J., M.P., S.C.F., T.M.L., V.P.) and Department of Epidemiology and Biostatistics (F.L., M.C.N.), University of California, San Francisco, 185 Berry St, Suite 350, San Francisco, CA 94107; Department of Diagnostic and Interventional Radiology, Technische Universität München, Munich, Germany (C.E.v.S., S.C.F.); Department of Diagnostic and Interventional Radiology, Medical Center-University of Freiburg, Faculty of Medicine, Freiburg, Germany (P.M.J.); and Department of Radiology, University of California Davis Health, Sacramento, Calif (L.N.).
  • Josch K Pauling
    LipiTUM, Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany.
  • Jan Baumbach
    TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany.