Split Learning for Distributed Collaborative Training of Deep Learning Models in Health Informatics.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
Published Date:

Abstract

Deep learning continues to rapidly evolve and is now demonstrating remarkable potential for numerous medical prediction tasks. However, realizing deep learning models that generalize across healthcare organizations is challenging. This is due, in part, to the inherent siloed nature of these organizations and patient privacy requirements. To address this problem, we illustrate how split learning can enable collaborative training of deep learning models across disparate and privately maintained health datasets, while keeping the original records and model parameters private. We introduce a new privacy-preserving distributed learning framework that offers a higher level of privacy compared to conventional federated learning. We use several biomedical imaging and electronic health record (EHR) datasets to show that deep learning models trained via split learning can achieve highly similar performance to their centralized and federated counterparts while greatly improving computational efficiency and reducing privacy risks.

Authors

  • Zhuohang Li
    Department of Computer Science, Vanderbilt University, Nashville, TN 37212, United States.
  • Chao Yan
    School of Pharmacy, Shanghai Jiao Tong University, Shanghai 200240, China.
  • Xinmeng Zhang
    Department of Computer Science, Vanderbilt University, Nashville, TN 37212, United States.
  • Gharib Gharibi
    TripleBlind, Kansas City, MO.
  • Zhijun Yin
    Vanderbilt University Medical Center, Nashville, TN, United States.
  • Xiaoqian Jiang
    School of Biomedical Informatics, University of Texas Health, Science Center at Houston, Houston, TX, USA.
  • Bradley A Malin
    Vanderbilt University, Nashville, TN.