Federated learning of predictive models from federated Electronic Health Records.

Journal: International journal of medical informatics
Published Date:

Abstract

BACKGROUND: In an era of "big data," computationally efficient and privacy-aware solutions for large-scale machine learning problems become crucial, especially in the healthcare domain, where large amounts of data are stored in different locations and owned by different entities. Past research has been focused on centralized algorithms, which assume the existence of a central data repository (database) which stores and can process the data from all participants. Such an architecture, however, can be impractical when data are not centrally located, it does not scale well to very large datasets, and introduces single-point of failure risks which could compromise the integrity and privacy of the data. Given scores of data widely spread across hospitals/individuals, a decentralized computationally scalable methodology is very much in need.

Authors

  • Theodora S Brisimi
    Center for Information and Systems Engineering, Boston University, Boston, MA 02215 USA.
  • Ruidi Chen
    Department of Electrical & Computer Engineering, and Division of Systems Engineering, Boston University, 8 Saint Mary's St., Boston, MA 02215, United States.
  • Theofanie Mela
    Electrophysiology Lab/Arrhythmia Service, Massachusetts General Hospital, 55 Fruit Street, Boston, MA 02114, United States.
  • Alex Olshevsky
    Department of Electrical and Computer Engineering and Division of Systems Engineering, Boston University, Boston, MA.
  • Ioannis Ch Paschalidis
    Department of Electrical and Computer Engineering and Division of Systems Engineering, Boston University, Boston, MA.
  • Wei Shi
    Department of Orthopedics, Shenzhen Pediatrics Institute of Shantou University Medical College, Shenzhen, China.