Predicting Adverse Drug Reactions on Distributed Health Data using Federated Learning.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
Published Date:

Abstract

Using electronic health data to predict adverse drug reaction (ADR) incurs practical challenges, such as lack of adequate data from any single site for rare ADR detection, resource constraints on integrating data from multiple sources, and privacy concerns with creating a centralized database from person-specific, sensitive data. We introduce a federated learning framework that can learn a global ADR prediction model from distributed health data held locally at different sites. We propose two novel methods of local model aggregation to improve the predictive capability of the global model. Through comprehensive experimental evaluation using real-world health data from 1 million patients, we demonstrate the effectiveness of our proposed approach in achieving comparable performance to centralized learning and outperforming localized learning models for two types of ADRs. We also demonstrate that, for varying data distributions, our aggregation methods outperform state-of-the-art techniques, in terms of precision, recall, and accuracy.

Authors

  • Olivia Choudhury
    Postdoctoral Researcher, IBM Research, Cambridge, MA, 02142, USA. olivia.choudhury1@ibm.com.
  • Yoonyoung Park
    4 IBM Corporation, IBM Research, Cambridge, Massachusetts.
  • Theodoros Salonidis
    IBM T.J. Watson Research Center, Yorktown Heights, NY, USA.
  • Aris Gkoulalas-Divanis
    IBM Watson Health, Cambridge, Massachusetts, USA.
  • Issa Sylla
    IBM Research Cambridge, Massachusetts, USA.
  • Amar K Das
    IBM Research Cambridge, Massachusetts, USA.