FedDSS: A data-similarity approach for client selection in horizontal federated learning.

Journal: International journal of medical informatics
Published Date:

Abstract

BACKGROUND AND OBJECTIVE: Federated learning (FL) is an emerging distributed learning framework allowing multiple clients (hospitals, institutions, smart devices, etc.) to collaboratively train a centralized machine learning model without disclosing personal data. It has the potential to address several healthcare challenges, including a lack of training data, data privacy, and security concerns. However, model learning under FL is affected by non-i.i.d. data, leading to severe model divergence and reduced performance due to the varying client's data distributions. To address this problem, we propose FedDSS, Federated Data Similarity Selection, a framework that uses a data-similarity approach to select clients, without compromising client data privacy.

Authors

  • Tuong Minh Nguyen
    Department of Industrial Systems Engineering and Management, National University of Singapore, Singapore, 117576, Singapore. Electronic address: minh.t.nguyen@u.nus.edu.
  • Kim Leng Poh
    Department of Industrial Systems Engineering and Management, National University of Singapore, Singapore, 117576, Singapore.
  • Shu-Ling Chong
    Department of Emergency Medicine, KK Women's and Children's Hospital, Singapore, Singapore. chong.shu-ling@kkh.com.sg.
  • Jan Hau Lee
    SingHealth-Duke NUS Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore, 169857, Singapore; Children's Intensive Care Unit, KK Women's and Children's Hospital, Singapore, 229899, Singapore.