Federated learning with workload-aware client scheduling in heterogeneous systems.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Federated Learning (FL) is a novel distributed machine learning, which allows thousands of edge devices to train models locally without uploading data to the central server. Since devices in real federated settings are resource-constrained, FL encounters systems heterogeneity, which causes considerable stragglers and incurs significant accuracy degradation. To tackle the challenges of systems heterogeneity and improve the robustness of the global model, we propose a novel adaptive federated framework in this paper. Specifically, we propose FedSAE that leverages the workload completion history of clients to adaptively predict the affordable training workload for each device. Consequently, FedSAE can significantly reduce stragglers in highly heterogeneous systems. We incorporate Active Learning into FedSAE to dynamically schedule participants. The server evaluates the devices' training value based on their training loss in each round, and larger-value clients are selected with a higher probability. As a result, the model convergence is accelerated. Furthermore, we propose q-FedSAE that combines FedSAE and q-FFL to improve global fairness in highly heterogeneous systems. The evaluations conducted in a highly heterogeneous system demonstrate that both FedSAE and q-FedSAE converge faster than FedAvg. In particular, FedSAE outperforms FedAvg across multiple federated datasets - FedSAE improves testing accuracy by 22.19% and reduces stragglers by 90.69% on average. Moreover, holding the same accuracy as FedSAE, q-FedSAE allows for more robust convergence and fairer model performance than q-FedAvg, FedSAE.

Authors

  • Li Li
    Department of Gastric Surgery, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, Affiliated Cancer Hospital of University of Electronic Science and Technology of China, Chengdu, China.
  • Duo Liu
    College of Computer Science, Chongqing University, Chongqing, China. Electronic address: liuduo@cqu.edu.cn.
  • Moming Duan
    College of Computer Science, Chongqing University, Chongqing, China. Electronic address: duanmoming@cqu.edu.cn.
  • Yu Zhang
    College of Marine Electrical Engineering, Dalian Maritime University, Dalian, China.
  • Ao Ren
    College of Computer Science, Chongqing University, Chongqing, China. Electronic address: ren.ao@cqu.edu.cn.
  • Xianzhang Chen
    College of Computer Science, Chongqing University, Chongqing, China. Electronic address: xzchen@cqu.edu.cn.
  • Yujuan Tan
    College of Computer Science, Chongqing University, Chongqing, China. Electronic address: tanyujuan@cqu.edu.cn.
  • Chengliang Wang
    College of Computer Science, Chongqing University, Chongqing, China. Electronic address: wangcl@cqu.edu.cn.