Federated learning with workload-aware client scheduling in heterogeneous systems.

Journal: Neural networks : the official journal of the International Neural Network Society

Published Date: Aug 1, 2022

Abstract

Federated Learning (FL) is a novel distributed machine learning, which allows thousands of edge devices to train models locally without uploading data to the central server. Since devices in real federated settings are resource-constrained, FL encounters systems heterogeneity, which causes considerable stragglers and incurs significant accuracy degradation. To tackle the challenges of systems heterogeneity and improve the robustness of the global model, we propose a novel adaptive federated framework in this paper. Specifically, we propose FedSAE that leverages the workload completion history of clients to adaptively predict the affordable training workload for each device. Consequently, FedSAE can significantly reduce stragglers in highly heterogeneous systems. We incorporate Active Learning into FedSAE to dynamically schedule participants. The server evaluates the devices' training value based on their training loss in each round, and larger-value clients are selected with a higher probability. As a result, the model convergence is accelerated. Furthermore, we propose q-FedSAE that combines FedSAE and q-FFL to improve global fairness in highly heterogeneous systems. The evaluations conducted in a highly heterogeneous system demonstrate that both FedSAE and q-FedSAE converge faster than FedAvg. In particular, FedSAE outperforms FedAvg across multiple federated datasets - FedSAE improves testing accuracy by 22.19% and reduces stragglers by 90.69% on average. Moreover, holding the same accuracy as FedSAE, q-FedSAE allows for more robust convergence and fairer model performance than q-FedAvg, FedSAE.

Authors

Li Li

Department of Gastric Surgery, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, Affiliated Cancer Hospital of University of Electronic Science and Technology of China, Chengdu, China.
Duo Liu

College of Computer Science, Chongqing University, Chongqing, China. Electronic address: liuduo@cqu.edu.cn.
Moming Duan

College of Computer Science, Chongqing University, Chongqing, China. Electronic address: duanmoming@cqu.edu.cn.
Yu Zhang

College of Marine Electrical Engineering, Dalian Maritime University, Dalian, China.
Ao Ren

College of Computer Science, Chongqing University, Chongqing, China. Electronic address: ren.ao@cqu.edu.cn.
Xianzhang Chen

College of Computer Science, Chongqing University, Chongqing, China. Electronic address: xzchen@cqu.edu.cn.
Yujuan Tan

College of Computer Science, Chongqing University, Chongqing, China. Electronic address: tanyujuan@cqu.edu.cn.
Chengliang Wang

College of Computer Science, Chongqing University, Chongqing, China. Electronic address: wangcl@cqu.edu.cn.

Keywords

Humans Machine Learning Workload

External Resources

View on PubMed Access via DOI PubMed (35995021)

Federated learning with workload-aware client scheduling in heterogeneous systems.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals