TY - JOUR
T1 - Joint Client Selection and Bandwidth Allocation of Wireless Federated Learning by Deep Reinforcement Learning
AU - Mao, Wei
AU - Lu, Xingjian
AU - Jiang, Yuhui
AU - Zheng, Haikun
N1 - Publisher Copyright:
© 2008-2012 IEEE.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - Federated Learning (FL) is a promising paradigm for massive data mining service while protecting users' privacy. In wireless federated learning networks (WFLNs), limited communication resources and heterogeneity of user devices have essential impacts on training efficiency of FL, hence it is critical to select clients and allocate network bandwidths among them in each learning round to improve the training efficiency. In this article, we formulate the joint client selection and bandwidth allocation optimization problem as a MDP process and design a FL framework CSBWA to solve it. CSBWA relies on DRL-based REINFORCE algorithm to automatically perform effective policy based on observed information, e.g., client states, historical bandwidths, and feedback rewards. It is able to achieve lower time cost and energy consumption with long-term FL performance guarantee by jointly optimizing the client selection and bandwidth allocation. Experimental results show the effectiveness of CSBWA in reducing time cost and energy consumption while guaranteeing model performance of wireless federated learning compared with existing state-of-art methods.
AB - Federated Learning (FL) is a promising paradigm for massive data mining service while protecting users' privacy. In wireless federated learning networks (WFLNs), limited communication resources and heterogeneity of user devices have essential impacts on training efficiency of FL, hence it is critical to select clients and allocate network bandwidths among them in each learning round to improve the training efficiency. In this article, we formulate the joint client selection and bandwidth allocation optimization problem as a MDP process and design a FL framework CSBWA to solve it. CSBWA relies on DRL-based REINFORCE algorithm to automatically perform effective policy based on observed information, e.g., client states, historical bandwidths, and feedback rewards. It is able to achieve lower time cost and energy consumption with long-term FL performance guarantee by jointly optimizing the client selection and bandwidth allocation. Experimental results show the effectiveness of CSBWA in reducing time cost and energy consumption while guaranteeing model performance of wireless federated learning compared with existing state-of-art methods.
KW - Wireless federated learning
KW - bandwidth allocation
KW - client selection
KW - deep reinforcement learning
UR - https://www.scopus.com/pages/publications/85182354829
U2 - 10.1109/TSC.2024.3350050
DO - 10.1109/TSC.2024.3350050
M3 - 文章
AN - SCOPUS:85182354829
SN - 1939-1374
VL - 17
SP - 336
EP - 348
JO - IEEE Transactions on Services Computing
JF - IEEE Transactions on Services Computing
IS - 1
ER -