TY - JOUR
T1 - FedCFA
T2 - 39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025
AU - Jiang, Zhonghua
AU - Xu, Jimin
AU - Zhang, Shengyu
AU - Shen, Tao
AU - Li, Jiwei
AU - Kuang, Kun
AU - Cai, Haibin
AU - Wu, Fei
N1 - Publisher Copyright:
Copyright © 2025, Association for the Advancement of Artificial Intelligence.
PY - 2025/4/11
Y1 - 2025/4/11
N2 - Federated learning (FL) is a promising technology for data privacy and distributed optimization, but it suffers from data imbalance and heterogeneity among clients. Existing FL methods try to solve the problems by aligning client with server model or by correcting client model with control variables. These methods excel on IID and general Non-IID data but perform mediocrely in Simpson's Paradox scenarios. Simpson's Paradox refers to the phenomenon that the trend observed on the global dataset disappears or reverses on a subset, which may lead to the fact that global model obtained through aggregation in FL does not accurately reflect the distribution of global data. Thus, we propose FedCFA, a novel FL framework employing counterfactual learning to generate counterfactual samples by replacing local data critical factors with global average data, aligning local data distributions with the global and mitigating Simpson's Paradox effects. In addition, to improve the quality of counterfactual samples, we introduce factor decorrelation (FDC) loss to reduce the correlation among features and thus improve the independence of extracted factors. We conduct extensive experiments on six datasets and verify that our method outperforms other FL methods in terms of efficiency and global model accuracy under limited communication rounds.
AB - Federated learning (FL) is a promising technology for data privacy and distributed optimization, but it suffers from data imbalance and heterogeneity among clients. Existing FL methods try to solve the problems by aligning client with server model or by correcting client model with control variables. These methods excel on IID and general Non-IID data but perform mediocrely in Simpson's Paradox scenarios. Simpson's Paradox refers to the phenomenon that the trend observed on the global dataset disappears or reverses on a subset, which may lead to the fact that global model obtained through aggregation in FL does not accurately reflect the distribution of global data. Thus, we propose FedCFA, a novel FL framework employing counterfactual learning to generate counterfactual samples by replacing local data critical factors with global average data, aligning local data distributions with the global and mitigating Simpson's Paradox effects. In addition, to improve the quality of counterfactual samples, we introduce factor decorrelation (FDC) loss to reduce the correlation among features and thus improve the independence of extracted factors. We conduct extensive experiments on six datasets and verify that our method outperforms other FL methods in terms of efficiency and global model accuracy under limited communication rounds.
UR - https://www.scopus.com/pages/publications/105004169706
U2 - 10.1609/aaai.v39i17.33942
DO - 10.1609/aaai.v39i17.33942
M3 - 会议文章
AN - SCOPUS:105004169706
SN - 2159-5399
VL - 39
SP - 17662
EP - 17670
JO - Proceedings of the AAAI Conference on Artificial Intelligence
JF - Proceedings of the AAAI Conference on Artificial Intelligence
IS - 17
Y2 - 25 February 2025 through 4 March 2025
ER -