跳到主要导航 跳到搜索 跳到主要内容

Decentralized Local Updates with Dual-Slow Estimation and Momentum-Based Variance-Reduction for Non-Convex Optimization

  • Kangyang Luo
  • , Kunkun Zhang
  • , Shengbo Zhang
  • , Xiang Li*
  • , Ming Gao
  • *此作品的通讯作者
  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Decentralized learning (DL) has recently employed local updates to reduce the communication cost for general non-convex optimization problems. Specifically, local updates require each node to perform multiple update steps on the parameters of the local model before communicating with others. However, most existing methods could be highly sensitive to data heterogeneity (i.e., non-iid data distribution) and adversely affected by the stochastic gradient noise. In this paper, we propose DSE-MVR to address these problems. Specifically, DSE-MVR introduces a dual-slow estimation strategy that utilizes the gradient tracking technique to estimate the global accumulated update direction for handling the data heterogeneity problem; also for stochastic noise, the method uses the mini-batch momentum-based variance-reduction technique. We theoretically prove that DSE-MVR can achieve optimal convergence results for general non-convex optimization in both iid and non-iid data distribution settings. In particular, the leading terms in the convergence rates derived by DSE-MVR are independent of the stochastic noise for large-batches or large partial average intervals (i.e., the number of local update steps). Further, we put forward DSE-SGD and theoretically justify the importance of the dual-slow estimation strategy in the data heterogeneity setting. Finally, we conduct extensive experiments to show the superiority of DSE-MVR against other state-of-the-art approaches. We provide our code here: https://anonymous.4open.science/r/DSE-MVR-32B8/.

源语言英语
主期刊名ECAI 2023 - 26th European Conference on Artificial Intelligence, including 12th Conference on Prestigious Applications of Intelligent Systems, PAIS 2023 - Proceedings
编辑Kobi Gal, Kobi Gal, Ann Nowe, Grzegorz J. Nalepa, Roy Fairstein, Roxana Radulescu
出版商IOS Press BV
1625-1632
页数8
ISBN(电子版)9781643684369
DOI
出版状态已出版 - 28 9月 2023
活动26th European Conference on Artificial Intelligence, ECAI 2023 - Krakow, 波兰
期限: 30 9月 20234 10月 2023

出版系列

姓名Frontiers in Artificial Intelligence and Applications
372
ISSN(印刷版)0922-6389
ISSN(电子版)1879-8314

会议

会议26th European Conference on Artificial Intelligence, ECAI 2023
国家/地区波兰
Krakow
时期30/09/234/10/23

指纹

探究 'Decentralized Local Updates with Dual-Slow Estimation and Momentum-Based Variance-Reduction for Non-Convex Optimization' 的科研主题。它们共同构成独一无二的指纹。

引用此