TY - GEN
T1 - A Mixed-State Streaming Edge Partitioning based on Combinatorial Design
AU - Zhang, Zhenyu
AU - Qu, Wenwen
AU - Zhang, Weixi
AU - Shang, Junlin
AU - Wang, Xiaoling
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Graph partitioning is crucial in distributed graph computing systems, while impacting load balancing and communication between machines. To cope with the soaring scale of graphs, the streaming model has shown promising performance in graph partitioning. Although streaming model can deal with the bottleneck of memory usage for large-scale graphs, existing streaming partitioning algorithms not only lack sufficient quality but also cannot provide theoretical boundaries for graph partitioning. In addition, most streaming partitioning algorithms are sensitive to the order of edge streaming. In this paper, we model the edge partitioning problem as a combinatorial design problem, and provide a tight theoretical boundary. Based on the balanced edge partitioning design, we proposed a mixed-state streaming edge partitioning algorithm, which can generate high-quality graph partitions by mapping matrix and use the historical partition information to further optimize the partition quality and load balance. The experiments show that our proposed algorithm reduces partitioning time by more than half compared to the mainstream HDRF algorithm while maintaining load balance, and improves partitioning quality by about three times.
AB - Graph partitioning is crucial in distributed graph computing systems, while impacting load balancing and communication between machines. To cope with the soaring scale of graphs, the streaming model has shown promising performance in graph partitioning. Although streaming model can deal with the bottleneck of memory usage for large-scale graphs, existing streaming partitioning algorithms not only lack sufficient quality but also cannot provide theoretical boundaries for graph partitioning. In addition, most streaming partitioning algorithms are sensitive to the order of edge streaming. In this paper, we model the edge partitioning problem as a combinatorial design problem, and provide a tight theoretical boundary. Based on the balanced edge partitioning design, we proposed a mixed-state streaming edge partitioning algorithm, which can generate high-quality graph partitions by mapping matrix and use the historical partition information to further optimize the partition quality and load balance. The experiments show that our proposed algorithm reduces partitioning time by more than half compared to the mainstream HDRF algorithm while maintaining load balance, and improves partitioning quality by about three times.
KW - combinatorial design
KW - graph partitioning
KW - streaming partitioning
UR - https://www.scopus.com/pages/publications/85185396106
U2 - 10.1109/ICDM58522.2023.00096
DO - 10.1109/ICDM58522.2023.00096
M3 - 会议稿件
AN - SCOPUS:85185396106
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 868
EP - 877
BT - Proceedings - 23rd IEEE International Conference on Data Mining, ICDM 2023
A2 - Chen, Guihai
A2 - Khan, Latifur
A2 - Gao, Xiaofeng
A2 - Qiu, Meikang
A2 - Pedrycz, Witold
A2 - Wu, Xindong
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 23rd IEEE International Conference on Data Mining, ICDM 2023
Y2 - 1 December 2023 through 4 December 2023
ER -