跳到主要导航 跳到搜索 跳到主要内容

Optimizing Graph Partition by Optimal Vertex-Cut: A Holistic Approach

  • Wenwen Qu*
  • , Weixi Zhang
  • , Ji Cheng
  • , Chaorui Zhang
  • , Wei Han
  • , Bo Bai
  • , Chen Jason Zhang
  • , Liang He*
  • , Xiaoling Wang*
  • *此作品的通讯作者
  • East China Normal University
  • Huawei Technologies Co., Ltd.
  • Hong Kong Polytechnic University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Graph partitioning is crucial in distributed graph-parallel computing systems, and it is challenging for graph partitioning to optimize the communication cost and load balancing together. Existing state-of-the-art works, such as Powerlyra and TopoX, optimize the load balancing by randomly distributing the edges of high-degree vertices, which inevitably brings a high communication cost that is unbounded. This paper proposes a graph partition model that can minimize communication cost while maximizing load balancing. More specifically, we model the graph partition as the combinatorial design problem. Our proposed model can provide high-quality partition that guarantees that the computing load can be evenly distributed to each worker and minimizes the communication cost with a near-optimal theoretical boundary.Based on the proposed model, we extend the hybrid-cut partitioning algorithm for the power-law graph and propose HCPD, a hybrid-cut partitioning algorithm based on combinatorial design. HCPD uses the proposed model to optimize the load balancing and communication cost simultaneously for high-degree vertices, and assigns the high-degree vertices and their low-degree neighbors to the same workers by label propagation to reduce the overall communication cost. In this way, we partition the low-degree and high-degree vertices holistically and further improve the partition quality, unlike Powerlyra and TopoX, which deal with the two parts independently. Our experiments show that HCPD outperforms Powerlyra on PageRank task by up to 2× faster on real-world power-law graphs with billions of edges.

源语言英语
主期刊名Proceedings - 2023 IEEE 39th International Conference on Data Engineering, ICDE 2023
出版商IEEE Computer Society
1019-1031
页数13
ISBN(电子版)9798350322279
DOI
出版状态已出版 - 2023
活动39th IEEE International Conference on Data Engineering, ICDE 2023 - Anaheim, 美国
期限: 3 4月 20237 4月 2023

出版系列

姓名Proceedings - International Conference on Data Engineering
2023-April
ISSN(印刷版)1084-4627

会议

会议39th IEEE International Conference on Data Engineering, ICDE 2023
国家/地区美国
Anaheim
时期3/04/237/04/23

指纹

探究 'Optimizing Graph Partition by Optimal Vertex-Cut: A Holistic Approach' 的科研主题。它们共同构成独一无二的指纹。

引用此