Hybrid Checkpointing for Iterative Processing in BSP-Based Systems

  • East China Normal University
  • National University of Defense Technology
  • Anhui Polytechnic University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Distributed iterative processing arises in many application scenarios, including large-scale graph analytics and machine learning. Many systems employ the bulk synchronous parallel (BSP) model to synchronize iterations. In these BSP-based systems, the long running time of iterative processing in distributed environments makes fault tolerance crucial. Most BSP-based systems achieve fault tolerance by writing checkpoints with either a blocking or an unblocking strategy. However, the blocking strategy incurs a checkpointing overhead even in failure-free runs, whereas the unblocking strategy incurs an additional recovery cost when a failure occurs before the system has finished writing the checkpoint. Motivated by this trade-off between blocking and unblocking checkpointing, we aim to choose the checkpointing strategy each time a checkpoint is required during iterative processing, in order to reduce the overall execution time. In particular, we formulate a checkpointing choice problem: how to choose the strategy so as to minimize the execution time. The challenge is to make this choice at runtime without knowledge of future events. To address this problem, we propose hybrid checkpointing, which heuristically chooses either blocking or unblocking checkpointing based on a cost evaluation. Our experiments on Giraph, a typical BSP-based system, show that hybrid checkpointing outperforms both blocking and unblocking checkpointing.
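
The abstract does not spell out the cost model behind the heuristic, so the following is only an illustrative sketch of what a cost-based choice between the two strategies could look like. Every name and parameter here (`CheckpointCosts`, `blocking_write`, `unblocking_slowdown`, `recompute_on_loss`, `failure_probability`) is a hypothetical assumption made for illustration; none of them is taken from the paper or from Giraph.

```python
from dataclasses import dataclass


@dataclass
class CheckpointCosts:
    """Hypothetical per-checkpoint estimates, all in seconds (assumed, not from the paper)."""
    blocking_write: float       # stall time while a blocking checkpoint is written
    unblocking_slowdown: float  # slowdown caused by writing the checkpoint in the background
    recompute_on_loss: float    # extra recovery time if a failure hits before the write completes
    failure_probability: float  # estimated chance of a failure during the background write


def expected_blocking_cost(costs: CheckpointCosts) -> float:
    # A blocking checkpoint always pays the full write time up front.
    return costs.blocking_write


def expected_unblocking_cost(costs: CheckpointCosts) -> float:
    # An unblocking checkpoint pays a small slowdown, plus the extra recovery
    # cost weighted by how likely a failure is before the write finishes.
    return costs.unblocking_slowdown + costs.failure_probability * costs.recompute_on_loss


def choose_strategy(costs: CheckpointCosts) -> str:
    # Pick whichever strategy has the lower estimated cost for this checkpoint.
    if expected_blocking_cost(costs) <= expected_unblocking_cost(costs):
        return "blocking"
    return "unblocking"


# Assumed example estimates: a reliable cluster versus a failure-prone one.
reliable = CheckpointCosts(blocking_write=30.0, unblocking_slowdown=5.0,
                           recompute_on_loss=600.0, failure_probability=0.01)
flaky = CheckpointCosts(blocking_write=30.0, unblocking_slowdown=5.0,
                        recompute_on_loss=600.0, failure_probability=0.2)

print(choose_strategy(reliable))
print(choose_strategy(flaky))
```

Under these assumed numbers the sketch favors unblocking checkpointing when failures are rare and blocking checkpointing when they are likely, which mirrors the trade-off the abstract describes. A real system would have to estimate these quantities at runtime, which is exactly the difficulty the abstract points out.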

Original language: English
Title of host publication: Web Information Systems and Applications - 18th International Conference, WISA 2021, Proceedings
Editors: Chunxiao Xing, Xiaoming Fu, Yong Zhang, Guigang Zhang, Chaolemen Borjigin
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 693-705
Number of pages: 13
ISBN (Print): 9783030875701
DOI
Publication status: Published - 2021
Event: 18th International Conference on Web Information Systems and Applications, WISA 2021 - Kaifeng, China
Duration: 24 Sep 2021 to 26 Sep 2021

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 12999 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 18th International Conference on Web Information Systems and Applications, WISA 2021
Country/Territory: China
City: Kaifeng
Period: 24/09/21 to 26/09/21