跳到主要导航 跳到搜索 跳到主要内容

Stage delay scheduling: Speeding up DAG-style data analytics jobs with resource interleaving

  • Wujie Shao
  • , Fei Xu*
  • , Li Chen
  • , Haoyue Zheng
  • , Fangming Liu
  • *此作品的通讯作者
  • East China Normal University
  • University of Louisiana at Lafayette
  • Huazhong University of Science and Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

To increase the resource utilization of datacenters, big data analytics jobs are commonly running stages in parallel which are organized into and scheduled according to the Directed Acyclic Graph (DAG). Through an in-depth analysis of the latest Alibaba cluster trace and our motivation experiments on Amazon EC2, however, we show that the CPU and network resources are still under-utilized due to the unwise stage scheduling, thereby prolonging the completion time of a DAG-style job (e.g., Spark). While existing works on reducing the job completion time focus on either task scheduling or job scheduling, stage scheduling has received comparably little attention. In this paper, we design and implement DelayStage, a simple yet effective stage delay scheduling strategy to interleave the cluster resources across the parallel stages, so as to increase the cluster resource utilization and speed up the job performance. With the aim of minimizing the makespan of parallel stages, DelayStage judiciously arranges the execution of stages in a pipelined manner to maximize the performance benefits of resource interleaving. Extensive prototype experiments on 30 Amazon EC2 instances and complementary trace-driven simulations show that DelayStage can improve the cluster resource utilization by up to 81.8% and reduce the job completion time by up to 41.3%, in comparison to the stock Spark and the state-of-the-art stage scheduling strategies, yet with acceptable runtime overhead.

源语言英语
主期刊名Proceedings of the 48th International Conference on Parallel Processing, ICPP 2019
出版商Association for Computing Machinery
ISBN(电子版)9781450362955
DOI
出版状态已出版 - 5 8月 2019
活动48th International Conference on Parallel Processing, ICPP 2019 - Kyoto, 日本
期限: 5 8月 20198 8月 2019

出版系列

姓名ACM International Conference Proceeding Series

会议

会议48th International Conference on Parallel Processing, ICPP 2019
国家/地区日本
Kyoto
时期5/08/198/08/19

指纹

探究 'Stage delay scheduling: Speeding up DAG-style data analytics jobs with resource interleaving' 的科研主题。它们共同构成独一无二的指纹。

引用此