跳到主要导航 跳到搜索 跳到主要内容

Towards Efficient Workflow Scheduling Over Yarn Cluster Using Deep Reinforcement Learning

  • Jianguo Xue
  • , Ting Wang*
  • , Puyu Cai
  • *此作品的通讯作者
  • MoE Engineering Research Center of Hardware/Software Co-Design Technology and Application
  • Michigan State University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Hadoop Yarn is an open-source cluster manager responsible for resource management and job scheduling. However, data-driven applications are typically organized into workflows that consist of a series of jobs with dependencies. Yarn does not manage users' workflows and only considers the current job rather than the entire workflow when scheduling. In practice, multiple workflows share the same Yarn cluster and are pre-assigned separate Yarn resource queues to avoid mutual interference. However, this coarse-grained resource division can sometimes result in low resource utilization and increased pending time of jobs on the Yarn queue. For instance, one resource queue may have exhausted its quota while still having pending jobs, while other queues may have available resources but cannot begin executing any jobs due to unfulfilled data dependencies. To address this problem, we propose a deep reinforcement learning-based workflow scheduling scheme that takes into account job dependencies, job priorities, and dynamic resource usage. The proposed approach can intelligently identify and utilize free windows of different resource queues. Our simulation results demonstrate that the proposed DRL-based workflow scheduling scheme can significantly reduce the average job latency compared to existing approaches.

源语言英语
主期刊名GLOBECOM 2023 - 2023 IEEE Global Communications Conference
出版商Institute of Electrical and Electronics Engineers Inc.
473-478
页数6
ISBN(电子版)9798350310900
DOI
出版状态已出版 - 2023
活动2023 IEEE Global Communications Conference, GLOBECOM 2023 - Kuala Lumpur, 马来西亚
期限: 4 12月 20238 12月 2023

出版系列

姓名Proceedings - IEEE Global Communications Conference, GLOBECOM
ISSN(印刷版)2334-0983
ISSN(电子版)2576-6813

会议

会议2023 IEEE Global Communications Conference, GLOBECOM 2023
国家/地区马来西亚
Kuala Lumpur
时期4/12/238/12/23

指纹

探究 'Towards Efficient Workflow Scheduling Over Yarn Cluster Using Deep Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此