跳到主要导航 跳到搜索 跳到主要内容

ISpot: Achieving predictable performance for big data analytics with cloud transient servers

  • Fei Xu
  • , Huan Jiang
  • , Haoyue Zheng
  • , Wujie Shao
  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Achieving predictable performance for big data analytics running on cloud transient servers (e.g., EC2 spot instances) is challenging, because the transient server can be revoked by the cloud and the spot price is nontrivial to predict. Undoubtedly, choosing the low-price yet unstable cloud resources can severely degrade the job performance. To tackle this issue, this paper proposes iSpot, a cost-efficient spot instance provisioning framework in the cloud, by focusing on Spark as a representative DAG (Directed Acyclic Graph)-style big analytics workload. Specifically, it identifies the availability zones with stable spot instance resources by devising an accurate LSTM (Long Short-Term Memory)-based price prediction method. iSpot further predicts the performance of Spark stages and jobs by designing a fined-grained performance model using the job profiling and the DAG information of stages. Based on the price prediction and Spark performance model, iSpot is able to provision the spot instances with the cost-efficient instance type (i.e., the instance type that achieves the minimum monetary cost), in order to deliver predictable performance for big data analytics. Extensive prototype experiments on Amazon EC2 demonstrate that iSpot can guarantee the performance of big data analytics while reducing the job budget with cloud transient servers.

源语言英语
主期刊名Proceedings - 15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017
编辑Gregorio Martinez, Richard Hill, Geoffrey Fox, Peter Mueller, Guojun Wang
出版商Institute of Electrical and Electronics Engineers Inc.
314-321
页数8
ISBN(电子版)9781538637906
DOI
出版状态已出版 - 25 5月 2018
活动15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017 - Guangzhou, 中国
期限: 12 12月 201715 12月 2017

出版系列

姓名Proceedings - 15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017

会议

会议15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017
国家/地区中国
Guangzhou
时期12/12/1715/12/17

指纹

探究 'ISpot: Achieving predictable performance for big data analytics with cloud transient servers' 的科研主题。它们共同构成独一无二的指纹。

引用此