Skip to main navigation Skip to search Skip to main content

ISpot: Achieving predictable performance for big data analytics with cloud transient servers

  • Fei Xu
  • , Huan Jiang
  • , Haoyue Zheng
  • , Wujie Shao
  • East China Normal University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Achieving predictable performance for big data analytics running on cloud transient servers (e.g., EC2 spot instances) is challenging, because the transient server can be revoked by the cloud and the spot price is nontrivial to predict. Undoubtedly, choosing the low-price yet unstable cloud resources can severely degrade the job performance. To tackle this issue, this paper proposes iSpot, a cost-efficient spot instance provisioning framework in the cloud, by focusing on Spark as a representative DAG (Directed Acyclic Graph)-style big analytics workload. Specifically, it identifies the availability zones with stable spot instance resources by devising an accurate LSTM (Long Short-Term Memory)-based price prediction method. iSpot further predicts the performance of Spark stages and jobs by designing a fined-grained performance model using the job profiling and the DAG information of stages. Based on the price prediction and Spark performance model, iSpot is able to provision the spot instances with the cost-efficient instance type (i.e., the instance type that achieves the minimum monetary cost), in order to deliver predictable performance for big data analytics. Extensive prototype experiments on Amazon EC2 demonstrate that iSpot can guarantee the performance of big data analytics while reducing the job budget with cloud transient servers.

Original languageEnglish
Title of host publicationProceedings - 15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017
EditorsGregorio Martinez, Richard Hill, Geoffrey Fox, Peter Mueller, Guojun Wang
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages314-321
Number of pages8
ISBN (Electronic)9781538637906
DOIs
StatePublished - 25 May 2018
Event15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017 - Guangzhou, China
Duration: 12 Dec 201715 Dec 2017

Publication series

NameProceedings - 15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017

Conference

Conference15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017
Country/TerritoryChina
CityGuangzhou
Period12/12/1715/12/17

Keywords

  • Big data analytics
  • Cloud computing
  • Cloud transient servers
  • Predictable performance
  • Spot instance provisioning

Fingerprint

Dive into the research topics of 'ISpot: Achieving predictable performance for big data analytics with cloud transient servers'. Together they form a unique fingerprint.

Cite this