Agile auto scaling for supporting large scale cloud service platform

  • Yong Yang*
  • , Xinkui Zhao
  • , Xingjian Lu
  • , Jianwei Yin
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

As most of the auto scaling algorithms have drawbacks such as latency, coarse-grained and high overhead, inspired by cache ideology, suspended virtual machines were introduced to speed the provision speed. An agile auto scaling algorithm was designed based on auto-regressive and moving average(ARMA) with two level prediction to achieve fine-grained resource allocation, and strategies such as quantile statistic, allocation of extra resources, delayed release of resources were adopted to further satisfy the quality of service (QoS). Experimental results show that auto scaling can further save cloud resources, while improving the service quality with the workload of NETEASE Cloud Reader.

Original languageEnglish
Pages (from-to)63-67+99
JournalHuazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition)
Volume41
Issue numberSUPPL.2
StatePublished - Dec 2013
Externally publishedYes

Keywords

  • Auto scaling
  • Auto-regressive and moving average (ARMA)
  • Cloud computing
  • Quality of service (QoS)
  • Two level prediction
  • Workload prediction

Fingerprint

Dive into the research topics of 'Agile auto scaling for supporting large scale cloud service platform'. Together they form a unique fingerprint.

Cite this