DeepMRA: An Efficient Microservices Resource Allocation Framework with Deep Reinforcement Learning in the Cloud

  • Qi Si
  • , Jilin Shi
  • , Weiyi Li
  • , Xuesong Lu
  • , Peng Pu*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

The rapid growth of cloud computing has precipitated a paradigm shift in application service deployment, transitioning predominantly from monolithic to microservices architectures. This shift to microservices brings new, complex challenges in managing cloud resources. Traditional cloud resource allocation methods struggle with microservices’ unique challenges, like complex inter-service dependencies and the need to balance Quality of Service (QoS) with cost efficiency. Recognizing these challenges in cloud resource management, our study proposes an innovative approach to dynamically allocate resources for cloud microservices with Deep Reinforcement Learning (DRL). Specifically, we introduce DeepMRA, an efficient microservices resource allocation framework with Deep Reinforcement Learning in the Cloud, with multiple agents navigating the complexities arising from varying workloads. We propose a performance predictor to forecast application performance, guiding the training of agents in DRL. Due to the shortcomings of traditional performance data collection methods in the context of microservices, we developed the Parallel and Asynchronous Uncertainty-Directed Sampling (PAUDS) algorithm. This algorithm is specifically designed to optimize data collection processes, ensuring a robust dataset for building a reliable performance predictor. Extensive experiments conducted with microservice-based applications indicate that the proposed method reduces resource consumption while upholding QoS requirements under varying workloads.

Original languageEnglish
Title of host publicationAdvanced Intelligent Computing Technology and Applications - 20th International Conference, ICIC 2024, Proceedings
EditorsDe-Shuang Huang, Yijie Pan, Xiankun Zhang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages455-466
Number of pages12
ISBN (Print)9789819755806
DOIs
StatePublished - 2024
Event20th International Conference on Intelligent Computing, ICIC 2024 - Tianjin, China
Duration: 5 Aug 20248 Aug 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14863 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference20th International Conference on Intelligent Computing, ICIC 2024
Country/TerritoryChina
CityTianjin
Period5/08/248/08/24

Keywords

  • Cloud Computing
  • Deep Reinforcement Learning
  • Microservice
  • Resource Allocation

Fingerprint

Dive into the research topics of 'DeepMRA: An Efficient Microservices Resource Allocation Framework with Deep Reinforcement Learning in the Cloud'. Together they form a unique fingerprint.

Cite this