An adaptive auto-configuration tool for hadoop

Changlong Li, Hang Zhuang, Kun Lu, Mingming Sun, Jinhong Zhou, Dong Dai, Xuehai Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

18 Scopus citations

Abstract

With the coming concept of 'big data', the ability to handle large datasets has become a critical consideration for the success of industrial organizations such as Google, Amazon, Yahoo! and Facebook. As an important Cloud Computing framework for bulk data processing, Hadoop is widely used in these organizations. However, the performance of MapReduce is seriously limited by its stiff configuration strategy. Even for a single simple job in Hadoop, a large number of tuning parameters have to be set by users. This may easily lead to performance loss due to some misconfigurations. In this paper, we present an adaptive automatic configuration tool (AACT) for Hadoop to achieve performance optimization. To achieve this goal, we propose a mathematical model which will accurately learn the relationship between system performance and configuration parameters, then configure Hadoop system based on this mathematical model. With the help of AACT, Hadoop is able to adapt the hardware and software configurations dynamically and drive the system to an optimal configuration in acceptable time. Experimental results show its efficiency and adaptability, and that it is ten times faster compared with default configuration.

Original languageEnglish
Title of host publication2014 19th International Conference on Engineering of Complex Computer Systems, ICECCS 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages69-72
Number of pages4
ISBN (Electronic)9781479954827
DOIs
StatePublished - 13 Oct 2014
Externally publishedYes
Event19th International Conference on Engineering of Complex Computer Systems, ICECCS 2014 - Tianjin, China
Duration: 4 Aug 20147 Aug 2014

Publication series

NameProceedings of the IEEE International Conference on Engineering of Complex Computer Systems, ICECCS
ISSN (Print)2770-8527
ISSN (Electronic)2770-8535

Conference

Conference19th International Conference on Engineering of Complex Computer Systems, ICECCS 2014
Country/TerritoryChina
CityTianjin
Period4/08/147/08/14

Keywords

  • Auto-Configuration
  • Hadoop
  • Self-Learning

Fingerprint

Dive into the research topics of 'An adaptive auto-configuration tool for hadoop'. Together they form a unique fingerprint.

Cite this