Boosting mapreduce with network-aware task assignment

  • Fei Xu
  • , Fangming Liu*
  • , Dekang Zhu
  • , Hai Jin
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Running MapReduce in a shared cluster has become a recent trend to process large-scale data analytical applications while improving the cluster utilization. However, the network sharing among various applications can make the network bandwidth for MapReduce applications constrained and heterogeneous. This further increases the severity of network hotspots in racks, and makes existing task assignment policies which focus on the data locality no longer effective. To deal with this issue, this paper develops a model to analyze the relationship between job completion time and the assignment of both map and reduce tasks across racks. We further design a network-aware task assignment strategy to shorten the completion time of MapReduce jobs in shared clusters. It integrates two simple yet effective greedy heuristics that minimize the completion time of map phase and reduce phase, respectively. With large-scale simulations driven by Facebook job traces, we demonstrate that the network-aware strategy can shorten the average completion time of MapReduce jobs, as compared to the state-of-the-art task assignment strategies, yet with an acceptable computational overhead.

Original languageEnglish
Title of host publicationCloud Computing - 4th International Conference, CloudComp 2013, Revised Selected Papers
EditorsMin Chen, Victor C.M. Leung
PublisherSpringer Verlag
Pages79-89
Number of pages11
ISBN (Print)9783319055053
DOIs
StatePublished - 2014
Externally publishedYes
Event4th International Conference on Cloud Computing, CloudComp 2013 - Wuhan, China
Duration: 17 Oct 201319 Oct 2013

Publication series

NameLecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST
Volume133
ISSN (Print)1867-8211

Conference

Conference4th International Conference on Cloud Computing, CloudComp 2013
Country/TerritoryChina
CityWuhan
Period17/10/1319/10/13

Keywords

  • MapReduce
  • Network hotspots
  • Task assignment

Fingerprint

Dive into the research topics of 'Boosting mapreduce with network-aware task assignment'. Together they form a unique fingerprint.

Cite this