Apara: Workload-aware data partition and replication for parallel databases

Xiaolei Zhang, Chunxi Zhang, Yuming Li, Rong Zhang, Aoying Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Data partition and replication mechanisms directly determine query execution patterns in parallel database systems, which have a great impact on system performance. Recently, there have been some workload-aware data storage techniques, but they suffer from problems of narrow support to complex workloads or large requirements for storage. In order to enable the support for complex analytical workloads over massive distributed database systems, we design and implement a workload-aware data partition and replication tool, called Apara. We design two heuristic algorithms and define two cost models for effective data partition calculation and efficient replication usages. We run a set of experiments to compare and demonstrate the performance between Apara and the other representative work. The results show that Apara consistently outperforms the primary solutions on TPC-H workloads.

Original languageEnglish
Title of host publicationWeb and Big Data - 3rd International Joint Conference, APWeb-WAIM 2019, Proceedings
EditorsJie Shao, Man Lung Yiu, Masashi Toyoda, Dongxiang Zhang, Wei Wang, Bin Cui
PublisherSpringer Verlag
Pages191-206
Number of pages16
ISBN (Print)9783030260743
DOIs
StatePublished - 2019
Event3rd APWeb and WAIM Joint Conference on Web and Big Data, APWeb-WAIM 2019 - Chengdu, China
Duration: 1 Aug 20193 Aug 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11642 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd APWeb and WAIM Joint Conference on Web and Big Data, APWeb-WAIM 2019
Country/TerritoryChina
CityChengdu
Period1/08/193/08/19

Keywords

  • Distributed database
  • Partition
  • Replication
  • Workload-aware storage

Fingerprint

Dive into the research topics of 'Apara: Workload-aware data partition and replication for parallel databases'. Together they form a unique fingerprint.

Cite this