A framework for OLAP in column-store database: One-pass join and pushing the materialization to the end

  • Yuean Zhu*
  • , Yansong Zhang
  • , Xuan Zhou
  • , Shan Wang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In data warehouse modeled with the star schema, data are usually retrieved by performing a join operation between the fact table and dimension table(s) followed by a selection and project operation, while join operator is the most expensive operator in RDBMS. In column-store database, there are two ways to do join. The first way is early materialization join (EM join); the other way is late materialization join (LM join). In EM join, the columns involved in the query are glued together firstly, then the glued rows are sent to join operator. Whereas, in LM join, only the attributes participated in the join operator are accessed. The problem that access to inner table is out-of-order can't be ignored for LM join. Otherwise, the naïve LM join is usually slower than EM join [9]. Since the late materialization is good for memory bandwidth and CPU efficiency, the LM join attracts more attention in academic research community. The state-of-art LM joins in column-store such as radix-cluster hash join [8] in MonetDB, invisible join [10] in C-Store all try to avoid accessing table randomly. In this paper, we devised a framework for OLAP called CDDTA-MMDB where a new join algorithm called CDDTA-LWMJoin (we contract it to LWMJoin in the following) is introduced. The LWMJoin is on the basis of our prior work: CDDTA-Join [7]. We equip the CDDTA-Join with light-weight materialization (LWM) which is designed to cut down the memory access and reduce production of intermediate data structure. Experiments show that CDDTA-MMDB is efficient and can be 2x faster than MonetDB and 4x faster than invisible join in the context of data warehouse modeled with star schema.

Original languageEnglish
Title of host publicationWeb Technologies and Applications - 15th Asia-Pacific Web Conference, APWeb 2013, Proceedings
Pages646-653
Number of pages8
DOIs
StatePublished - 2013
Externally publishedYes
Event15th Asia-Pacific Web Conference on Web Technologies and Applications, APWeb 2013 - Sydney, NSW, Australia
Duration: 4 Apr 20136 Apr 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7808 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th Asia-Pacific Web Conference on Web Technologies and Applications, APWeb 2013
Country/TerritoryAustralia
CitySydney, NSW
Period4/04/136/04/13

Keywords

  • OLAP
  • in-memory column-store database
  • join
  • materialization

Fingerprint

Dive into the research topics of 'A framework for OLAP in column-store database: One-pass join and pushing the materialization to the end'. Together they form a unique fingerprint.

Cite this