跳到主要导航 跳到搜索 跳到主要内容

One-size-fits-all OLAP technique for big data analysis

  • Yan Song Zhang*
  • , Min Jiao
  • , Zhan Wei Wang
  • , Shan Wang
  • , Xuan Zhou
  • *此作品的通讯作者
  • Renmin University of China
  • School of Information

科研成果: 期刊稿件文章同行评审

摘要

The traditional OLAP is pushed into large scale analysis era by rapidly expending big data volume. The major features are high storage density, heavy workload, large scale storage and processing capacity. Both traditional parallel database and the hot topic MapReduce technique have to face the critical issues of performance and parallel processing efficiency of big data analytical processing in large scale parallel processing framework. The performance of star schema based OLAP with star-join is limited by processing complexity and network transmission cost in parallel processing. This paper makes a deep analysis of features of storage model and workload of OLAP, proposes the optimization mechanisms and implementation technologies for the most fundamental SPJGA-OLAP subset in storage, processing, distribution, network transmission, and distributed buffering. The technical feasibility is evaluated with the commonly accepted TPC-H industrial benchmark and SSB academic benchmark. This paper proposes the predicate-vector DDTA-JOIN centric parallel OLAP framework, replacing the diverse join execution plans with normalized predicate-vector processing, and enables one-size-fits-all OLAP model for both central processing and large scale parallel processing by making advantage of nowadays hardware, minimizing network transmission cost and processing cost. The analysis of the storage cost and network transmission cost for distribution mechanism with datasets of 1TB and 100TB is given. The technical feasibility and parallel processing efficiency are verified by OLAP cost model analysis and real data experiments.

源语言英语
页(从-至)1936-1946
页数11
期刊Jisuanji Xuebao/Chinese Journal of Computers
34
10
DOI
出版状态已出版 - 10月 2011
已对外发布

指纹

探究 'One-size-fits-all OLAP technique for big data analysis' 的科研主题。它们共同构成独一无二的指纹。

引用此