Optimizing window aggregate functions via random sampling

  • Guangxuan Song
  • , Wenwen Qu
  • , Yilin Wang
  • , Xiaoling Wang*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Window functions have been a part of the SQL standard since 2003 and have been well studied during the past decade. As the demand increases in analytics tools, window functions have seen an increasing amount of potential applications. Although the current mainstream commercial databases support window functions, the existing implementation strategies are inefficient for the real-time processing of big data. Recently, some algorithms based on sampling (e.g., online aggregation) have been proposed to deal with large and complex data in relational databases, which offer us a flexible tradeoff between accuracy and efficiency. However, sampling techniques have not been considered for window functions in databases. In this paper, we first propose two algorithms to deal with window functions based on two sampling techniques, Naive Random Sampling and Incremental Random Sampling. The proposed algorithms are highly efficient and are general enough to aggregate other existing algorithms of window functions. In particular, we evaluated our algorithms in the latest version of PostgreSQL, which demonstrated superior performance over the TPC-H benchmark.

Original languageEnglish
Title of host publicationWeb and Big Data - 1st International Joint Conference, APWeb-WAIM 2017, Proceedings
EditorsChristian S. Jensen, Xiang Lian, Lei Chen, Cyrus Shahabi, Xiaochun Yang
PublisherSpringer Verlag
Pages229-244
Number of pages16
ISBN (Print)9783319635637
DOIs
StatePublished - 2017
Event1st Asia-Pacific Web and Web-Age Information Management Joint Conference on Web and Big Data, APWeb-WAIM 2017 - Beijing, China
Duration: 7 Jul 20179 Jul 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10367 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st Asia-Pacific Web and Web-Age Information Management Joint Conference on Web and Big Data, APWeb-WAIM 2017
Country/TerritoryChina
CityBeijing
Period7/07/179/07/17

Keywords

  • Query optimization
  • Sample
  • Window function

Fingerprint

Dive into the research topics of 'Optimizing window aggregate functions via random sampling'. Together they form a unique fingerprint.

Cite this