NFAQP: Normalizing Flow Based Approximate Query Processing

  • Libin Cen
  • , Jingdong Li
  • , Wenjing Yue
  • , Xiaoling Wang*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the unprecedented rate at which data is being generated, Approximate Query Processing (AQP) techniques are widely demanded in various areas. Recently, machine learning techniques have made remarkable progress in this field. However, data with large domain sizes still cannot be handled efficiently by existing approach. Besides, the accuracy of the estimate is easily affected by the number of predicates, which may lead to erroneous decisions for users in complex scenarios. In this paper, we propose NFAQP, a novel AQP approach that leverages normalizing flow to efficiently model the data distribution and estimate the aggregation function by multidimensional Monte Carlo integration. Our model is highly lightweight - often just a few dozen of KB - and is unaffected by large domains. More importantly, even under queries with a large number of predicates, NFAQP still achieves relatively low approximation errors. Extensive experiments conducted on three real-world datasets demonstrate that NFAQP outperforms baseline approaches in terms of accuracy and model size, while maintaining relatively low latency.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications - 19th International Conference, ADMA 2023, Proceedings
EditorsXiaochun Yang, Bin Wang, Heru Suhartanto, Guoren Wang, Jing Jiang, Bing Li, Huaijie Zhu, Ningning Cui
PublisherSpringer Science and Business Media Deutschland GmbH
Pages45-60
Number of pages16
ISBN (Print)9783031466762
DOIs
StatePublished - 2023
Event19th International Conference on Advanced Data Mining and Applications, ADMA 2023 - Shenyang, China
Duration: 21 Aug 202323 Aug 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14180 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th International Conference on Advanced Data Mining and Applications, ADMA 2023
Country/TerritoryChina
CityShenyang
Period21/08/2323/08/23

Keywords

  • Aggregate function
  • Approximate Query Processing
  • Normalizing Flow

Fingerprint

Dive into the research topics of 'NFAQP: Normalizing Flow Based Approximate Query Processing'. Together they form a unique fingerprint.

Cite this