Sliding-window top-k queries on uncertain streams

Cheqing Jin, Ke Yi, Lei Chen, Jeffrey Xu Yu, Xuemin Lin

Research output: Contribution to journalArticlepeer-review

111 Scopus citations

Abstract

Query processing on uncertain data streams has attracted a lot of attentions lately, due to the imprecise nature in the data generated from a variety of streaming applications, such as readings from a sensor network. However, all of the existing works on uncertain data streams study unbounded streams. This paper takes the first step towards the important and challenging problem of answering sliding-window queries on uncertain data streams, with a focus on arguably one of the most important types of queries-top-k queries. The challenge of answering sliding-window top-k queries on uncertain data streams stems from the strict space and time requirements of processing both arriving and expiring tuples in high-speed streams, combined with the difficulty of coping with the exponential blowup in the number of possible worlds induced by the uncertain data model. In this paper, we design a unified framework for processing sliding-window top-k queries on uncertain streams. We show that all the existing top-k definitions in the literature can be plugged into our framework, resulting in several succinct synopses that use space much smaller than the window size, while are also highly efficient in terms of processing time. In addition to the theoretical space and time bounds that we prove for these synopses, we also present a thorough experimental report to verify their practical efficiency on both synthetic and real data.

Original languageEnglish
Pages (from-to)301-312
Number of pages12
JournalProceedings of the VLDB Endowment
Volume1
Issue number1
DOIs
StatePublished - 2008
Externally publishedYes

Fingerprint

Dive into the research topics of 'Sliding-window top-k queries on uncertain streams'. Together they form a unique fingerprint.

Cite this