Discovering the most influential sites over uncertain data: A rank-based approach

  • Kai Zheng*
  • , Zi Huang
  • , Aoying Zhou
  • , Xiaofang Zhou
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

With the rapidly increasing availability of uncertain data in many important applications such as location-based services, sensor monitoring, and biological information management systems, uncertainty-aware query processing has received a significant amount of research effort from the database community in recent years. In this paper, we investigate a new type of query in the context of uncertain databases, namely uncertain top-k influential sites query ({\rm UT}k{\rm IS} query for short), which can be applied in a wide range of application areas such as marketing analysis and mobile services. Since it is not so straightforward to precisely define the semantics of {\rm top}k query with uncertain data, in this paper we introduce a novel and more intuitive formulation of the query on the basis of expected rank semantics. To address the efficiency issue caused by possible worlds exploration, we propose effective pruning rules and a divide-and-conquer paradigm such that the number of candidates as well as the number of possible worlds to be considered can be significantly reduced. Finally, we conduct extensive experiments on real data sets to verify the effectiveness and efficiency of the new methods proposed in this paper.

Original languageEnglish
Article number5871623
Pages (from-to)2156-2169
Number of pages14
JournalIEEE Transactions on Knowledge and Data Engineering
Volume24
Issue number12
DOIs
StatePublished - 2012

Keywords

  • Uncertain data
  • reverse nearest neighbor query
  • top-k query

Fingerprint

Dive into the research topics of 'Discovering the most influential sites over uncertain data: A rank-based approach'. Together they form a unique fingerprint.

Cite this