跳到主要导航 跳到搜索 跳到主要内容

Cleaning uncertain data for top-k queries

  • Luyi Mo
  • , Reynold Cheng
  • , Xiang Li
  • , David W. Cheung
  • , Xuan S. Yang
  • The University of Hong Kong

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The information managed in emerging applications, such as sensor networks, location-based services, and data integration, is inherently imprecise. To handle data uncertainty, probabilistic databases have been recently developed. In this paper, we study how to quantify the ambiguity of answers returned by a probabilistic top-k query. We develop efficient algorithms to compute the quality of this query under the possible world semantics. We further address the cleaning of a probabilistic database, in order to improve top-k query quality. Cleaning involves the reduction of ambiguity associated with the database entities. For example, the uncertainty of a temperature value acquired from a sensor can be reduced, or cleaned, by requesting its newest value from the sensor. While this "cleaning operation" may produce a better query result, it may involve a cost and fail. We investigate the problem of selecting entities to be cleaned under a limited budget. Particularly, we propose an optimal solution and several heuristics. Experiments show that the greedy algorithm is efficient and close to optimal.

源语言英语
主期刊名ICDE 2013 - 29th International Conference on Data Engineering
134-145
页数12
DOI
出版状态已出版 - 2013
已对外发布
活动29th International Conference on Data Engineering, ICDE 2013 - Brisbane, QLD, 澳大利亚
期限: 8 4月 201311 4月 2013

出版系列

姓名Proceedings - International Conference on Data Engineering
ISSN(印刷版)1084-4627

会议

会议29th International Conference on Data Engineering, ICDE 2013
国家/地区澳大利亚
Brisbane, QLD
时期8/04/1311/04/13

指纹

探究 'Cleaning uncertain data for top-k queries' 的科研主题。它们共同构成独一无二的指纹。

引用此