Recording how-provenance on probabilistic databases

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Tracking data provenance (or lineage) has become increasingly important in many large-scale applications, and a few methods have been proposed to record data provenance recently. However, most of previous works mainly focus on deterministic databases except Trio style lineage that aims at probabilistic databases, which is much more challenging because of the exponential growth of possible world instances and dependence among intermediate tuples. This paper proposes an approach, named PHP-tree, to model how-provenance upon probabilistic databases. we also show how to evaluate probability based on a PHP-tree. Compared with Trio style lineage, our approach is independent of intermediate results and can calculate the probability both cases of restricted and complete propagation of data provenance. Detailed experimental results show the effectiveness, efficiency and scalability of our proposed model.

Original languageEnglish
Title of host publicationAdvances in Web Technologies and Applications - Proceedings of the 12th Asia-Pacific Web Conference, APWeb 2010
Pages205-211
Number of pages7
DOIs
StatePublished - 2010
Event12th International Asia Pacific Web Conference, APWeb 2010 - Busan, Korea, Republic of
Duration: 6 Apr 20108 Apr 2010

Publication series

NameAdvances in Web Technologies and Applications - Proceedings of the 12th Asia-Pacific Web Conference, APWeb 2010

Conference

Conference12th International Asia Pacific Web Conference, APWeb 2010
Country/TerritoryKorea, Republic of
CityBusan
Period6/04/108/04/10

Fingerprint

Dive into the research topics of 'Recording how-provenance on probabilistic databases'. Together they form a unique fingerprint.

Cite this