TY - GEN
T1 - Recording how-provenance on probabilistic databases
AU - Gao, Ming
AU - He, Xiangnan
AU - Jin, Cheqing
AU - Wang, Xiaoling
AU - Zhou, Aoying
PY - 2010
Y1 - 2010
N2 - Tracking data provenance (or lineage) has become increasingly important in many large-scale applications, and several methods have recently been proposed to record data provenance. However, most previous work focuses on deterministic databases; the exception is Trio-style lineage, which targets probabilistic databases, a much more challenging setting because of the exponential growth of possible-world instances and the dependence among intermediate tuples. This paper proposes an approach, named PHP-tree, to model how-provenance on probabilistic databases. We also show how to evaluate probability based on a PHP-tree. Compared with Trio-style lineage, our approach is independent of intermediate results and can calculate the probability under both restricted and complete propagation of data provenance. Detailed experimental results show the effectiveness, efficiency, and scalability of the proposed model.
AB - Tracking data provenance (or lineage) has become increasingly important in many large-scale applications, and several methods have recently been proposed to record data provenance. However, most previous work focuses on deterministic databases; the exception is Trio-style lineage, which targets probabilistic databases, a much more challenging setting because of the exponential growth of possible-world instances and the dependence among intermediate tuples. This paper proposes an approach, named PHP-tree, to model how-provenance on probabilistic databases. We also show how to evaluate probability based on a PHP-tree. Compared with Trio-style lineage, our approach is independent of intermediate results and can calculate the probability under both restricted and complete propagation of data provenance. Detailed experimental results show the effectiveness, efficiency, and scalability of the proposed model.
UR - https://www.scopus.com/pages/publications/77954300673
U2 - 10.1109/APWeb.2010.19
DO - 10.1109/APWeb.2010.19
M3 - Conference contribution
AN - SCOPUS:77954300673
SN - 9780769540122
T3 - Advances in Web Technologies and Applications - Proceedings of the 12th Asia-Pacific Web Conference, APWeb 2010
SP - 205
EP - 211
BT - Advances in Web Technologies and Applications - Proceedings of the 12th Asia-Pacific Web Conference, APWeb 2010
T2 - 12th International Asia Pacific Web Conference, APWeb 2010
Y2 - 6 April 2010 through 8 April 2010
ER -