TY - CHAP
T1 - UD(k,l)-index
T2 - An efficient approximate index for XML data
AU - Wu, Hongwei
AU - Wang, Qing
AU - Yu, Jeffrey Xu
AU - Zhou, Aoying
AU - Zhou, Shuigeng
PY - 2003
Y1 - 2003
N2 - XML has become the main standard of data presentation and exchange on the Internet. Processing path expressions plays a key role in XML queries evaluation. Path indices can speed up path expressions evaluation on XML data by restricting search only to the relevant portion. However, to answer all path expressions accurately, traditional path indices group data nodes according to the paths from the root of the data graph to the nodes in question, regardless of the paths fanning out from these nodes. This leads to large indices size and low efficiency of branching path expressions evaluation. In this paper, we present UD(k,l)-indices, a family of efficient approximate index structures in which data nodes are grouped according to their incoming paths of length up to fc and outgoing paths of length up to l. UD(k,l)-indices fully exploit local similarity of XML data nodes on their upward and downward paths, so can be used for efficiently evaluating path expressions, especially branching path expressions. For small values of k and l, UD(k, l)-index is approximate, we use validation-based approach to find exact answers to the path expressions. Experiments show that with proper values of k and l, UD(k, l)-index can improve the performance of path expressions evaluation significantly with low space overhead.
AB - XML has become the main standard of data presentation and exchange on the Internet. Processing path expressions plays a key role in XML queries evaluation. Path indices can speed up path expressions evaluation on XML data by restricting search only to the relevant portion. However, to answer all path expressions accurately, traditional path indices group data nodes according to the paths from the root of the data graph to the nodes in question, regardless of the paths fanning out from these nodes. This leads to large indices size and low efficiency of branching path expressions evaluation. In this paper, we present UD(k,l)-indices, a family of efficient approximate index structures in which data nodes are grouped according to their incoming paths of length up to fc and outgoing paths of length up to l. UD(k,l)-indices fully exploit local similarity of XML data nodes on their upward and downward paths, so can be used for efficiently evaluating path expressions, especially branching path expressions. For small values of k and l, UD(k, l)-index is approximate, we use validation-based approach to find exact answers to the path expressions. Experiments show that with proper values of k and l, UD(k, l)-index can improve the performance of path expressions evaluation significantly with low space overhead.
UR - https://www.scopus.com/pages/publications/35248843693
U2 - 10.1007/978-3-540-45160-0_8
DO - 10.1007/978-3-540-45160-0_8
M3 - 章节
AN - SCOPUS:35248843693
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 68
EP - 79
BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
A2 - Dong, Guozhu
A2 - Changjie, Tang
A2 - Wang, Wei
PB - Springer Verlag
ER -