UD(k,l)-index: An efficient approximate index for XML data

Hongwei Wu, Qing Wang, Jeffrey Xu Yu, Aoying Zhou, Shuigeng Zhou

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

7 Scopus citations

Abstract

XML has become the main standard of data presentation and exchange on the Internet. Processing path expressions plays a key role in XML queries evaluation. Path indices can speed up path expressions evaluation on XML data by restricting search only to the relevant portion. However, to answer all path expressions accurately, traditional path indices group data nodes according to the paths from the root of the data graph to the nodes in question, regardless of the paths fanning out from these nodes. This leads to large indices size and low efficiency of branching path expressions evaluation. In this paper, we present UD(k,l)-indices, a family of efficient approximate index structures in which data nodes are grouped according to their incoming paths of length up to fc and outgoing paths of length up to l. UD(k,l)-indices fully exploit local similarity of XML data nodes on their upward and downward paths, so can be used for efficiently evaluating path expressions, especially branching path expressions. For small values of k and l, UD(k, l)-index is approximate, we use validation-based approach to find exact answers to the path expressions. Experiments show that with proper values of k and l, UD(k, l)-index can improve the performance of path expressions evaluation significantly with low space overhead.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsGuozhu Dong, Tang Changjie, Wei Wang
PublisherSpringer Verlag
Pages68-79
Number of pages12
ISBN (Electronic)9783540407157
DOIs
StatePublished - 2003
Externally publishedYes

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2762
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fingerprint

Dive into the research topics of 'UD(k,l)-index: An efficient approximate index for XML data'. Together they form a unique fingerprint.

Cite this