TY - GEN
T1 - Efficient evaluation of distance predicates in XPath full-text query
AU - Chen, Hong
AU - Wang, Xiaoling
AU - Zhou, Aoying
PY - 2006
Y1 - 2006
N2 - In recent years, a growing number of XML repositories have emerged, e.g., XML digital libraries and the SIGMOD and DBLP document collections. Because XML can represent both structured and unstructured data, making this information usable requires support for both structure-based and content-based (full-text) queries over XML repositories. With the existing XPath/XQuery Full-Text, users can search based on cardinality, proximity, or distance predicates. In this paper, we propose an efficient approach to Information Retrieval (IR) style search on XML documents, in particular search with distance predicates. A numbering technique is employed to encode XML documents, and three algorithms are designed to evaluate queries with distance predicates. Several optimization techniques are introduced to improve performance. Extensive experiments show the effectiveness and efficiency of the proposed approach.
UR - https://www.scopus.com/pages/publications/33745666833
U2 - 10.1007/11610496_9
DO - 10.1007/11610496_9
M3 - Conference contribution
AN - SCOPUS:33745666833
SN - 3540311580
SN - 9783540311584
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 76
EP - 85
BT - Advanced Web and Network Technologies, and Applications - APWeb 2006 International Workshops
PB - Springer Verlag
T2 - APWeb 2006 International Workshops: XRA, IWSN, MEGA, and ICSE
Y2 - 16 January 2006 through 18 January 2006
ER -