TY - GEN
T1 - Bloom filter-based XML packets filtering for millions of path queries
AU - Gong, Xueqing
AU - Qian, Weining
AU - Yan, Ying
AU - Zhou, Aoying
PY - 2005
Y1 - 2005
AB - The filtering of XML data is the basis of many complex applications, and many algorithms have been proposed to solve this problem [2, 3, 5, 6, 7, 8, 9, 11, 12, 13, 18]. One important challenge is that the number of path queries is huge, so an efficient data structure is needed to represent them. Another challenge is that these path queries usually vary over time; how they are maintained determines the flexibility and capacity of a filtering system. In this paper, we introduce a novel approximate method for XML data filtering that uses Bloom filters to represent path queries. With this method, millions of path queries can be stored efficiently, and changes to these path queries are easy to handle. To improve filtering performance, we introduce a new data structure, Prefix Filters, to decrease the number of candidate paths. Experiments show that our Bloom filter-based method takes less time to build the routing table than an automaton-based method, and that it performs well, with an acceptable false positive rate, when filtering XML packets of relatively small depth against millions of path queries.
UR - https://www.scopus.com/pages/publications/28444484604
U2 - 10.1109/ICDE.2005.26
DO - 10.1109/ICDE.2005.26
M3 - Conference contribution
AN - SCOPUS:28444484604
SN - 0769522858
T3 - Proceedings - International Conference on Data Engineering
SP - 890
EP - 901
BT - Proceedings - 21st International Conference on Data Engineering, ICDE 2005
T2 - 21st International Conference on Data Engineering, ICDE 2005
Y2 - 5 April 2005 through 8 April 2005
ER -