TY - GEN
T1 - Efficient structural query evaluation over social data
AU - Wang, Chaoyong
AU - Gong, Xueqing
AU - Wang, Xiaoling
PY - 2012
Y1 - 2012
N2 - With the rapid increase of social media, more and more users generate data on social application platforms, such as facebook, twitter and Sina Weibo(weibo.com). Current platforms, however, only provides keyword-based search function on social data, which is far from enough to satisfy users' query requirement in the view point of both structure and content aspects. The traditional structural join algorithms, which obtain results by matching both structure and content, do not work very well for social data. The challenges include (1) the size of social data is huge, (2) the online social applications require real time response. It is necessary to study the structural query on social data in order to meet the above requirements. This paper proposes the Post Dewey, a new numbering schema which is the structural summation of an element tag to reduce search space. A novel structural join algorithm, Post Structure Join (PSJ), was presented to address the limitation of the stack based algorithms, as a supplement strategy for structural joins. PSJ improves the overall performance by reducing the input size at the cost of losing some join efficiency. The approach is validated on real dataset crawled and extracted from Sina Weibo. The experimental results demonstrate the effectiveness of PSJ by comparing with the state-of-the-art structural join algorithms.
AB - With the rapid increase of social media, more and more users generate data on social application platforms, such as facebook, twitter and Sina Weibo(weibo.com). Current platforms, however, only provides keyword-based search function on social data, which is far from enough to satisfy users' query requirement in the view point of both structure and content aspects. The traditional structural join algorithms, which obtain results by matching both structure and content, do not work very well for social data. The challenges include (1) the size of social data is huge, (2) the online social applications require real time response. It is necessary to study the structural query on social data in order to meet the above requirements. This paper proposes the Post Dewey, a new numbering schema which is the structural summation of an element tag to reduce search space. A novel structural join algorithm, Post Structure Join (PSJ), was presented to address the limitation of the stack based algorithms, as a supplement strategy for structural joins. PSJ improves the overall performance by reducing the input size at the cost of losing some join efficiency. The approach is validated on real dataset crawled and extracted from Sina Weibo. The experimental results demonstrate the effectiveness of PSJ by comparing with the state-of-the-art structural join algorithms.
KW - Post Structure Join
KW - Social Network
KW - XML Data Management
UR - https://www.scopus.com/pages/publications/84874644251
U2 - 10.1109/CGC.2012.102
DO - 10.1109/CGC.2012.102
M3 - 会议稿件
AN - SCOPUS:84874644251
SN - 9780769548647
T3 - Proceedings - 2nd International Conference on Cloud and Green Computing and 2nd International Conference on Social Computing and Its Applications, CGC/SCA 2012
SP - 344
EP - 351
BT - Proceedings - 2nd International Conference on Cloud and Green Computing and 2nd International Conference on Social Computing and Its Applications, CGC/SCA 2012
T2 - 2nd International Conference on Cloud and Green Computing, CGC 2012, Held Jointly with the 2nd International Conference on Social Computing and Its Applications, SCA 2012
Y2 - 1 November 2012 through 3 November 2012
ER -