Content-based search using term aggregation and classification over hybrid peer-to-peer systems

Aoying Zhou*, Rong Zhang, Quang Hieu Vu, Weining Qian

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we propose a method to support content-based search, one of the challenges in Peer-to-Peer file sharing system. The proposed method is based on a hybrid structure, which is a combination of a Chord ring and a balanced tree. The tree is used to aggregate and classify terms while the Chord ring is used to index terms of nodes in the tree. At every node in the tree, the system classifies terms as either important or unimportant. Important terms of a node, which can distinguish the node from its neighbor nodes, are indexed in the Chord ring while unimportant terms that are either popular or rare terms are aggregated to higher level nodes. Based on the classification, the system can process queries on the fly without the need of global knowledge. Therefore, our system can avoid the problem of bottleneck at nodes keeping global knowledge and the expensive cost of synchronization global knowledge among these nodes. We have done extensive experiments to validate the effectiveness and efficiency of our proposal.

Original languageEnglish
Title of host publicationProceedings - 2007 IFIP International Conference on Network and Parallel Computing Workshops, NPC 2007
Pages28-35
Number of pages8
DOIs
StatePublished - 2007
Event2007 IFIP International Conference on Network and Parallel Computing Workshops, NPC 2007 - Dalian, China
Duration: 18 Sep 200721 Sep 2007

Publication series

NameProceedings - 2007 IFIP International Conference on Network and Parallel Computing Workshops, NPC 2007

Conference

Conference2007 IFIP International Conference on Network and Parallel Computing Workshops, NPC 2007
Country/TerritoryChina
CityDalian
Period18/09/0721/09/07

Fingerprint

Dive into the research topics of 'Content-based search using term aggregation and classification over hybrid peer-to-peer systems'. Together they form a unique fingerprint.

Cite this