Effective data density estimation in ring-based P2P networks

Minqi Zhou, Heng Tao Shen, Xiaofang Zhou, Weining Qian, Aoying Zhou

Research output: Contribution to journalConference articlepeer-review

4 Scopus citations

Abstract

Estimating the global data distribution in Peer-to-Peer (P2P) networks is an important issue and has yet to be well addressed. It can benefit many P2P applications, such as load balancing analysis, query processing, and data mining. Inspired by the inversion method for random variate generation, in this paper we present a novel model named distribution-free data density estimation for dynamic ring-based P2P networks to achieve high estimation accuracy with low estimation cost regardless of distribution models of the underlying data. It generates random samples for any arbitrary distribution by sampling the global cumulative distribution function and is free from sampling bias. In P2P networks, the key idea for distribution-free estimation is to sample a small subset of peers for estimating the global data distribution over the data domain. Algorithms on computing and sampling the global cumulative distribution function based on which global data distribution is estimated are introduced with detailed theoretical analysis. Our extensive performance study confirms the effectiveness and efficiency of our methods in ring-based P2P networks.

Original languageEnglish
Article number6228117
Pages (from-to)594-605
Number of pages12
JournalProceedings - International Conference on Data Engineering
DOIs
StatePublished - 2012
EventIEEE 28th International Conference on Data Engineering, ICDE 2012 - Arlington, VA, United States
Duration: 1 Apr 20125 Apr 2012

Fingerprint

Dive into the research topics of 'Effective data density estimation in ring-based P2P networks'. Together they form a unique fingerprint.

Cite this