跳到主要导航 跳到搜索 跳到主要内容

Sampling connected induced subgraphs uniformly at random

  • Xuesong Lu*
  • , Stéphane Bressan
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

A recurrent challenge for modern applications is the processing of large graphs. The ability to generate representative samples of smaller size is useful not only to circumvent scalability issues but also, per se, for statistical analysis and other data mining tasks. For such purposes adequate sampling techniques must be devised. We are interested, in this paper, in the uniform random sampling of a connected subgraph from a graph. We require that the sample contains a prescribed number of vertices. The sampled graph is the corresponding induced graph. We devise, present and discuss several algorithms that leverage three different techniques: Rejection Sampling, Random Walk and Markov Chain Monte Carlo. We empirically evaluate and compare the performance of the algorithms. We show that they are effective and efficient but that there is a trade-off, which depends on the density of the graphs and the sample size. We propose one novel algorithm, which we call Neighbour Reservoir Sampling (NRS), that very successfully realizes the trade-off between effectiveness and efficiency.

源语言英语
主期刊名Scientific and Statistical Database Management - 24th International Conference, SSDBM 2012, Proceedings
出版商Springer Verlag
195-212
页数18
ISBN(印刷版)9783642312342
DOI
出版状态已出版 - 2012
已对外发布
活动24th International Conference on Scientific and Statistical DatabaseManagement, SSDBM 2012 - Chania, Crete, 希腊
期限: 25 6月 201227 6月 2012

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
7338 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议24th International Conference on Scientific and Statistical DatabaseManagement, SSDBM 2012
国家/地区希腊
Chania, Crete
时期25/06/1227/06/12

指纹

探究 'Sampling connected induced subgraphs uniformly at random' 的科研主题。它们共同构成独一无二的指纹。

引用此