A parallel data generator for efficiently generating “realistic” social streams

  • Shanghai Second Polytechnic University
  • East China Normal University

Research output: Contribution to journal › Article › peer-review

Abstract

A social stream is a data stream that records a series of social entities and the dynamic interactions between pairs of entities. It can be used to model changes in entity states in numerous applications. Social streams, which combine graph and streaming data, pose a great challenge to efficient analytical query processing and are key to better understanding users’ behavior. Because of privacy and other related issues, a social stream generator is of great significance. This paper proposes a framework for a synthetic social stream generator (SSG). Social streams generated with SSG can be tuned to capture several kinds of fundamental social stream properties, including patterns in users’ behavior and graph patterns. Extensive empirical studies with several real-life social stream data sets show that SSG produces data that fit real data better. It is also confirmed that SSG can generate social stream data continuously with stable throughput and memory consumption. Furthermore, we propose a parallel implementation of SSG based on an asynchronous parallel processing model and a delayed update strategy. Our experiments verify that the throughput of the parallel implementation increases linearly with the number of nodes.

Original language: English
Pages (from-to): 1072-1101
Number of pages: 30
Journal: Frontiers of Computer Science
Volume: 13
Issue number: 5
DOI
Publication status: Published - 1 Oct 2019
