跳到主要导航 跳到搜索 跳到主要内容

Tracking high quality clusters over uncertain data streams

  • Fudan University
  • Shanghai Key Laboratory of Trustworthy Computering

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Recently, data mining over uncertain data streams has attracted a lot of attentions because of the widely existed imprecise data generated from a variety of streaming applications. In this paper, we try to resolve the problem of clustering over uncertain data streams. Facing uncertain tuples with different probability distributions, the clustering algorithm should not only consider the tuple value but also emphasis on its uncertainty. To fulfill these dual purposes, a metric named tuple uncertainty will be integrated into the overall procedure of clustering. Firstly, we survey uncertain data model and propose our uncertainty measurement and corresponding properties. Secondly, based on such uncertainty quantification method, we provide a two phase stream clustering algorithm and elaborate implementation detail. Finally, performance experiments over a number of real and synthetic data sets demonstrate the effectiveness and efficiency of our method.

源语言英语
主期刊名Proceedings - 25th IEEE International Conference on Data Engineering, ICDE 2009
1641-1648
页数8
DOI
出版状态已出版 - 2009
活动25th IEEE International Conference on Data Engineering, ICDE 2009 - Shanghai, 中国
期限: 29 3月 20092 4月 2009

出版系列

姓名Proceedings - International Conference on Data Engineering
ISSN(印刷版)1084-4627

会议

会议25th IEEE International Conference on Data Engineering, ICDE 2009
国家/地区中国
Shanghai
时期29/03/092/04/09

指纹

探究 'Tracking high quality clusters over uncertain data streams' 的科研主题。它们共同构成独一无二的指纹。

引用此