Clustering algorithm over uncertain data streams

Research output: Contribution to journalArticlepeer-review

28 Scopus citations

Abstract

This paper proposes a novel algorithm, named EMicro, to cluster uncertain data streams. Although most of the works used today mainly use the distance metric to describe the cluster quality, EMicro considers distance metric and data uncertainty together to measure the clustering quality. Another contribution of this paper is the outlier processing mechanism. Two buffers are maintained to reserve normal micro-clusters and potential outlier micro-clusters, respectively, to obtain good performance. Experimental results show that EMicro outperforms existing methods in efficiency and effectiveness.

Original languageEnglish
Pages (from-to)2173-2182
Number of pages10
JournalRuan Jian Xue Bao/Journal of Software
Volume21
Issue number9
DOIs
StatePublished - Sep 2010

Keywords

  • Clustering
  • Outlier
  • Uncertain data stream

Fingerprint

Dive into the research topics of 'Clustering algorithm over uncertain data streams'. Together they form a unique fingerprint.

Cite this