FDBSCAN: A fast DBSCAN algorithm

Shuigeng Zhou*, Aoying Zhou, Wen Jin, Ye Fan, Weining Qian

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

50 Scopus citations

Abstract

Clustering is an important task in many fields, including data mining, statistical data analysis, pattern recognition, image processing, and various business applications. Many clustering algorithms have been developed to date. Among them, DBSCAN, which originated in the database research community, is an outstanding representative because of its good performance in clustering spatial data. Relying on a density-based notion of clusters, DBSCAN is designed to discover clusters of arbitrary shape. It requires only one input parameter and supports the user in determining an appropriate value for it. In this paper, a fast DBSCAN algorithm (FDBSCAN) is developed that considerably speeds up the original DBSCAN algorithm. Unlike DBSCAN, FDBSCAN uses only a small number of representative points in a core point's neighborhood as seeds to expand the cluster, which reduces the number of region queries executed and, consequently, the I/O cost. Experimental results show that FDBSCAN is effective and efficient in clustering large-scale databases and is several times faster than the original DBSCAN algorithm.
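The seed-reduction idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how representative points are chosen, so the rule used here (for each coordinate axis, take the neighbor closest to the point at distance eps from the core point in each direction) is an assumption made purely for illustration. The names `region_query`, `pick_representatives`, and `fdbscan_sketch` are hypothetical, and the region query is a brute-force scan rather than an index lookup. Because not every neighbor is expanded, a sketch like this can miss points or split a cluster that full DBSCAN would keep together; it makes no attempt to handle those cases.

```python
import math
from collections import deque

UNSEEN, NOISE = None, -1

def region_query(points, i, eps):
    """Brute-force Eps-neighborhood of point i (includes i itself)."""
    return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

def pick_representatives(points, center, neighbors, eps):
    """Assumed selection rule: per axis and direction, the neighbor nearest
    to the position eps away from the center along that axis."""
    reps = set()
    for axis in range(len(points[center])):
        for sign in (-1.0, 1.0):
            target = list(points[center])
            target[axis] += sign * eps
            reps.add(min(neighbors, key=lambda j: math.dist(points[j], target)))
    reps.discard(center)
    return sorted(reps)

def fdbscan_sketch(points, eps, min_pts):
    """Return (labels, number_of_region_queries)."""
    labels = [UNSEEN] * len(points)
    expanded = set()          # points that already had a region query
    queries = 0
    cluster_id = 0
    for i in range(len(points)):
        if labels[i] is not UNSEEN:
            continue
        expanded.add(i)
        neighbors = region_query(points, i, eps)
        queries += 1
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        queue = deque([(i, neighbors)])
        while queue:
            core, core_nbrs = queue.popleft()
            # A core point claims every unlabeled or noise neighbor.
            for j in core_nbrs:
                if labels[j] in (UNSEEN, NOISE):
                    labels[j] = cluster_id
            # Only the representatives become seeds, not all neighbors.
            for s in pick_representatives(points, core, core_nbrs, eps):
                if s in expanded or labels[s] != cluster_id:
                    continue
                expanded.add(s)
                s_nbrs = region_query(points, s, eps)
                queries += 1
                if len(s_nbrs) >= min_pts:
                    queue.append((s, s_nbrs))
        cluster_id += 1
    return labels, queries

# Toy data: two dense line segments far apart, plus one isolated point.
points = [(x, 0) for x in range(10)] + [(x, 0) for x in range(100, 110)] + [(50, 50)]
labels, queries = fdbscan_sketch(points, eps=3, min_pts=3)
print(labels, queries)
```

On this toy data the sketch issues fewer region queries than there are points, whereas plain DBSCAN issues one per point. The savings on real data depend on the density of the data relative to eps and on how the representatives are chosen.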

Original language: English
Pages (from-to): 735-744
Number of pages: 10
Journal: Ruan Jian Xue Bao/Journal of Software
Volume: 11
Issue number: 6
State: Published - Jun 2000
Externally published: Yes
