Practical path-based methods for clustering arbitrary shaped data sets

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Path-based clustering is a well-known method for extracting arbitrary shaped clusters. However, its high time complexity limits some possible applications. In this paper, we propose two new algorithms to speed up the original path-based method. A basic method focuses on the path-distance calculation. A modified Floyd algorithm is applied to reduce the time complexity from Θ(n2m + n3 log n) to Θ(n3 + nk). An improved method emphasizes large scale data sets. A preprocess is used to reduce the number of data points to the path-based algorithm. Moreover, this algorithm can automatic determine the number of clusters by a box clustering. The new approaches are applied to a variety of test data sets with arbitrary shapes and the experimental results show that our method is efficient in dealing with the given problems.

Original languageEnglish
Title of host publicationProceedings - 2013 9th International Conference on Natural Computation, ICNC 2013
PublisherIEEE Computer Society
Pages962-966
Number of pages5
ISBN (Print)9781467347143
DOIs
StatePublished - 2013
Event2013 9th International Conference on Natural Computation, ICNC 2013 - Shenyang, China
Duration: 23 Jul 201325 Jul 2013

Publication series

NameProceedings - International Conference on Natural Computation
ISSN (Print)2157-9555

Conference

Conference2013 9th International Conference on Natural Computation, ICNC 2013
Country/TerritoryChina
CityShenyang
Period23/07/1325/07/13

Fingerprint

Dive into the research topics of 'Practical path-based methods for clustering arbitrary shaped data sets'. Together they form a unique fingerprint.

Cite this