Cluster merging and splitting in hierarchical clustering algorithms

  • Chris Ding*
  • , Xiaofeng He
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

139 Scopus citations

Abstract

Hierarchical clustering constructs a hierarchy of clusters by either repeatedly merging two smaller clusters into a larger one or splitting a larger cluster into smaller ones. The crucial step is how to best select the next cluster(s) to split or merge. Here we provide a comprehensive analysis of selection methods and propose several new methods. We perform extensive clustering experiments to test 8 selection methods, and find that the average similarity is the best method in divisive clustering and the MinMax linkage is the best in agglomerative clustering. Cluster balance is a key factor to achieve good performance. We also introduce the concept of objective function saturation and clustering target distance to effectively assess the quality of clustering.

Original languageEnglish
Title of host publicationProceedings - 2002 IEEE International Conference on Data Mining, ICDM 2002
Pages139-146
Number of pages8
StatePublished - 2002
Externally publishedYes
Event2nd IEEE International Conference on Data Mining, ICDM '02 - Maebashi, Japan
Duration: 9 Dec 200212 Dec 2002

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference2nd IEEE International Conference on Data Mining, ICDM '02
Country/TerritoryJapan
CityMaebashi
Period9/12/0212/12/02

Fingerprint

Dive into the research topics of 'Cluster merging and splitting in hierarchical clustering algorithms'. Together they form a unique fingerprint.

Cite this