Cluster aggregate inequality and multi-level hierarchical clustering

  • Chris Ding*
  • , Xiaofeng He
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

17 Scopus citations

Abstract

We show that (1) in hierarchical clustering, many linkage functions satisfy a cluster aggregate inequality, which allows an exact O(N2) multi-level (using mutual nearest neighbor) implementation of the standard O(N3) agglomerative hierarchical clustering algorithm. (2) a desirable close friends cohesion of clusters can be translated into kNN consistency which is guaranteed by the multi-level algorithm; (3) For similarity-based linkage functions, the multi-level algorithm is naturally implemented as graph contraction. The effectiveness of our algorithms is demonstrated on a number of real life applications.

Original languageEnglish
Title of host publicationKnowledge Discovery in Databases
Subtitle of host publicationPKDD 2005 - 9th European Conference on Principles and Practice of Knowledge Discovery in Databases, Proceedings
PublisherSpringer Verlag
Pages71-83
Number of pages13
ISBN (Print)3540292446, 9783540292449
DOIs
StatePublished - 2005
Externally publishedYes
Event9th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2005 - Porto, Portugal
Duration: 3 Oct 20057 Oct 2005

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3721 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2005
Country/TerritoryPortugal
CityPorto
Period3/10/057/10/05

Fingerprint

Dive into the research topics of 'Cluster aggregate inequality and multi-level hierarchical clustering'. Together they form a unique fingerprint.

Cite this