Compression of weighted graphs

Hannu Toivonen, Fang Zhou, Aleksi Hartikainen, Atte Hinkka

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

122 Scopus citations

Abstract

We propose to compress weighted graphs (networks), motivated by the observation that large networks of social, biological, or other relations can be complex to handle and visualize. In the process also known as graph simplification, nodes and (unweighted) edges are grouped to supernodes and superedges, respectively, to obtain a smaller graph. We propose models and algorithms for weighted graphs. The interpretation (i.e. decompression) of a compressed, weighted graph is that a pair of original nodes is connected by an edge if their supernodes are connected by one, and that the weight of an edge is approximated to be the weight of the superedge. The compression problem now consists of choosing supernodes, superedges, and superedge weights so that the approximation error is minimized while the amount of compression is maximized. In this paper, we formulate this task as the'simple weighted graph compression problem'. We then propose a much wider class of tasks under the name of'generalized weighted graph compression problem'. The generalized task extends the optimization to preserve longer-range connectivities between nodes, not just individual edge weights. We study the properties of these problems and propose a range of algorithms to solve them, with different balances between complexity and quality of the result. We evaluate the problems and algorithms experimentally on real networks. The results indicate that weighted graphs can be compressed efficiently with relatively little compression error.

Original languageEnglish
Title of host publicationProceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11
PublisherAssociation for Computing Machinery
Pages965-973
Number of pages9
ISBN (Print)9781450308137
DOIs
StatePublished - 2011
Externally publishedYes
Event17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011 - San Diego, United States
Duration: 21 Aug 201124 Aug 2011

Publication series

NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Conference

Conference17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011
Country/TerritoryUnited States
CitySan Diego
Period21/08/1124/08/11

Keywords

  • Compression
  • Graph mining
  • Network
  • Weighted graph

Fingerprint

Dive into the research topics of 'Compression of weighted graphs'. Together they form a unique fingerprint.

Cite this