Skip to main navigation Skip to search Skip to main content

Finding dependency trees from binary data

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Much work has been done in finding interesting subsets of items, since it has broad applications in financial data analysis, e-commerce, text data mining, and so on. Though the well-known frequent pattern mining attracted much attention in research community, recently, more work has been devoted to analysis of more sophisticated relationships among items. Chow-Liu tree and low-entropy tree, for example, were used to summarize the frequent patterns. In this paper, we consider finding a novel dependency tree from binary data. It has several advantages over previous related work. Firstly, we propose a novel distance measure between items based on information theory, which captures the expected uncertainty in the item pairs and the mutual information between them. Based on this distance measure, we present a simple yet efficient algorithm for finding the dependency trees from binary data. We also show how our new approach can find applications in frequent pattern summarization. Our running example on synthetic dataset shows that our approach achieves good results compared to existing popular heuristics.

Original languageEnglish
Title of host publicationProceedings - 8th IEEE International Conference on Computer and Information Technology Workshops, CIT Workshops 2008
Pages80-85
Number of pages6
DOIs
StatePublished - 2008
Event8th IEEE International Conference on Computer and Information Technology Workshops, CIT Workshops 2008 - Sydney, Australia
Duration: 8 Jul 200811 Jul 2008

Publication series

NameProceedings - 8th IEEE International Conference on Computer and Information Technology Workshops, CIT Workshops 2008

Conference

Conference8th IEEE International Conference on Computer and Information Technology Workshops, CIT Workshops 2008
Country/TerritoryAustralia
CitySydney
Period8/07/0811/07/08

Fingerprint

Dive into the research topics of 'Finding dependency trees from binary data'. Together they form a unique fingerprint.

Cite this