A data mining approach to XML dissemination

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Currently user's interests are expressed by XPath or XQuery queries in XML dissemination applications. These queries require a good knowledge of the structure and contents of the documents that will arrive; As well as knowledge of XQuery which few consumers will have. In some cases, where the distinction of relevant and irrelevant documents requires the consideration of a large number of features, the query may be impossible. This paper introduces a data mining approach to XML dissemination that uses a given document collection of the user to automatically learn a classifier modelling of his/her information needs. Also discussed are the corresponding optimization methods that allow a dissemination server to execute a massive number of classifiers simultaneously. The experimental evaluation of several real XML document sets demonstrates the accuracy and efficiency of the proposed approach.

Original languageEnglish
Title of host publicationWeb Information Systems Engineering, WISE 2010 - 11th International Conference, Proceedings
Pages442-455
Number of pages14
DOIs
StatePublished - 2010
Event11th International Conference on Web Information Systems Engineering, WISE 2010 - Hong Kong, China
Duration: 12 Dec 201014 Dec 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6488 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th International Conference on Web Information Systems Engineering, WISE 2010
Country/TerritoryChina
CityHong Kong
Period12/12/1014/12/10

Keywords

  • XML Classification
  • XML dissemination
  • feature vector
  • frequent structural pattern
  • pattern matching

Fingerprint

Dive into the research topics of 'A data mining approach to XML dissemination'. Together they form a unique fingerprint.

Cite this