PMJoin: Optimizing distributed multi-way stream joins by stream partitioning

Yongluan Zhou, Ying Yan, Feng Yu, Aoying Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

In emerging data stream applications, data sources are typically distributed. Evaluating multi-join queries over streams from different sources may incur large communication cost. As queries run continuously, the precious bandwidths would be aggressively consumed without careful optimization of operator ordering and placement. In this paper, we focus on the optimization of continuous multi-join queries over distributed streams. We observe that by partitioning streams into sub-streams we can significantly reduce the communication cost and hence propose a novel partition-based join scheme - PM Join. A few partitioning techniques are studied. To generate the query plan for each substream, a heuristic algorithm is proposed based on a rate-based model. Results from an extensive experimental study show that our techniques can sufficiently reduce the communication cost.

Original languageEnglish
Title of host publicationDatabase Systems for Advanced Applications - 11th International Conference, DASFAA 2006, Proceedings
PublisherSpringer Verlag
Pages325-341
Number of pages17
ISBN (Print)3540333371, 9783540333371
DOIs
StatePublished - 2006
Externally publishedYes
Event11th International Conference on Database Systems for Advanced Applications, DASFAA 2006 - Singapore, Singapore
Duration: 12 Apr 200615 Apr 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3882 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th International Conference on Database Systems for Advanced Applications, DASFAA 2006
Country/TerritorySingapore
CitySingapore
Period12/04/0615/04/06

Fingerprint

Dive into the research topics of 'PMJoin: Optimizing distributed multi-way stream joins by stream partitioning'. Together they form a unique fingerprint.

Cite this