A Chunk-Based Hash Table Caching Method for In-Memory Hash Joins

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In-memory query processing can be accelerated by caching intermediate query results. Among various types of intermediate results, hash tables used by hash join are ideal objects for caching, as they can benefit a wide range of queries. In this paper, we introduce a fine-grained hash table caching method to benefit the hash-join operator. Our insight is that the fine-grained management of cached hash tables at the granularity of chunks can achieve optimal caching efficiency. As hash chunks can be reused more effectively, we further propose a cache-enabled hash join operator to adapt the cache under chunk granularity and accelerate in-memory hash join execution. Furthermore, we also consider concurrent accesses to cached hash chunks and present the scheduling strategy to improve throughput and consider response time simultaneously. To the best of our knowledge, our work is the first one that studies the delicate management of intermediate result caching for the concurrent workload. We integrated our caching method into a prototype to evaluate its performance. Experiments show that it can achieve significant performance improvement over brute force caching methods.

Original languageEnglish
Title of host publicationWeb Information Systems Engineering – WISE 2020 - 21st International Conference, Proceedings
EditorsZhisheng Huang, Wouter Beek, Hua Wang, Yanchun Zhang, Rui Zhou
PublisherSpringer Science and Business Media Deutschland GmbH
Pages376-389
Number of pages14
ISBN (Print)9783030620073
DOIs
StatePublished - 2020
Event21st International Conference on Web Information Systems Engineering, WISE 2020 - Amsterdam, Netherlands
Duration: 20 Oct 202024 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12343 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Web Information Systems Engineering, WISE 2020
Country/TerritoryNetherlands
CityAmsterdam
Period20/10/2024/10/20

Keywords

  • Cache management
  • Concurrent sharing
  • Hash join

Fingerprint

Dive into the research topics of 'A Chunk-Based Hash Table Caching Method for In-Memory Hash Joins'. Together they form a unique fingerprint.

Cite this