TY - JOUR
T1 - Towards the design of efficient hash-based indexing scheme for growing databases on non-volatile memory
AU - Ma, Zhulin
AU - Sha, Edwin H.M.
AU - Zhuge, Qingfeng
AU - Jiang, Weiwen
AU - Zhang, Runyu
AU - Gu, Shouzhen
N1 - Publisher Copyright:
© 2019
PY - 2020/4
Y1 - 2020/4
N2 - The index is a fundamental component in data intensive systems to accelerate data retrieval operations. In the design of Non-Volatile Memory (NVM) based indexes, the hash-based structure is one of the most promising candidates since it can take full advantages of byte-addressable property of NVM to perform query operations with constant time complexity. However, we found that the basic operation, “rehash operation”, may incur a large number of write activities on NVM, which is harmful to the endurance of NVM, and will cause drastic performance degradation. Additionally, range query operations cannot be efficiently conducted on hash-based indexes. In this paper, we first investigate how to design an NVM-friendly hash-based structure with the considerations of endurance and performance issues. Then, we propose a novel indexing scheme called “Bucket Hash”, which can significantly reduce the overhead caused by rehash operations and range query operations. We evaluate the proposed Bucket Hash using YCSB workloads. Compared with existing indexes, Bucket Hash achieves 40% reduction on average in the number of NVM writes, meanwhile gaining 30% improvement on timing performance.
AB - The index is a fundamental component in data intensive systems to accelerate data retrieval operations. In the design of Non-Volatile Memory (NVM) based indexes, the hash-based structure is one of the most promising candidates since it can take full advantages of byte-addressable property of NVM to perform query operations with constant time complexity. However, we found that the basic operation, “rehash operation”, may incur a large number of write activities on NVM, which is harmful to the endurance of NVM, and will cause drastic performance degradation. Additionally, range query operations cannot be efficiently conducted on hash-based indexes. In this paper, we first investigate how to design an NVM-friendly hash-based structure with the considerations of endurance and performance issues. Then, we propose a novel indexing scheme called “Bucket Hash”, which can significantly reduce the overhead caused by rehash operations and range query operations. We evaluate the proposed Bucket Hash using YCSB workloads. Compared with existing indexes, Bucket Hash achieves 40% reduction on average in the number of NVM writes, meanwhile gaining 30% improvement on timing performance.
KW - Hashing data structures
KW - Index schemes
KW - Non-volatile memory
UR - https://www.scopus.com/pages/publications/85074706670
U2 - 10.1016/j.future.2019.07.035
DO - 10.1016/j.future.2019.07.035
M3 - 文章
AN - SCOPUS:85074706670
SN - 0167-739X
VL - 105
SP - 1
EP - 12
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -