TY - GEN
T1 - Niosit
T2 - 50th ACM Turing Conference - China, ACM TUR-C 2017
AU - Xiao, Bing
AU - Guo, Jinwei
AU - Qian, Weining
AU - Hu, Huiqi
AU - Zhou, Aoying
N1 - Publisher Copyright:
© 2017 ACM.
PY - 2017/5/12
Y1 - 2017/5/12
N2 - Recent years, the log-structured merge-tree(LSM-tree) style storage has been widely adopted in distributed data storage systems(e.g. Bigtable and HBase) and commercial database systems(e.g. Ocean-Base, Cassandra, SQLite, etc.) to provide both large-volume storage capacity and high-performance data updates. Write operations become easier as the LSM-tree style storage avoids writing in place by updating a data copy in memory. However, read operations are a.ected as it requires an additional step during a data compaction to check if there exists the newest update of data record in memory, which brings many of costly empty reads in real usage since the volume of immutable data is pervasively far more massive than incremental delta data. To address this issue, we design a new network request processing mechanism to allow data access being processed in an auxiliary lightweight network communication IO thread. And a Bloom filter is incorporated with the network IO thread to effiectively filter out the empty reads. We also analyze the efficiency advantage of the mechanism and introduce its detailed implementation based on the well-known OceanBase system. Experimental study using the YCSB benchmark demonstrates the proposed mechanism can significantly achieve 20 percent to 30 percent be.er performance than existing method.
AB - Recent years, the log-structured merge-tree(LSM-tree) style storage has been widely adopted in distributed data storage systems(e.g. Bigtable and HBase) and commercial database systems(e.g. Ocean-Base, Cassandra, SQLite, etc.) to provide both large-volume storage capacity and high-performance data updates. Write operations become easier as the LSM-tree style storage avoids writing in place by updating a data copy in memory. However, read operations are a.ected as it requires an additional step during a data compaction to check if there exists the newest update of data record in memory, which brings many of costly empty reads in real usage since the volume of immutable data is pervasively far more massive than incremental delta data. To address this issue, we design a new network request processing mechanism to allow data access being processed in an auxiliary lightweight network communication IO thread. And a Bloom filter is incorporated with the network IO thread to effiectively filter out the empty reads. We also analyze the efficiency advantage of the mechanism and introduce its detailed implementation based on the well-known OceanBase system. Experimental study using the YCSB benchmark demonstrates the proposed mechanism can significantly achieve 20 percent to 30 percent be.er performance than existing method.
KW - Bloom filter
KW - Data access
KW - Log-structured merge-tree
KW - Network io processing
UR - https://www.scopus.com/pages/publications/85021250108
U2 - 10.1145/3063955.3063977
DO - 10.1145/3063955.3063977
M3 - 会议稿件
AN - SCOPUS:85021250108
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the ACM Turing 50th Celebration Conference - China, ACM TUR-C 2017
PB - Association for Computing Machinery
Y2 - 12 May 2017 through 14 May 2017
ER -