
PG-RAC: A PostgreSQL-Based Shared-Cache Database for Multi-Write Transaction Processing

  • Yu Jie Yin
  • Hao Yang Shi
  • Zi Hao Fan
  • Hua Hui Zhou
  • Sheng Chi Liu
  • Hui Qi Hu*
  • Xing Wei
  • He Dui Chen
  • Yao Feng Tu
  • Peng Cai
  • Xuan Zhou
  • *Corresponding author for this work
  • East China Normal University
  • ZTE Corporation

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Single-master, multi-slave is the mainstream architecture for cloud-native databases: slave nodes offload read-only requests from the master node, while all write requests are handled by the master. To scale transaction processing further, some cloud databases attempt to support multiple write nodes. One approach is to introduce a shared cache among compute nodes so that data can be accessed across nodes. In shared-cache database systems, cross-node remote access costs significantly more than local access, so the design of the cache coherence protocol is a crucial factor in system performance and scalability. This study proposes two improvements to the coherence protocol and implements PG-RAC, a shared-cache database based on PostgreSQL that supports multi-write transactions. First, PG-RAC introduces a distributed chained routing strategy that disperses routing information among compute nodes; compared with a routing strategy that relies on single-node directory management, it reduces average transaction latency by approximately 20%. Second, the study improves the duplicate page invalidation mechanism by moving invalidation operations off the transaction path, which reduces the latency of the transaction's critical path. Building on this, PG-RAC exploits the properties of multi-version concurrency control (MVCC) to delay the point at which duplicate pages are invalidated, which effectively improves cache utilization. TPC-C experiments show that on a cluster with 4 compute nodes, PG-RAC achieves nearly 2 times the throughput of PostgreSQL and 1.5 times that of the distributed database Citus.

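Translated title of the contribution: PG-RAC: PostgreSQL-based Database with Shared Cache for Multi-write Transaction
Original language: Chinese (Traditional)
Pages (from-to): 1065-1083
Number of pages: 19
Journal: Ruan Jian Xue Bao/Journal of Software
Volume: 36
Issue number: 3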
Publication status: Published - 2025

Keywords

  • cache coherence protocol
  • cloud-native database
  • shared-cache database
  • transaction processing
