跳到主要导航 跳到搜索 跳到主要内容

Karst: Transactional Data Ingestion Without Blocking on a Scalable Architecture

  • Zhifang Li
  • , Beicheng Peng
  • , Qiuli Huang
  • , Chuliang Weng*
  • *此作品的通讯作者
  • East China Normal University

科研成果: 期刊稿件文章同行评审

摘要

Although real-time analytics on the up-to-date dataset has become an emerging demand, many big data systems are still designed for offline analytics. Particularly, for critical applications like Fintech, transactional data ingestion ensures a timely, always-correct, and scalable dataset. To carry out append-only ingestion, existing OLTP/HTAP systems are based on strict transactions with imperfect scalability, while NoSQL-like systems support scalable but relaxed transactions. How to ensure essential transactional guarantees without harming scalability seems to be a non-trivial issue. This paper proposes Karst to bring transactional data ingestion for existing offline analytics. We notice that blocking two-phase commit (2PC) to resolve transactional data ingestion is a performance killer for the partitioned analytical systems. Karst introduces a scalable protocol called metadata-oriented commit (MOC) that converts each distributed transaction into multiple partial transactions to avoid 2PC. Moreover, to ingest massive data into plenty of partitions, Karst also employs lazy persistence, lightweight logging, and optimized data traffic. In experiments, Karst could achieve up to about 2x$\sim$∼10x performance over relevant systems and also shows remarkable scalability.

源语言英语
页(从-至)2241-2253
页数13
期刊IEEE Transactions on Knowledge and Data Engineering
34
5
DOI
出版状态已出版 - 1 5月 2022

指纹

探究 'Karst: Transactional Data Ingestion Without Blocking on a Scalable Architecture' 的科研主题。它们共同构成独一无二的指纹。

引用此