IMBridge: Impedance Mismatch Mitigation between Database Engine and Prediction Query Execution

  • Chenyang Zhang
  • , Junxiong Peng
  • , Chen Xu*
  • , Quanqing Xu
  • , Chuanhui Yang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Prediction queries that apply machine learning (ML) models to perform analysis on data stored in the database are prevalent with the advance of research. Thanks to the prosperity of ML frameworks in Python, current database systems introduce Python UDFs into query engines for inference invocation. However, there are impedance mismatches between database engines and prediction query execution with this approach. In particular, the database engine is oblivious to the semantics within prediction functions, which incurs the repetitive inference context setup. Moreover, the evaluation of the prediction function is coupled with the operator, which results in an undesirable inference batch size with low inference throughput. To mitigate these, we propose a system called IMBridge, which leverages aprediction function rewriter to eliminate redundant inference context setup and introduces adecoupled prediction operator to ensure that the evaluation batch size matches the desirable inference batch size. In this demonstration, we will showcase how IMBridge addresses these mismatches and boosts prediction query execution.

Original languageEnglish
Title of host publicationSIGMOD-Companion 2024 - Companion of the 2024 International Conferaence on Management of Data
PublisherAssociation for Computing Machinery
Pages456-459
Number of pages4
ISBN (Electronic)9798400704222
DOIs
StatePublished - 9 Jun 2024
Event2024 International Conference on Management of Data, SIGMOD 2024 - Santiago, Chile
Duration: 9 Jun 202415 Jun 2024

Publication series

NameProceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print)0730-8078

Conference

Conference2024 International Conference on Management of Data, SIGMOD 2024
Country/TerritoryChile
CitySantiago
Period9/06/2415/06/24

Keywords

  • machine learning prediction query
  • query optimization

Fingerprint

Dive into the research topics of 'IMBridge: Impedance Mismatch Mitigation between Database Engine and Prediction Query Execution'. Together they form a unique fingerprint.

Cite this