Online Index Recommendation for Slow Queries

  • Gan Peng
  • , Peng Cai*
  • , Kaikai Ye
  • , Kai Li
  • , Jinlong Cai
  • , Yufeng Shen
  • , Han Su
  • , Weiyuan Xu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Database autonomy service (DAS) is a platform that provides assistance to database maintainers or administrators in managing a large number of database instances in major internet companies. An important task of DAS is to find missing indexes to improve the performance of slow queries reported from its managed online database instances. Traditional database systems provide the 'what-if' function or the hypothetical index technique. Index metadata is modified to simulate the benefits of indexes for queries without creating physical index files. Decades of research have led to plenty of ideas for index recommendation through the use of the 'what-if' function and different search strategies. However, the popular open-source database system MySQL, used by most internet companies, has not provided the 'what-if' function. In Meituan, tens of thousands of MySQL instances have been deployed across many business lines. Consequently, the DAS platform has accumulated lots of index creation samples. In this paper, we introduce index learner (IdxL), designed to learn index creation knowledge from these informative index data. IdxL resolves the problem of index recommendation by formulating it into an end-to-end supervised learning problem. Given a slow query, IdxL uses learned index creation knowledge to directly predict the missing indexes. Experimental results demonstrate: (1) IdxL is superior to the state-of-the-art index recommendation methods, especially when the error in cost estimation was propagated to the search in candidate index space, and (2) in particular, IdxL achieves up to 97% performance gain over a state-of-the-art method relying on the optimizer's cost estimation in the Meituan-specific index recommendation scenario. Finally, we present the applied results of IdxL in the Meituan DAS platform, demonstrating its ability to transfer index creation knowledge from certain databases to others.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 40th International Conference on Data Engineering, ICDE 2024
PublisherIEEE Computer Society
Pages5294-5306
Number of pages13
ISBN (Electronic)9798350317152
DOIs
StatePublished - 2024
Event40th IEEE International Conference on Data Engineering, ICDE 2024 - Utrecht, Netherlands
Duration: 13 May 202417 May 2024

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627
ISSN (Electronic)2375-0286

Conference

Conference40th IEEE International Conference on Data Engineering, ICDE 2024
Country/TerritoryNetherlands
CityUtrecht
Period13/05/2417/05/24

Keywords

  • Databases
  • Index Recom-mendation
  • Machine Learning

Fingerprint

Dive into the research topics of 'Online Index Recommendation for Slow Queries'. Together they form a unique fingerprint.

Cite this