TY - GEN
T1 - LATuner
T2 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2024
AU - Fan, Chongjiong
AU - Pan, Zhicheng
AU - Sun, Wenwen
AU - Yang, Chengcheng
AU - Chen, Wei Neng
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024
Y1 - 2024
N2 - Database Management Systems (DBMSs) offer a plethora of configurable parameters—termed “knobs”—that control the system behavior. Identifying the optimal configuration for these knobs, i.e., Knob Tuning (KT) is acknowledged as a critical way to enhance the DBMS performance. However, the increasing number of adjustable knobs and the complexity inherent in KTs have rendered manual tuning an antiquated and impractical approach. Recently, automatic KTs based on Machine Learning (ML) techniques have demonstrated significant potentials. Despite the advancements, they are also hindered by notable drawbacks such as the lack of domain knowledge and low tuning efficiency. Meanwhile, Large Language Model (LLM), which is pre-trained on diverse corpora including web content, database manuals, could offer a novel, training-free approach to significantly mitigate the aforementioned issues. In light of this, we propose an LLM-enhanced dAtabase Tuner, called LATuner. Firstly, since KT often suffers from the cold-start problem, we harness the extensive domain knowledge of LLMs to identify critical knobs and to warm start the tuning process, thus obtaining high-quality training samples. Secondly, as KT requires multiple rounds of sampling during the training process, we leverage LLMs to guide the sampling procedure, accelerating the convergence of the model training. Finally, to balance the tuning cost and efficiency between LLM-based KT and traditional ML-based KT, we design an adaptive surrogate strategy based on multi-armed bandit, achieving cost-effective tuning performance. Extensive experiments performed on well-established benchmarks have proven the efficacy and superiority of our proposal.
AB - Database Management Systems (DBMSs) offer a plethora of configurable parameters—termed “knobs”—that control the system behavior. Identifying the optimal configuration for these knobs, i.e., Knob Tuning (KT) is acknowledged as a critical way to enhance the DBMS performance. However, the increasing number of adjustable knobs and the complexity inherent in KTs have rendered manual tuning an antiquated and impractical approach. Recently, automatic KTs based on Machine Learning (ML) techniques have demonstrated significant potentials. Despite the advancements, they are also hindered by notable drawbacks such as the lack of domain knowledge and low tuning efficiency. Meanwhile, Large Language Model (LLM), which is pre-trained on diverse corpora including web content, database manuals, could offer a novel, training-free approach to significantly mitigate the aforementioned issues. In light of this, we propose an LLM-enhanced dAtabase Tuner, called LATuner. Firstly, since KT often suffers from the cold-start problem, we harness the extensive domain knowledge of LLMs to identify critical knobs and to warm start the tuning process, thus obtaining high-quality training samples. Secondly, as KT requires multiple rounds of sampling during the training process, we leverage LLMs to guide the sampling procedure, accelerating the convergence of the model training. Finally, to balance the tuning cost and efficiency between LLM-based KT and traditional ML-based KT, we design an adaptive surrogate strategy based on multi-armed bandit, achieving cost-effective tuning performance. Extensive experiments performed on well-established benchmarks have proven the efficacy and superiority of our proposal.
KW - Bayesian Optimization
KW - Database Tuning
KW - Large Language Model
KW - Multi-armed Bandit
UR - https://www.scopus.com/pages/publications/105023374118
U2 - 10.1007/978-3-031-70362-1_22
DO - 10.1007/978-3-031-70362-1_22
M3 - 会议稿件
AN - SCOPUS:105023374118
SN - 9783031703614
T3 - Lecture Notes in Computer Science
SP - 372
EP - 388
BT - Machine Learning and Knowledge Discovery in Databases. Research Track - European Conference, ECML PKDD 2024, Proceedings
A2 - Bifet, Albert
A2 - Davis, Jesse
A2 - Krilavicius, Tomas
A2 - Kull, Meelis
A2 - Ntoutsi, Eirini
A2 - Žliobaite, Indre
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 9 September 2024 through 13 September 2024
ER -