TY - GEN
T1 - AutoTable
T2 - 25th International Conference on Web Information Systems Engineering, WISE 2024
AU - Huang, Shanshan
AU - Zhu, Junpeng
AU - Zhang, Fengyan
AU - Cai, Peng
AU - Dong, Qiwen
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - Mission-critical data are commonly organized as tables within relational databases, and feature transformation from these tabular data is a pivotal component of the machine learning pipeline for business intelligence. However, automating this process poses a significant challenge, primarily due to the exponential growth in the size of the search space with an increase in the number of features (i.e., columns or dimensions of the table) and transform functions. This paper presents AutoTable, an effective and efficient feature transformation framework for tabular data using reinforcement learning. Specifically, AutoTable formulates the feature transformation problem as a search process carried out on a transformation tree, which offers a more well-structured search space and facilitates fine-grained exploration, empowering domain experts to perform AFT tasks with minimal statistical and machine learning expertise. To further improve search performance, we propose merging and lazy loading mechanisms. Experimental results demonstrate that AutoTable outperforms state-of-the-art approaches in terms of both efficiency and effectiveness.
AB - Mission-critical data are commonly organized as tables within relational databases, and feature transformation from these tabular data is a pivotal component of the machine learning pipeline for business intelligence. However, automating this process poses a significant challenge, primarily due to the exponential growth in the size of the search space with an increase in the number of features (i.e., columns or dimensions of the table) and transform functions. This paper presents AutoTable, an effective and efficient feature transformation framework for tabular data using reinforcement learning. Specifically, AutoTable formulates the feature transformation problem as a search process carried out on a transformation tree, which offers a more well-structured search space and facilitates fine-grained exploration, empowering domain experts to perform AFT tasks with minimal statistical and machine learning expertise. To further improve search performance, we propose merging and lazy loading mechanisms. Experimental results demonstrate that AutoTable outperforms state-of-the-art approaches in terms of both efficiency and effectiveness.
KW - Automated Feature Transformation
KW - Reinforcement Learning
KW - Tabular Data
UR - https://www.scopus.com/pages/publications/85211319212
U2 - 10.1007/978-981-96-0579-8_32
DO - 10.1007/978-981-96-0579-8_32
M3 - 会议稿件
AN - SCOPUS:85211319212
SN - 9789819605781
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 461
EP - 476
BT - Web Information Systems Engineering – WISE 2024 - 25th International Conference, Proceedings
A2 - Barhamgi, Mahmoud
A2 - Wang, Hua
A2 - Wang, Xin
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 2 December 2024 through 5 December 2024
ER -