AutoTable: Effective and Efficient Automated Feature Transformation for Tabular Data

Shanshan Huang, Junpeng Zhu, Fengyan Zhang, Peng Cai*, Qiwen Dong

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Mission-critical data are commonly organized as tables within relational databases, and feature transformation from these tabular data is a pivotal component of the machine learning pipeline for business intelligence. However, automating this process poses a significant challenge, primarily due to the exponential growth in the size of the search space with an increase in the number of features (i.e., columns or dimensions of the table) and transform functions. This paper presents AutoTable, an effective and efficient feature transformation framework for tabular data using reinforcement learning. Specifically, AutoTable formulates the feature transformation problem as a search process carried out on a transformation tree, which offers a more well-structured search space and facilitates fine-grained exploration, empowering domain experts to perform AFT tasks with minimal statistical and machine learning expertise. To further improve search performance, we propose merging and lazy loading mechanisms. Experimental results demonstrate that AutoTable outperforms state-of-the-art approaches in terms of both efficiency and effectiveness.

Original languageEnglish
Title of host publicationWeb Information Systems Engineering – WISE 2024 - 25th International Conference, Proceedings
EditorsMahmoud Barhamgi, Hua Wang, Xin Wang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages461-476
Number of pages16
ISBN (Print)9789819605781
DOIs
StatePublished - 2025
Event25th International Conference on Web Information Systems Engineering, WISE 2024 - Doha, Qatar
Duration: 2 Dec 20245 Dec 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15436 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference25th International Conference on Web Information Systems Engineering, WISE 2024
Country/TerritoryQatar
CityDoha
Period2/12/245/12/24

Keywords

  • Automated Feature Transformation
  • Reinforcement Learning
  • Tabular Data

Fingerprint

Dive into the research topics of 'AutoTable: Effective and Efficient Automated Feature Transformation for Tabular Data'. Together they form a unique fingerprint.

Cite this