TY - GEN
T1 - Genetic Programming-based Evolutionary Feature Construction for Heterogeneous Ensemble Learning [Hot of the Press]
AU - Zhang, Hengzhe
AU - Zhou, Aimin
AU - Chen, Qi
AU - Xue, Bing
AU - Zhang, Mengjie
N1 - Publisher Copyright:
© 2023 Copyright held by the owner/author(s).
PY - 2023/7/15
Y1 - 2023/7/15
N2 - This Hof-off-the-Press paper summarizes our recently published work, "SR-Forest: A Genetic Programming based Heterogeneous Ensemble Learning Method," published in IEEE Transactions on Evolutionary Computation [4]. This paper presents SR-Forest, a novel genetic programming-based heterogeneous ensemble learning method, which combines the strengths of decision trees and genetic programming-based symbolic regression methods. Rather than treating genetic programming-based symbolic regression methods as competitors to random forests, we propose to enhance the performance of random forests by incorporating genetic programming as a complementary technique. We introduce a guided mutation operator, a multi-fidelity evaluation strategy, and an ensemble selection mechanism to accelerate the search process, reduce computational costs, and improve predictive performance. Experimental results on a regression benchmark with 120 datasets show that SR-Forest outperforms 25 existing symbolic regression and ensemble learning methods. Moreover, we demonstrate the effectiveness of SR-Forest on an XGBoost hyperparameter performance prediction task, which is an important application area of ensemble learning methods. Overall, SR-Forest provides a promising approach to solving regression problems and can serve as a valuable tool in real-world applications.
AB - This Hof-off-the-Press paper summarizes our recently published work, "SR-Forest: A Genetic Programming based Heterogeneous Ensemble Learning Method," published in IEEE Transactions on Evolutionary Computation [4]. This paper presents SR-Forest, a novel genetic programming-based heterogeneous ensemble learning method, which combines the strengths of decision trees and genetic programming-based symbolic regression methods. Rather than treating genetic programming-based symbolic regression methods as competitors to random forests, we propose to enhance the performance of random forests by incorporating genetic programming as a complementary technique. We introduce a guided mutation operator, a multi-fidelity evaluation strategy, and an ensemble selection mechanism to accelerate the search process, reduce computational costs, and improve predictive performance. Experimental results on a regression benchmark with 120 datasets show that SR-Forest outperforms 25 existing symbolic regression and ensemble learning methods. Moreover, we demonstrate the effectiveness of SR-Forest on an XGBoost hyperparameter performance prediction task, which is an important application area of ensemble learning methods. Overall, SR-Forest provides a promising approach to solving regression problems and can serve as a valuable tool in real-world applications.
KW - Evolutionary Feature Construction
KW - Genetic Programming
KW - Heterogeneous Ensemble Learning
UR - https://www.scopus.com/pages/publications/85169064181
U2 - 10.1145/3583133.3595831
DO - 10.1145/3583133.3595831
M3 - 会议稿件
AN - SCOPUS:85169064181
T3 - GECCO 2023 Companion - Proceedings of the 2023 Genetic and Evolutionary Computation Conference Companion
SP - 49
EP - 50
BT - GECCO 2023 Companion - Proceedings of the 2023 Genetic and Evolutionary Computation Conference Companion
PB - Association for Computing Machinery, Inc
T2 - 2023 Genetic and Evolutionary Computation Conference Companion, GECCO 2023 Companion
Y2 - 15 July 2023 through 19 July 2023
ER -