TY - GEN
T1 - Learning Behavior Trees for Automated Guided Vehicles via Genetic and Reinforcement Methods
AU - Yang, Wenzheng
AU - Zhao, Yongxin
AU - Wang, Qiang
AU - Li, Yongjian
AU - Tian, Yudan
N1 - Publisher Copyright:
© 2025 Knowledge Systems Institute Graduate School. All rights reserved.
PY - 2025
Y1 - 2025
N2 - Behavior Trees (BTs) is a robust framework for decision-making that is highly applicable in automated guided vehicles (AGVs), offering a way to manage complex tasks and respond to dynamic changes in their environment. To enhance the adaptability and efficiency of BTs in AGVs, we introduce a hybrid approach that integrates Genetic Programming (GP) and Reinforcement Learning (RL). The GP evolves the BT structure by selecting potential actions from an action pool, guided by tailored constraints that ensure the trees remain interpretable and relevant to AGV tasks. Meanwhile, to mitigate node dependency issues in BTs, we employ RL, which incorporates a parameter-dependent dynamic updating (PDDU) algorithm to monitor and manage the relationships between parameters. Furthermore, we implement a weighted ϵ-greedy algorithm to refine the parameter update process. Our methodology is validated through simulated AGV scenarios, demonstrating that the evolved BT significantly improve AGV autonomy. This innovative fusion of GP and RL techniques sets a foundation for future developments in AGV technology, with potential applications extending beyond the factory floor to any environment where AGVs are deployed.
AB - Behavior Trees (BTs) is a robust framework for decision-making that is highly applicable in automated guided vehicles (AGVs), offering a way to manage complex tasks and respond to dynamic changes in their environment. To enhance the adaptability and efficiency of BTs in AGVs, we introduce a hybrid approach that integrates Genetic Programming (GP) and Reinforcement Learning (RL). The GP evolves the BT structure by selecting potential actions from an action pool, guided by tailored constraints that ensure the trees remain interpretable and relevant to AGV tasks. Meanwhile, to mitigate node dependency issues in BTs, we employ RL, which incorporates a parameter-dependent dynamic updating (PDDU) algorithm to monitor and manage the relationships between parameters. Furthermore, we implement a weighted ϵ-greedy algorithm to refine the parameter update process. Our methodology is validated through simulated AGV scenarios, demonstrating that the evolved BT significantly improve AGV autonomy. This innovative fusion of GP and RL techniques sets a foundation for future developments in AGV technology, with potential applications extending beyond the factory floor to any environment where AGVs are deployed.
KW - Automated Guided Vehicles
KW - Behavior Trees
KW - Genetic Programming
KW - Reinforcement Learning
UR - https://www.scopus.com/pages/publications/105018738231
U2 - 10.18293/SEKE2025-054
DO - 10.18293/SEKE2025-054
M3 - 会议稿件
AN - SCOPUS:105018738231
T3 - Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE
SP - 336
EP - 341
BT - Proceedings - SEKE 2025
PB - Knowledge Systems Institute Graduate School
T2 - 37th International Conference on Software Engineering and Knowledge Engineering, SEKE 2025
Y2 - 29 September 2025 through 4 October 2025
ER -