TY - GEN
T1 - Deep Reinforcement Learning for Autonomous Driving with Multiple Expert Demonstrations
AU - Wang, Chenghao
AU - Li, Miaodi
AU - Zhang, Muxiang
AU - Zhang, Min
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Deep reinforcement learning (DRL) has emerged as a promising approach to address challenges in autonomous driving. Recently, reinforcement learning from expert demonstrations has gained considerable attention, offering a synergistic integration of imitation learning (IL) and reinforcement learning to enhance model performance. However, in complex driving scenarios, expert decisions may be suboptimal, leading to inefficient training and susceptibility to local optima. In light of this, we propose a novel DRL-based model, termed PME. In the decision-making process, we introduce an enhanced algorithm built upon Proximal Policy Optimization (PPO), which integrates multiple expert policies to mitigate the negative impact of erroneous expert decisions, thereby significantly improving the overall performance of the autonomous driving model. Experimental results, conducted in the CARLA simulator, demonstrate that the proposed PME model surpasses conventional reinforcement learning algorithms across various driving scenarios. The framework consistently achieves higher success rates under diverse conditions, showcasing improved learning stability and superior generalization capabilities.
AB - Deep reinforcement learning (DRL) has emerged as a promising approach to address challenges in autonomous driving. Recently, reinforcement learning from expert demonstrations has gained considerable attention, offering a synergistic integration of imitation learning (IL) and reinforcement learning to enhance model performance. However, in complex driving scenarios, expert decisions may be suboptimal, leading to inefficient training and susceptibility to local optima. In light of this, we propose a novel DRL-based model, termed PME. In the decision-making process, we introduce an enhanced algorithm built upon Proximal Policy Optimization (PPO), which integrates multiple expert policies to mitigate the negative impact of erroneous expert decisions, thereby significantly improving the overall performance of the autonomous driving model. Experimental results, conducted in the CARLA simulator, demonstrate that the proposed PME model surpasses conventional reinforcement learning algorithms across various driving scenarios. The framework consistently achieves higher success rates under diverse conditions, showcasing improved learning stability and superior generalization capabilities.
KW - autonomous driving
KW - deep reinforcement learning
KW - multiple expert demonstrations
UR - https://www.scopus.com/pages/publications/105023979912
U2 - 10.1109/IJCNN64981.2025.11227385
DO - 10.1109/IJCNN64981.2025.11227385
M3 - 会议稿件
AN - SCOPUS:105023979912
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - International Joint Conference on Neural Networks, IJCNN 2025 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 International Joint Conference on Neural Networks, IJCNN 2025
Y2 - 30 June 2025 through 5 July 2025
ER -