TY - GEN
T1 - Autonomous Driving Systems
T2 - 7th International Conference on Web Research, ICWR 2021
AU - Jamshidi, Faezeh
AU - Zhang, Lei
AU - Nezhadalinaei, Fahimeh
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/5/19
Y1 - 2021/5/19
N2 - Autonomous driving is the most attractive field to research by academic and industrial socials that intelligent transportation play a vital role in structure of autonomous driving systems. Artificial Intelligence (AI) is an infrastructure for autonomous driving by designing of intelligent machine. Deep Learning is one of subfields of Artificial Intelligence that create models by mimicking human brain's functioning to make decision that it has shown great success in autonomous diving systems field. However, it performs very poorly in some stochastic environments caused by large overestimations of action values. Thus, we use the double estimator to Q-learning to construct Double Q-learning with a new off-policy reinforcement learning algorithm. By this algorithm, we can approximate the maximum expected value for any number of random variables and it underestimate rather than overestimate the maximum expected value. Moreover, we use an optimization method based on A* to improve routing in automation driving. Our proposed approach based on double Q-Learning and A* is evaluated on an example environment with random obstacles and compare results to use Q-Learning alone. Results show the proposed approach has better performance based on duration of trip to destination and collision to obstacles.
AB - Autonomous driving is the most attractive field to research by academic and industrial socials that intelligent transportation play a vital role in structure of autonomous driving systems. Artificial Intelligence (AI) is an infrastructure for autonomous driving by designing of intelligent machine. Deep Learning is one of subfields of Artificial Intelligence that create models by mimicking human brain's functioning to make decision that it has shown great success in autonomous diving systems field. However, it performs very poorly in some stochastic environments caused by large overestimations of action values. Thus, we use the double estimator to Q-learning to construct Double Q-learning with a new off-policy reinforcement learning algorithm. By this algorithm, we can approximate the maximum expected value for any number of random variables and it underestimate rather than overestimate the maximum expected value. Moreover, we use an optimization method based on A* to improve routing in automation driving. Our proposed approach based on double Q-Learning and A* is evaluated on an example environment with random obstacles and compare results to use Q-Learning alone. Results show the proposed approach has better performance based on duration of trip to destination and collision to obstacles.
KW - Autonomous Driving Systems
KW - Autonomous Vehicles
KW - Deep Learning
KW - Double Q-Learning and A
UR - https://www.scopus.com/pages/publications/85107647128
U2 - 10.1109/ICWR51868.2021.9443139
DO - 10.1109/ICWR51868.2021.9443139
M3 - 会议稿件
AN - SCOPUS:85107647128
T3 - 2021 7th International Conference on Web Research, ICWR 2021
SP - 82
EP - 85
BT - 2021 7th International Conference on Web Research, ICWR 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 19 May 2021 through 20 May 2021
ER -