跳到主要导航 跳到搜索 跳到主要内容

IPDALight: Intensity- and phase duration-aware traffic signal control based on Reinforcement Learning[Formula presented]

  • East China Normal University

科研成果: 期刊稿件文章同行评审

摘要

Reinforcement Learning (RL) has been recognized as one of the most effective methods to optimize traffic signal control. However, due to the inappropriate design of RL elements (i.e., reward and state) for complex traffic dynamics, existing RL-based approaches suffer from slow convergence to optimal traffic signal plans. Meanwhile, to simplify the traffic modeling, most optimization methods assume that the phase duration of traffic signals is constant, which strongly limits the RL capability to search for traffic signal control policies with shorter average vehicle travel time and better GreenWave control. To address these issues, this paper proposes a novel intensity- and phase duration-aware RL-based method named IPDALight for the optimization of traffic signal control. Inspired by the Max Pressure (MP)-based traffic control strategy used in the transportation field, we introduce a new concept named intensity, which ensures that our reward design and state representation can accurately reflect the status of vehicles. By taking the coordination of neighboring intersections into account, our approach enables the fine-tuning of phase duration of traffic signals to adapt to dynamic traffic situations. Comprehensive experimental results on both synthetic and real-world traffic scenarios show that, compared with the state-of-the-art RL methods, IPDALight can not only achieve better average vehicle travel time and greenwave control for various multi-intersection scenarios, but also converge to optimal solutions much faster.

源语言英语
文章编号102374
期刊Journal of Systems Architecture
123
DOI
出版状态已出版 - 2月 2022

指纹

探究 'IPDALight: Intensity- and phase duration-aware traffic signal control based on Reinforcement Learning[Formula presented]' 的科研主题。它们共同构成独一无二的指纹。

引用此