IPDALight: Intensity- and phase duration-aware traffic signal control based on Reinforcement Learning[Formula presented]

  • Wupan Zhao
  • , Yutong Ye
  • , Jiepin Ding
  • , Ting Wang
  • , Tongquan Wei
  • , Mingsong Chen*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

42 Scopus citations

Abstract

Reinforcement Learning (RL) has been recognized as one of the most effective methods to optimize traffic signal control. However, due to the inappropriate design of RL elements (i.e., reward and state) for complex traffic dynamics, existing RL-based approaches suffer from slow convergence to optimal traffic signal plans. Meanwhile, to simplify the traffic modeling, most optimization methods assume that the phase duration of traffic signals is constant, which strongly limits the RL capability to search for traffic signal control policies with shorter average vehicle travel time and better GreenWave control. To address these issues, this paper proposes a novel intensity- and phase duration-aware RL-based method named IPDALight for the optimization of traffic signal control. Inspired by the Max Pressure (MP)-based traffic control strategy used in the transportation field, we introduce a new concept named intensity, which ensures that our reward design and state representation can accurately reflect the status of vehicles. By taking the coordination of neighboring intersections into account, our approach enables the fine-tuning of phase duration of traffic signals to adapt to dynamic traffic situations. Comprehensive experimental results on both synthetic and real-world traffic scenarios show that, compared with the state-of-the-art RL methods, IPDALight can not only achieve better average vehicle travel time and greenwave control for various multi-intersection scenarios, but also converge to optimal solutions much faster.

Original languageEnglish
Article number102374
JournalJournal of Systems Architecture
Volume123
DOIs
StatePublished - Feb 2022

Keywords

  • Average travel time
  • Greenwave control
  • Max Pressure
  • Phase duration
  • Reinforcement Learning
  • Traffic signal control

Fingerprint

Dive into the research topics of 'IPDALight: Intensity- and phase duration-aware traffic signal control based on Reinforcement Learning[Formula presented]'. Together they form a unique fingerprint.

Cite this