Xiaocai Zhang, Lok Sang Chan, Neema Nassir, Majid Sarvi
{"title":"Towards fair lights: A multi-agent masked deep reinforcement learning for efficient corridor-level traffic signal control","authors":"Xiaocai Zhang, Lok Sang Chan, Neema Nassir, Majid Sarvi","doi":"10.1016/j.commtr.2025.100203","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents an adaptive traffic signal control (ATSC) method for managing multiple intersections at the corridor level by proposing a novel multi-agent masked deep reinforcement learning (DRL) framework. The method extends the hybrid soft-actor-critic architecture to optimize green light timings for intersections across a corridor network, fostering a balance between vehicle flow and pedestrian movements with an emphasis on humanism, fairness, and equality. By integrating an innovative phase mask mechanism, our model dynamically adapts to the fluctuating demand of different transportation modalities by discovering new states or actions that could avoid local optima and achieve higher rewards. We comprehensively test our method using five naturalistic traffic scenarios in Melbourne, Australia. The results demonstrate a significant improvement in reducing the number of impacted travellers compared to existing DRL and other baseline methods. Furthermore, the inclusion of the phase mask mechanism enhances our model's performance through ablation analyses. The proposed framework not only supports a fairer traffic signal system but also provides a scalable, adaptable solution for diverse urban traffic conditions. .</div></div>","PeriodicalId":100292,"journal":{"name":"Communications in Transportation Research","volume":"5 ","pages":"Article 100203"},"PeriodicalIF":14.5000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Transportation Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772424725000435","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0
Abstract
This study presents an adaptive traffic signal control (ATSC) method for managing multiple intersections at the corridor level by proposing a novel multi-agent masked deep reinforcement learning (DRL) framework. The method extends the hybrid soft-actor-critic architecture to optimize green light timings for intersections across a corridor network, fostering a balance between vehicle flow and pedestrian movements with an emphasis on humanism, fairness, and equality. By integrating an innovative phase mask mechanism, our model dynamically adapts to the fluctuating demand of different transportation modalities by discovering new states or actions that could avoid local optima and achieve higher rewards. We comprehensively test our method using five naturalistic traffic scenarios in Melbourne, Australia. The results demonstrate a significant improvement in reducing the number of impacted travellers compared to existing DRL and other baseline methods. Furthermore, the inclusion of the phase mask mechanism enhances our model's performance through ablation analyses. The proposed framework not only supports a fairer traffic signal system but also provides a scalable, adaptable solution for diverse urban traffic conditions. .