Xiaocai Zhang, Lok Sang Chan, Neema Nassir, Majid Sarvi
{"title":"走向公平的灯光:用于有效的走廊级交通信号控制的多智能体掩膜深度强化学习","authors":"Xiaocai Zhang, Lok Sang Chan, Neema Nassir, Majid Sarvi","doi":"10.1016/j.commtr.2025.100203","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents an adaptive traffic signal control (ATSC) method for managing multiple intersections at the corridor level by proposing a novel multi-agent masked deep reinforcement learning (DRL) framework. The method extends the hybrid soft-actor-critic architecture to optimize green light timings for intersections across a corridor network, fostering a balance between vehicle flow and pedestrian movements with an emphasis on humanism, fairness, and equality. By integrating an innovative phase mask mechanism, our model dynamically adapts to the fluctuating demand of different transportation modalities by discovering new states or actions that could avoid local optima and achieve higher rewards. We comprehensively test our method using five naturalistic traffic scenarios in Melbourne, Australia. The results demonstrate a significant improvement in reducing the number of impacted travellers compared to existing DRL and other baseline methods. Furthermore, the inclusion of the phase mask mechanism enhances our model's performance through ablation analyses. The proposed framework not only supports a fairer traffic signal system but also provides a scalable, adaptable solution for diverse urban traffic conditions. .</div></div>","PeriodicalId":100292,"journal":{"name":"Communications in Transportation Research","volume":"5 ","pages":"Article 100203"},"PeriodicalIF":14.5000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards fair lights: A multi-agent masked deep reinforcement learning for efficient corridor-level traffic signal control\",\"authors\":\"Xiaocai Zhang, Lok Sang Chan, Neema Nassir, Majid Sarvi\",\"doi\":\"10.1016/j.commtr.2025.100203\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study presents an adaptive traffic signal control (ATSC) method for managing multiple intersections at the corridor level by proposing a novel multi-agent masked deep reinforcement learning (DRL) framework. The method extends the hybrid soft-actor-critic architecture to optimize green light timings for intersections across a corridor network, fostering a balance between vehicle flow and pedestrian movements with an emphasis on humanism, fairness, and equality. By integrating an innovative phase mask mechanism, our model dynamically adapts to the fluctuating demand of different transportation modalities by discovering new states or actions that could avoid local optima and achieve higher rewards. We comprehensively test our method using five naturalistic traffic scenarios in Melbourne, Australia. The results demonstrate a significant improvement in reducing the number of impacted travellers compared to existing DRL and other baseline methods. Furthermore, the inclusion of the phase mask mechanism enhances our model's performance through ablation analyses. The proposed framework not only supports a fairer traffic signal system but also provides a scalable, adaptable solution for diverse urban traffic conditions. .</div></div>\",\"PeriodicalId\":100292,\"journal\":{\"name\":\"Communications in Transportation Research\",\"volume\":\"5 \",\"pages\":\"Article 100203\"},\"PeriodicalIF\":14.5000,\"publicationDate\":\"2025-08-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications in Transportation Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772424725000435\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Transportation Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772424725000435","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION","Score":null,"Total":0}
Towards fair lights: A multi-agent masked deep reinforcement learning for efficient corridor-level traffic signal control
This study presents an adaptive traffic signal control (ATSC) method for managing multiple intersections at the corridor level by proposing a novel multi-agent masked deep reinforcement learning (DRL) framework. The method extends the hybrid soft-actor-critic architecture to optimize green light timings for intersections across a corridor network, fostering a balance between vehicle flow and pedestrian movements with an emphasis on humanism, fairness, and equality. By integrating an innovative phase mask mechanism, our model dynamically adapts to the fluctuating demand of different transportation modalities by discovering new states or actions that could avoid local optima and achieve higher rewards. We comprehensively test our method using five naturalistic traffic scenarios in Melbourne, Australia. The results demonstrate a significant improvement in reducing the number of impacted travellers compared to existing DRL and other baseline methods. Furthermore, the inclusion of the phase mask mechanism enhances our model's performance through ablation analyses. The proposed framework not only supports a fairer traffic signal system but also provides a scalable, adaptable solution for diverse urban traffic conditions. .