{"title":"基于异步强化学习的多交叉口信号控制","authors":"Jixiang Wang, Siqi Chen, Jing Wei, Boao Wang, Haiyang Yu","doi":"10.1155/atr/3890878","DOIUrl":null,"url":null,"abstract":"<div>\n <p>State-of-the-art theoretical models and new traffic signal control technologies are key guarantees for improving the management and safety performance of transportation systems, and multiagent reinforcement learning (MARL) methods have been widely applied in the field of signal control. Researchers in the transportation domain have effectively addressed the issues of poor convergence and suboptimal optimization encountered in RL for multi-intersection signal control scenarios by adopting the centralized training with decentralized execution (CTDE) approach. However, due to the heterogeneity among intersections, simply decomposing the global reward into a sum of intersection-level rewards is unreasonable, posing a challenge in balancing the interests of individual intersections and the entire road network. Additionally, the assumption that all intersections within the system make decisions synchronously is rather strong. Therefore, this paper proposes a distributed traffic model tailored for synchronous decision-making and, based on that, introduces an asynchronous decision-making traffic model according to decoupled intersection control. Simulation experiments show that the asynchronous decision-making method proposed in this paper not only improves the model convergence speed by at least 19% compared to the multiagent deep RL (MADRL) algorithm used for synchronous decision-making, but also improves the model by at least 10.5% in vehicle driving speed, maximum queue length, and average queue length within the decodable range (the traffic density is between 100 vehicles/km and 400 vehicles/km). In the same traffic scenario, the MADRL algorithm used for asynchronous decision-making has improved the average vehicle delay and average queue length by at least 55% compared to traditional arterial green wave control methods and adaptive control methods, and by at least 5% compared to SAC and A2C methods.</p>\n </div>","PeriodicalId":50259,"journal":{"name":"Journal of Advanced Transportation","volume":"2025 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/atr/3890878","citationCount":"0","resultStr":"{\"title\":\"Multi-Intersection Signal Control Based on Asynchronous Reinforcement Learning\",\"authors\":\"Jixiang Wang, Siqi Chen, Jing Wei, Boao Wang, Haiyang Yu\",\"doi\":\"10.1155/atr/3890878\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n <p>State-of-the-art theoretical models and new traffic signal control technologies are key guarantees for improving the management and safety performance of transportation systems, and multiagent reinforcement learning (MARL) methods have been widely applied in the field of signal control. Researchers in the transportation domain have effectively addressed the issues of poor convergence and suboptimal optimization encountered in RL for multi-intersection signal control scenarios by adopting the centralized training with decentralized execution (CTDE) approach. However, due to the heterogeneity among intersections, simply decomposing the global reward into a sum of intersection-level rewards is unreasonable, posing a challenge in balancing the interests of individual intersections and the entire road network. Additionally, the assumption that all intersections within the system make decisions synchronously is rather strong. Therefore, this paper proposes a distributed traffic model tailored for synchronous decision-making and, based on that, introduces an asynchronous decision-making traffic model according to decoupled intersection control. Simulation experiments show that the asynchronous decision-making method proposed in this paper not only improves the model convergence speed by at least 19% compared to the multiagent deep RL (MADRL) algorithm used for synchronous decision-making, but also improves the model by at least 10.5% in vehicle driving speed, maximum queue length, and average queue length within the decodable range (the traffic density is between 100 vehicles/km and 400 vehicles/km). In the same traffic scenario, the MADRL algorithm used for asynchronous decision-making has improved the average vehicle delay and average queue length by at least 55% compared to traditional arterial green wave control methods and adaptive control methods, and by at least 5% compared to SAC and A2C methods.</p>\\n </div>\",\"PeriodicalId\":50259,\"journal\":{\"name\":\"Journal of Advanced Transportation\",\"volume\":\"2025 1\",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1155/atr/3890878\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Advanced Transportation\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1155/atr/3890878\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advanced Transportation","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/atr/3890878","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Multi-Intersection Signal Control Based on Asynchronous Reinforcement Learning
State-of-the-art theoretical models and new traffic signal control technologies are key guarantees for improving the management and safety performance of transportation systems, and multiagent reinforcement learning (MARL) methods have been widely applied in the field of signal control. Researchers in the transportation domain have effectively addressed the issues of poor convergence and suboptimal optimization encountered in RL for multi-intersection signal control scenarios by adopting the centralized training with decentralized execution (CTDE) approach. However, due to the heterogeneity among intersections, simply decomposing the global reward into a sum of intersection-level rewards is unreasonable, posing a challenge in balancing the interests of individual intersections and the entire road network. Additionally, the assumption that all intersections within the system make decisions synchronously is rather strong. Therefore, this paper proposes a distributed traffic model tailored for synchronous decision-making and, based on that, introduces an asynchronous decision-making traffic model according to decoupled intersection control. Simulation experiments show that the asynchronous decision-making method proposed in this paper not only improves the model convergence speed by at least 19% compared to the multiagent deep RL (MADRL) algorithm used for synchronous decision-making, but also improves the model by at least 10.5% in vehicle driving speed, maximum queue length, and average queue length within the decodable range (the traffic density is between 100 vehicles/km and 400 vehicles/km). In the same traffic scenario, the MADRL algorithm used for asynchronous decision-making has improved the average vehicle delay and average queue length by at least 55% compared to traditional arterial green wave control methods and adaptive control methods, and by at least 5% compared to SAC and A2C methods.
期刊介绍:
The Journal of Advanced Transportation (JAT) is a fully peer reviewed international journal in transportation research areas related to public transit, road traffic, transport networks and air transport.
It publishes theoretical and innovative papers on analysis, design, operations, optimization and planning of multi-modal transport networks, transit & traffic systems, transport technology and traffic safety. Urban rail and bus systems, Pedestrian studies, traffic flow theory and control, Intelligent Transport Systems (ITS) and automated and/or connected vehicles are some topics of interest.
Highway engineering, railway engineering and logistics do not fall within the aims and scope of JAT.