基于图的协同多智能体强化学习智能交通信号控制

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Internet of Things Journal Pub Date : 2025-01-03 DOI:10.1109/JIOT.2025.3525640

Jing Shang;Shunmei Meng;Jun Hou;Xiaoran Zhao;Xiaokang Zhou;Rong Jiang;Lianyong Qi;Qianmu Li

{"title":"基于图的协同多智能体强化学习智能交通信号控制","authors":"Jing Shang;Shunmei Meng;Jun Hou;Xiaoran Zhao;Xiaokang Zhou;Rong Jiang;Lianyong Qi;Qianmu Li","doi":"10.1109/JIOT.2025.3525640","DOIUrl":null,"url":null,"abstract":"In the trend of continuously advancing urban intelligent transport construction, traditional traffic signal control (TSC) struggles to make effective decisions with complex traffic conditions. Although multiagent deep reinforcement learning shows promise in optimizing traffic flow, most existing studies ignore the complex relationships between signal lights and fail to communicate with neighbors effectively. Moreover, the deterministic strategies generated by Q-learning-based methods struggle to be extended to large-scale urban road networks. Therefore, this article proposes a multiagent graph-based soft actor-critic (MAGSAC) approach for TSC, which combines graph neural networks with the soft actor-critic (SAC) algorithm and extends it to multiagent environments to address the TSC problem. Specifically, we employ graph-based networks and attention mechanism to expand the receptive domain of agents, enable environmental information to be shared among agents, and utilize the attention mechanism to filter out unimportant information. The algorithm adheres to the centralized training decentralized execution (CTDE) paradigm to minimize the nonstationarity of MARL. Finally, a rigorous experimental evaluation was conducted using the CityFlow simulator on both synthetic traffic grids and real-world urban road networks. Experimental results show that MAGSAC outperforms other TSC methods in performance metrics, including average queue length and waiting time, and achieves excellent performance under complex urban traffic conditions.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 10","pages":"14362-14374"},"PeriodicalIF":8.9000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Graph-Based Cooperation Multiagent Reinforcement Learning for Intelligent Traffic Signal Control\",\"authors\":\"Jing Shang;Shunmei Meng;Jun Hou;Xiaoran Zhao;Xiaokang Zhou;Rong Jiang;Lianyong Qi;Qianmu Li\",\"doi\":\"10.1109/JIOT.2025.3525640\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the trend of continuously advancing urban intelligent transport construction, traditional traffic signal control (TSC) struggles to make effective decisions with complex traffic conditions. Although multiagent deep reinforcement learning shows promise in optimizing traffic flow, most existing studies ignore the complex relationships between signal lights and fail to communicate with neighbors effectively. Moreover, the deterministic strategies generated by Q-learning-based methods struggle to be extended to large-scale urban road networks. Therefore, this article proposes a multiagent graph-based soft actor-critic (MAGSAC) approach for TSC, which combines graph neural networks with the soft actor-critic (SAC) algorithm and extends it to multiagent environments to address the TSC problem. Specifically, we employ graph-based networks and attention mechanism to expand the receptive domain of agents, enable environmental information to be shared among agents, and utilize the attention mechanism to filter out unimportant information. The algorithm adheres to the centralized training decentralized execution (CTDE) paradigm to minimize the nonstationarity of MARL. Finally, a rigorous experimental evaluation was conducted using the CityFlow simulator on both synthetic traffic grids and real-world urban road networks. Experimental results show that MAGSAC outperforms other TSC methods in performance metrics, including average queue length and waiting time, and achieves excellent performance under complex urban traffic conditions.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 10\",\"pages\":\"14362-14374\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-01-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10824857/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10824857/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

在城市智能交通建设不断推进的趋势下，传统的交通信号控制（TSC）难以在复杂的交通条件下做出有效的决策。尽管多智能体深度强化学习在优化交通流方面显示出希望，但大多数现有研究都忽略了信号灯之间的复杂关系，并且无法有效地与邻居进行通信。此外，基于q学习的方法生成的确定性策略难以扩展到大规模的城市道路网络。因此，本文提出了一种基于多智能体图的TSC软行为者评价（MAGSAC）方法，该方法将图神经网络与软行为者评价（SAC）算法相结合，并将其扩展到多智能体环境中来解决TSC问题。具体而言，我们采用基于图的网络和注意机制来扩展智能体的接受域，实现环境信息在智能体之间的共享，并利用注意机制过滤掉不重要的信息。该算法遵循集中训练分散执行（CTDE）范式，最大限度地减少MARL的非平稳性。最后，利用CityFlow模拟器对合成交通网格和真实城市道路网络进行了严格的实验评估。实验结果表明，MAGSAC在平均队列长度和等待时间等性能指标上优于其他TSC方法，在复杂的城市交通条件下取得了优异的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Graph-Based Cooperation Multiagent Reinforcement Learning for Intelligent Traffic Signal Control

In the trend of continuously advancing urban intelligent transport construction, traditional traffic signal control (TSC) struggles to make effective decisions with complex traffic conditions. Although multiagent deep reinforcement learning shows promise in optimizing traffic flow, most existing studies ignore the complex relationships between signal lights and fail to communicate with neighbors effectively. Moreover, the deterministic strategies generated by Q-learning-based methods struggle to be extended to large-scale urban road networks. Therefore, this article proposes a multiagent graph-based soft actor-critic (MAGSAC) approach for TSC, which combines graph neural networks with the soft actor-critic (SAC) algorithm and extends it to multiagent environments to address the TSC problem. Specifically, we employ graph-based networks and attention mechanism to expand the receptive domain of agents, enable environmental information to be shared among agents, and utilize the attention mechanism to filter out unimportant information. The algorithm adheres to the centralized training decentralized execution (CTDE) paradigm to minimize the nonstationarity of MARL. Finally, a rigorous experimental evaluation was conducted using the CityFlow simulator on both synthetic traffic grids and real-world urban road networks. Experimental results show that MAGSAC outperforms other TSC methods in performance metrics, including average queue length and waiting time, and achieves excellent performance under complex urban traffic conditions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.