基于图的协同多智能体强化学习智能交通信号控制

IF 8.9 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Jing Shang;Shunmei Meng;Jun Hou;Xiaoran Zhao;Xiaokang Zhou;Rong Jiang;Lianyong Qi;Qianmu Li
{"title":"基于图的协同多智能体强化学习智能交通信号控制","authors":"Jing Shang;Shunmei Meng;Jun Hou;Xiaoran Zhao;Xiaokang Zhou;Rong Jiang;Lianyong Qi;Qianmu Li","doi":"10.1109/JIOT.2025.3525640","DOIUrl":null,"url":null,"abstract":"In the trend of continuously advancing urban intelligent transport construction, traditional traffic signal control (TSC) struggles to make effective decisions with complex traffic conditions. Although multiagent deep reinforcement learning shows promise in optimizing traffic flow, most existing studies ignore the complex relationships between signal lights and fail to communicate with neighbors effectively. Moreover, the deterministic strategies generated by Q-learning-based methods struggle to be extended to large-scale urban road networks. Therefore, this article proposes a multiagent graph-based soft actor-critic (MAGSAC) approach for TSC, which combines graph neural networks with the soft actor-critic (SAC) algorithm and extends it to multiagent environments to address the TSC problem. Specifically, we employ graph-based networks and attention mechanism to expand the receptive domain of agents, enable environmental information to be shared among agents, and utilize the attention mechanism to filter out unimportant information. The algorithm adheres to the centralized training decentralized execution (CTDE) paradigm to minimize the nonstationarity of MARL. Finally, a rigorous experimental evaluation was conducted using the CityFlow simulator on both synthetic traffic grids and real-world urban road networks. Experimental results show that MAGSAC outperforms other TSC methods in performance metrics, including average queue length and waiting time, and achieves excellent performance under complex urban traffic conditions.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 10","pages":"14362-14374"},"PeriodicalIF":8.9000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Graph-Based Cooperation Multiagent Reinforcement Learning for Intelligent Traffic Signal Control\",\"authors\":\"Jing Shang;Shunmei Meng;Jun Hou;Xiaoran Zhao;Xiaokang Zhou;Rong Jiang;Lianyong Qi;Qianmu Li\",\"doi\":\"10.1109/JIOT.2025.3525640\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the trend of continuously advancing urban intelligent transport construction, traditional traffic signal control (TSC) struggles to make effective decisions with complex traffic conditions. Although multiagent deep reinforcement learning shows promise in optimizing traffic flow, most existing studies ignore the complex relationships between signal lights and fail to communicate with neighbors effectively. Moreover, the deterministic strategies generated by Q-learning-based methods struggle to be extended to large-scale urban road networks. Therefore, this article proposes a multiagent graph-based soft actor-critic (MAGSAC) approach for TSC, which combines graph neural networks with the soft actor-critic (SAC) algorithm and extends it to multiagent environments to address the TSC problem. Specifically, we employ graph-based networks and attention mechanism to expand the receptive domain of agents, enable environmental information to be shared among agents, and utilize the attention mechanism to filter out unimportant information. The algorithm adheres to the centralized training decentralized execution (CTDE) paradigm to minimize the nonstationarity of MARL. Finally, a rigorous experimental evaluation was conducted using the CityFlow simulator on both synthetic traffic grids and real-world urban road networks. Experimental results show that MAGSAC outperforms other TSC methods in performance metrics, including average queue length and waiting time, and achieves excellent performance under complex urban traffic conditions.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 10\",\"pages\":\"14362-14374\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-01-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10824857/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10824857/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

在城市智能交通建设不断推进的趋势下,传统的交通信号控制(TSC)难以在复杂的交通条件下做出有效的决策。尽管多智能体深度强化学习在优化交通流方面显示出希望,但大多数现有研究都忽略了信号灯之间的复杂关系,并且无法有效地与邻居进行通信。此外,基于q学习的方法生成的确定性策略难以扩展到大规模的城市道路网络。因此,本文提出了一种基于多智能体图的TSC软行为者评价(MAGSAC)方法,该方法将图神经网络与软行为者评价(SAC)算法相结合,并将其扩展到多智能体环境中来解决TSC问题。具体而言,我们采用基于图的网络和注意机制来扩展智能体的接受域,实现环境信息在智能体之间的共享,并利用注意机制过滤掉不重要的信息。该算法遵循集中训练分散执行(CTDE)范式,最大限度地减少MARL的非平稳性。最后,利用CityFlow模拟器对合成交通网格和真实城市道路网络进行了严格的实验评估。实验结果表明,MAGSAC在平均队列长度和等待时间等性能指标上优于其他TSC方法,在复杂的城市交通条件下取得了优异的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Graph-Based Cooperation Multiagent Reinforcement Learning for Intelligent Traffic Signal Control
In the trend of continuously advancing urban intelligent transport construction, traditional traffic signal control (TSC) struggles to make effective decisions with complex traffic conditions. Although multiagent deep reinforcement learning shows promise in optimizing traffic flow, most existing studies ignore the complex relationships between signal lights and fail to communicate with neighbors effectively. Moreover, the deterministic strategies generated by Q-learning-based methods struggle to be extended to large-scale urban road networks. Therefore, this article proposes a multiagent graph-based soft actor-critic (MAGSAC) approach for TSC, which combines graph neural networks with the soft actor-critic (SAC) algorithm and extends it to multiagent environments to address the TSC problem. Specifically, we employ graph-based networks and attention mechanism to expand the receptive domain of agents, enable environmental information to be shared among agents, and utilize the attention mechanism to filter out unimportant information. The algorithm adheres to the centralized training decentralized execution (CTDE) paradigm to minimize the nonstationarity of MARL. Finally, a rigorous experimental evaluation was conducted using the CityFlow simulator on both synthetic traffic grids and real-world urban road networks. Experimental results show that MAGSAC outperforms other TSC methods in performance metrics, including average queue length and waiting time, and achieves excellent performance under complex urban traffic conditions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Internet of Things Journal
IEEE Internet of Things Journal Computer Science-Information Systems
CiteScore
17.60
自引率
13.20%
发文量
1982
期刊介绍: The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信