Kang Wang, Zhishu Shen, Zhenwei Wang, Tiehua Zhang
{"title":"Towards Multi-agent Policy-based Directed Hypergraph Learning for Traffic Signal Control","authors":"Kang Wang, Zhishu Shen, Zhenwei Wang, Tiehua Zhang","doi":"arxiv-2409.05037","DOIUrl":null,"url":null,"abstract":"Deep reinforcement learning (DRL) methods that incorporate graph neural\nnetworks (GNNs) have been extensively studied for intelligent traffic signal\ncontrol, which aims to coordinate traffic signals effectively across multiple\nintersections. Despite this progress, the standard graph learning used in these\nmethods still struggles to capture higher-order correlations in real-world\ntraffic flow. In this paper, we propose a multi-agent proximal policy\noptimization framework DHG-PPO, which incorporates PPO and directed hypergraph\nmodule to extract the spatio-temporal attributes of the road networks. DHG-PPO\nenables multiple agents to ingeniously interact through the dynamical\nconstruction of hypergraph. The effectiveness of DHG-PPO is validated in terms\nof average travel time and throughput against state-of-the-art baselines\nthrough extensive experiments.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"22 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Multiagent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Deep reinforcement learning (DRL) methods that incorporate graph neural
networks (GNNs) have been extensively studied for intelligent traffic signal
control, which aims to coordinate traffic signals effectively across multiple
intersections. Despite this progress, the standard graph learning used in these
methods still struggles to capture higher-order correlations in real-world
traffic flow. In this paper, we propose a multi-agent proximal policy
optimization framework DHG-PPO, which incorporates PPO and directed hypergraph
module to extract the spatio-temporal attributes of the road networks. DHG-PPO
enables multiple agents to ingeniously interact through the dynamical
construction of hypergraph. The effectiveness of DHG-PPO is validated in terms
of average travel time and throughput against state-of-the-art baselines
through extensive experiments.