Ruochuan Fan , Jian Lu , David Z.W. Wang , Meng Li , Qingchao Liu
{"title":"Group-benefit control strategy for connected automated vehicles in mixed traffic at intersections","authors":"Ruochuan Fan , Jian Lu , David Z.W. Wang , Meng Li , Qingchao Liu","doi":"10.1016/j.trc.2025.105121","DOIUrl":null,"url":null,"abstract":"<div><div>The unpredictable driving behavior of Human-driven Vehicles (HVs) exhibits significant complexity when mixed with Connected Autonomous Vehicles (CAVs) and the periodic variations of the intersection signal phase impact the efficiency and stability of traffic flows. In response to these challenges, this study proposed a group-benefit driving behavior control strategy based on a Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm to provide comprehensive and overall efficient driving guidance for CAVs in multi-agent scenarios and reduce the impact of heterogeneous traffic flow on the signalized intersections. The dynamically encoded state enables the model to effectively adapt to dynamically changing environments, and each agent independently makes combined action decisions to achieve group benefits. Regularization methods and the attention mechanism are integrated to prevent overfitting and enhance stability, with the attention mechanism having a more significant impact. The results indicate that the model effectively reduces delays and fuel consumption even at low CAV Market Penetration Rates (MPRs), with the optimization effects significantly improving as MPR increases. Compared to the HV traffic flow, the optimized CAV traffic flow reduces average vehicle delay by 10.11% and CO<sub>2</sub> emissions by 20.93%. The comparison with test models, which separately adjust traffic flow and signal phases, and interpolation experiments, which modify the MPR, demonstrates the model’s strong generalization ability and adaptability. Furthermore, compared with the Multi-Agent Deep Q-Network (MADQN) model, the proposed model demonstrates greater robustness with significantly better optimization results.</div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"175 ","pages":"Article 105121"},"PeriodicalIF":7.6000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X25001251","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The unpredictable driving behavior of Human-driven Vehicles (HVs) exhibits significant complexity when mixed with Connected Autonomous Vehicles (CAVs) and the periodic variations of the intersection signal phase impact the efficiency and stability of traffic flows. In response to these challenges, this study proposed a group-benefit driving behavior control strategy based on a Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm to provide comprehensive and overall efficient driving guidance for CAVs in multi-agent scenarios and reduce the impact of heterogeneous traffic flow on the signalized intersections. The dynamically encoded state enables the model to effectively adapt to dynamically changing environments, and each agent independently makes combined action decisions to achieve group benefits. Regularization methods and the attention mechanism are integrated to prevent overfitting and enhance stability, with the attention mechanism having a more significant impact. The results indicate that the model effectively reduces delays and fuel consumption even at low CAV Market Penetration Rates (MPRs), with the optimization effects significantly improving as MPR increases. Compared to the HV traffic flow, the optimized CAV traffic flow reduces average vehicle delay by 10.11% and CO2 emissions by 20.93%. The comparison with test models, which separately adjust traffic flow and signal phases, and interpolation experiments, which modify the MPR, demonstrates the model’s strong generalization ability and adaptability. Furthermore, compared with the Multi-Agent Deep Q-Network (MADQN) model, the proposed model demonstrates greater robustness with significantly better optimization results.
期刊介绍:
Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.