Zhiwei Yang , Zuduo Zheng , Jiwon Kim , Hesham Rakha
{"title":"基于单智能体和多智能体强化学习的混合交通环境下队列生态合作自适应巡航控制","authors":"Zhiwei Yang , Zuduo Zheng , Jiwon Kim , Hesham Rakha","doi":"10.1016/j.trd.2025.104658","DOIUrl":null,"url":null,"abstract":"<div><div>Signalized arterials create stop-and-go traffic, leading to collisions, delays, wasted energy, and discomfort. Connected Automated Vehicles (CAVs), using Cooperative Adaptive Cruise Control (CACC), can potentially mitigate these issues by optimizing speeds with shared information. However, the traffic environment in CACC research on signalized roads is predominantly generated through simulations. This paper compares various eco-friendly CACC methods based on reinforcement learning (RL) for CAVs operating with Human-driven Vehicles (HVs) on signalized arterials. Methods analyzed include Deep Deterministic Policy Gradient (DDPG), Soft Actor-Critic (SAC), and their multi-agent versions (MADDPG, MASAC), trained and tested on naturalistic data from the pNEUMA dataset. These RL methods are benchmarked against human-driven trajectories and the Intelligent Driver Model (IDM) in mixed platoon scenarios. Results show that DDPG and SAC excel in vehicle performance (safety, efficiency, energy, comfort), while MADDPG and MASAC perform best in platoon stability. Key factors influencing performance include platoon characteristics, vehicle position, and preceding vehicle type.</div></div>","PeriodicalId":23277,"journal":{"name":"Transportation Research Part D-transport and Environment","volume":"142 ","pages":"Article 104658"},"PeriodicalIF":7.3000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Eco-cooperative adaptive cruise control for platoons in mixed traffic using single-agent and multi-agent reinforcement learning\",\"authors\":\"Zhiwei Yang , Zuduo Zheng , Jiwon Kim , Hesham Rakha\",\"doi\":\"10.1016/j.trd.2025.104658\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Signalized arterials create stop-and-go traffic, leading to collisions, delays, wasted energy, and discomfort. Connected Automated Vehicles (CAVs), using Cooperative Adaptive Cruise Control (CACC), can potentially mitigate these issues by optimizing speeds with shared information. However, the traffic environment in CACC research on signalized roads is predominantly generated through simulations. This paper compares various eco-friendly CACC methods based on reinforcement learning (RL) for CAVs operating with Human-driven Vehicles (HVs) on signalized arterials. Methods analyzed include Deep Deterministic Policy Gradient (DDPG), Soft Actor-Critic (SAC), and their multi-agent versions (MADDPG, MASAC), trained and tested on naturalistic data from the pNEUMA dataset. These RL methods are benchmarked against human-driven trajectories and the Intelligent Driver Model (IDM) in mixed platoon scenarios. Results show that DDPG and SAC excel in vehicle performance (safety, efficiency, energy, comfort), while MADDPG and MASAC perform best in platoon stability. Key factors influencing performance include platoon characteristics, vehicle position, and preceding vehicle type.</div></div>\",\"PeriodicalId\":23277,\"journal\":{\"name\":\"Transportation Research Part D-transport and Environment\",\"volume\":\"142 \",\"pages\":\"Article 104658\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2025-03-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part D-transport and Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1361920925000689\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part D-transport and Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361920925000689","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}
Eco-cooperative adaptive cruise control for platoons in mixed traffic using single-agent and multi-agent reinforcement learning
Signalized arterials create stop-and-go traffic, leading to collisions, delays, wasted energy, and discomfort. Connected Automated Vehicles (CAVs), using Cooperative Adaptive Cruise Control (CACC), can potentially mitigate these issues by optimizing speeds with shared information. However, the traffic environment in CACC research on signalized roads is predominantly generated through simulations. This paper compares various eco-friendly CACC methods based on reinforcement learning (RL) for CAVs operating with Human-driven Vehicles (HVs) on signalized arterials. Methods analyzed include Deep Deterministic Policy Gradient (DDPG), Soft Actor-Critic (SAC), and their multi-agent versions (MADDPG, MASAC), trained and tested on naturalistic data from the pNEUMA dataset. These RL methods are benchmarked against human-driven trajectories and the Intelligent Driver Model (IDM) in mixed platoon scenarios. Results show that DDPG and SAC excel in vehicle performance (safety, efficiency, energy, comfort), while MADDPG and MASAC perform best in platoon stability. Key factors influencing performance include platoon characteristics, vehicle position, and preceding vehicle type.
期刊介绍:
Transportation Research Part D: Transport and Environment focuses on original research exploring the environmental impacts of transportation, policy responses to these impacts, and their implications for transportation system design, planning, and management. The journal comprehensively covers the interaction between transportation and the environment, ranging from local effects on specific geographical areas to global implications such as natural resource depletion and atmospheric pollution.
We welcome research papers across all transportation modes, including maritime, air, and land transportation, assessing their environmental impacts broadly. Papers addressing both mobile aspects and transportation infrastructure are considered. The journal prioritizes empirical findings and policy responses of regulatory, planning, technical, or fiscal nature. Articles are policy-driven, accessible, and applicable to readers from diverse disciplines, emphasizing relevance and practicality. We encourage interdisciplinary submissions and welcome contributions from economically developing and advanced countries alike, reflecting our international orientation.