基于时间段划分和深度强化学习的自适应交通信号控制

2022 China Automation Congress (CAC) Pub Date : 2022-11-25 DOI:10.1109/CAC57257.2022.10055131

Baolin Gong, Wenxing Zhu

{"title":"基于时间段划分和深度强化学习的自适应交通信号控制","authors":"Baolin Gong, Wenxing Zhu","doi":"10.1109/CAC57257.2022.10055131","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an adaptive traffic signal control model combining time period division and deep reinforcement learning to improve the efficiency of traffic by dynamically changing the traffic phase duration according to the real-time situation. In our model, a day-time period is divided into two overlap period parts representing the morning situation and the evening situation, then the deep reinforcement learning algorithm-TD3 is selected to train the corresponding agent in each part, and finally a fuzzy method is used to coordinate these two agents at different time. In order to get better performance, we make some improvements in TD3. We improve the algorithm’s experience-replay mechanism and use some tricks in training. Simulation results shows that our model can effectively reduce vehicles’ accumulative waiting time, queue length and alleviate CO2 emission.","PeriodicalId":287137,"journal":{"name":"2022 China Automation Congress (CAC)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive Traffic Signal Control Through Time Period Division and Deep Reinforcement Learning\",\"authors\":\"Baolin Gong, Wenxing Zhu\",\"doi\":\"10.1109/CAC57257.2022.10055131\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose an adaptive traffic signal control model combining time period division and deep reinforcement learning to improve the efficiency of traffic by dynamically changing the traffic phase duration according to the real-time situation. In our model, a day-time period is divided into two overlap period parts representing the morning situation and the evening situation, then the deep reinforcement learning algorithm-TD3 is selected to train the corresponding agent in each part, and finally a fuzzy method is used to coordinate these two agents at different time. In order to get better performance, we make some improvements in TD3. We improve the algorithm’s experience-replay mechanism and use some tricks in training. Simulation results shows that our model can effectively reduce vehicles’ accumulative waiting time, queue length and alleviate CO2 emission.\",\"PeriodicalId\":287137,\"journal\":{\"name\":\"2022 China Automation Congress (CAC)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 China Automation Congress (CAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CAC57257.2022.10055131\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 China Automation Congress (CAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAC57257.2022.10055131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种结合时间段划分和深度强化学习的自适应交通信号控制模型，通过根据实时情况动态改变交通相位持续时间来提高交通效率。在我们的模型中，将一个白天时间段划分为代表早晨和晚上情况的两个重叠时间段部分，然后选择深度强化学习算法- td3在每个部分训练相应的智能体，最后使用模糊方法在不同时间协调这两个智能体。为了获得更好的性能，我们在TD3中做了一些改进。我们改进了算法的经验重放机制，并在训练中使用了一些技巧。仿真结果表明，该模型可以有效地减少车辆的累计等待时间和排队长度，减少CO2排放。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Adaptive Traffic Signal Control Through Time Period Division and Deep Reinforcement Learning

In this paper, we propose an adaptive traffic signal control model combining time period division and deep reinforcement learning to improve the efficiency of traffic by dynamically changing the traffic phase duration according to the real-time situation. In our model, a day-time period is divided into two overlap period parts representing the morning situation and the evening situation, then the deep reinforcement learning algorithm-TD3 is selected to train the corresponding agent in each part, and finally a fuzzy method is used to coordinate these two agents at different time. In order to get better performance, we make some improvements in TD3. We improve the algorithm’s experience-replay mechanism and use some tricks in training. Simulation results shows that our model can effectively reduce vehicles’ accumulative waiting time, queue length and alleviate CO2 emission.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 China Automation Congress (CAC)

自引率

0.00%

发文量