基于时间段划分和深度强化学习的自适应交通信号控制

Baolin Gong, Wenxing Zhu
{"title":"基于时间段划分和深度强化学习的自适应交通信号控制","authors":"Baolin Gong, Wenxing Zhu","doi":"10.1109/CAC57257.2022.10055131","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an adaptive traffic signal control model combining time period division and deep reinforcement learning to improve the efficiency of traffic by dynamically changing the traffic phase duration according to the real-time situation. In our model, a day-time period is divided into two overlap period parts representing the morning situation and the evening situation, then the deep reinforcement learning algorithm-TD3 is selected to train the corresponding agent in each part, and finally a fuzzy method is used to coordinate these two agents at different time. In order to get better performance, we make some improvements in TD3. We improve the algorithm’s experience-replay mechanism and use some tricks in training. Simulation results shows that our model can effectively reduce vehicles’ accumulative waiting time, queue length and alleviate CO2 emission.","PeriodicalId":287137,"journal":{"name":"2022 China Automation Congress (CAC)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive Traffic Signal Control Through Time Period Division and Deep Reinforcement Learning\",\"authors\":\"Baolin Gong, Wenxing Zhu\",\"doi\":\"10.1109/CAC57257.2022.10055131\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose an adaptive traffic signal control model combining time period division and deep reinforcement learning to improve the efficiency of traffic by dynamically changing the traffic phase duration according to the real-time situation. In our model, a day-time period is divided into two overlap period parts representing the morning situation and the evening situation, then the deep reinforcement learning algorithm-TD3 is selected to train the corresponding agent in each part, and finally a fuzzy method is used to coordinate these two agents at different time. In order to get better performance, we make some improvements in TD3. We improve the algorithm’s experience-replay mechanism and use some tricks in training. Simulation results shows that our model can effectively reduce vehicles’ accumulative waiting time, queue length and alleviate CO2 emission.\",\"PeriodicalId\":287137,\"journal\":{\"name\":\"2022 China Automation Congress (CAC)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 China Automation Congress (CAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CAC57257.2022.10055131\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 China Automation Congress (CAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAC57257.2022.10055131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种结合时间段划分和深度强化学习的自适应交通信号控制模型,通过根据实时情况动态改变交通相位持续时间来提高交通效率。在我们的模型中,将一个白天时间段划分为代表早晨和晚上情况的两个重叠时间段部分,然后选择深度强化学习算法- td3在每个部分训练相应的智能体,最后使用模糊方法在不同时间协调这两个智能体。为了获得更好的性能,我们在TD3中做了一些改进。我们改进了算法的经验重放机制,并在训练中使用了一些技巧。仿真结果表明,该模型可以有效地减少车辆的累计等待时间和排队长度,减少CO2排放。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Adaptive Traffic Signal Control Through Time Period Division and Deep Reinforcement Learning
In this paper, we propose an adaptive traffic signal control model combining time period division and deep reinforcement learning to improve the efficiency of traffic by dynamically changing the traffic phase duration according to the real-time situation. In our model, a day-time period is divided into two overlap period parts representing the morning situation and the evening situation, then the deep reinforcement learning algorithm-TD3 is selected to train the corresponding agent in each part, and finally a fuzzy method is used to coordinate these two agents at different time. In order to get better performance, we make some improvements in TD3. We improve the algorithm’s experience-replay mechanism and use some tricks in training. Simulation results shows that our model can effectively reduce vehicles’ accumulative waiting time, queue length and alleviate CO2 emission.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信