{"title":"Adaptive Traffic Signal Control Through Time Period Division and Deep Reinforcement Learning","authors":"Baolin Gong, Wenxing Zhu","doi":"10.1109/CAC57257.2022.10055131","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an adaptive traffic signal control model combining time period division and deep reinforcement learning to improve the efficiency of traffic by dynamically changing the traffic phase duration according to the real-time situation. In our model, a day-time period is divided into two overlap period parts representing the morning situation and the evening situation, then the deep reinforcement learning algorithm-TD3 is selected to train the corresponding agent in each part, and finally a fuzzy method is used to coordinate these two agents at different time. In order to get better performance, we make some improvements in TD3. We improve the algorithm’s experience-replay mechanism and use some tricks in training. Simulation results shows that our model can effectively reduce vehicles’ accumulative waiting time, queue length and alleviate CO2 emission.","PeriodicalId":287137,"journal":{"name":"2022 China Automation Congress (CAC)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 China Automation Congress (CAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAC57257.2022.10055131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we propose an adaptive traffic signal control model combining time period division and deep reinforcement learning to improve the efficiency of traffic by dynamically changing the traffic phase duration according to the real-time situation. In our model, a day-time period is divided into two overlap period parts representing the morning situation and the evening situation, then the deep reinforcement learning algorithm-TD3 is selected to train the corresponding agent in each part, and finally a fuzzy method is used to coordinate these two agents at different time. In order to get better performance, we make some improvements in TD3. We improve the algorithm’s experience-replay mechanism and use some tricks in training. Simulation results shows that our model can effectively reduce vehicles’ accumulative waiting time, queue length and alleviate CO2 emission.